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EDITORIAL 


It seems appropriate for an incoming Editor 
to share some initial reflections with the Journal’s 
readers and authors. The Journal of Applied 
Psychology was founded in 1917 by G. Stanley 
Hall, who served as its Editor until 1920. Subse- 
quent Editors have included James Porter (1921- 
1942), Donald Paterson (1943-1954), John 
Darley (1955-1960), and Kenneth E. Clark 
(1961-1970). With the present volume the 
Journal has a new Editor for only the sixth time 
in its 53-year history! My first reflection, then, 
is on the long tradition of the Journal and on my 
responsibility to maintain the historical continu- 
ity, while encouraging an editorial policy tuned 
to the times and open to new departures. 

At this writing, I have already spent 10 months 
processing manuscripts for the current volume. 
During this difficult “breaking in” period I have 
tried to formulate some tentative ideas about 
journal policy. These ideas might give some clues 
as to the kinds of manuscripts we would like 
to encourage. The Journal of Applied Psychology 
has been the primary outlet, published by the 
American Psychological Association, for research 
in various fields of personnel and organizational 
psychology, as well as engineering and consumer 
psychology. This tradition will continue. It is 
our intention, however, to encourage a broadened 
base of articles in applied psychology with 
increasing emphasis upon research on socially 
relevant problems. 

The times have generated critical questions 
that need to be answered by sound research. 
Many of the problems and concepts of the real 
world can be translated into psychological lan- 
guage and put to empirical test. Much of what 
was formerly the province of conventional 
wisdom, traditional morality, or political action 
can be dealt with by the psychologist with his 
quantitative empiricism and from his theoretical 
base. Similarly, there is a need for validation of 
concepts and generalizations developed in the 
laboratory or in hypothetical situations, or with 
college sophomores, against the hard social reali- 
ties which surround us, Can we apply the re- 
search knowledge and methods already developed 
toward the solution of pressing problems? Can 
we be sufficiently innovative to develop new 
methods for dealing with these research problems, 


1 Darley (1968) has Provided an excellent review 
of the Journal's first 50 years, During the first two 
years of the Journal, L. R, Geissler and John W. 
Baird served as co-editors with Hall. William F. 
Book served as co-editor with Porter from 1921 
until 1927. 


often in an action-oriented setting, with sufficient 
scientific rigor to allow dependable generaliza- 
tions? We have heard the challenge from many 
quarters. Should not the Journal of Applied 
Psychology be a primary outlet for research 
addressed to these questions? 

There are already signs that many of the tradi- 
tional methods of industrial psychology are 
increasingly being applied in a wider arena of 
social problems. Research on selection, training, 
criterion development, job attitudes, group influ- 
ences, motivation, communication, organizational 
behavior, working conditions—all of these have 
relevance to today’s issues. A modern Journal of 
Applied Psychology can contribute by encouraging 
publication of research in a wide variety of 
settings in our society. In addition to industry, 
such settings would include problems of urban 
areas, criminal justice systems, health systems 
and institutions, transportation and defense sys- 
tems, space and other new environments, educa- 
tional systems, etc. If the Journal is truly to 
concern itself with psychological research and 
Knowledge applied to life, we need to bring 
various psychological disciplines into a more 
cohesive relationship moving toward the under- 
standing and solution of applied problems in 
these settings. 

Examples of work which would be relevant 
to these pages would surely include: methods 
of upgrading the disadvantaged in employment 
and educational settings; problems of test bias 
with minority groups; police selection, training, 
and community relations; job restructuring, in- 
cluding improved utilization of paraprofessionals; 
changing occupational roles of women; adapta- 
tion to technological innovation; selection and 
training for, and adaptation to jobs in foreign 
cultures (including Peace Corps assignments); 
studies of value conflict between youth and 
current institutions; problems of stress in jobs 
with sensory underload (or overload); longitudi- 
nal studies of planned and unplanned change and 
the effects of intervention programs; factors 
related to more effective decision making in a 
variety of situations. 

At the same time there is a need for an inte- 
grated body of generalized, verifiable, substantive 
knowledge. We will favor conceptually based 
research—studies which emerge from concepts 
or theories which have been or are being devel- 
oped, particularly if the theory appears relevant 
to applied issues. We want to avoid an overem- 
phasis on techniques and on articles showing 
that what we have done well before, we can 
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do again with still another sample. Much of 
what gets submitted provides no explanation of 
what has been learned to serve as a basis for 
generalization. , 

We would encourage authors not to send in 
articles prematurely on some fragment or limited 
aspect of their research. Rather, we feel the 
Journal of Applied Psychology should encourage 
articles describing more comprehensive research, 
in some cases organized statements resulting from 
several experiments. This may mean some longer 
articles, but the author is cautioned that concise 
writing is still required with introduction, litera- 
ture references, and discussions sharply focused 
on the main points of the article. In the competi- 
tion for precious journal space, each manuscript 
will be evaluated on four primary criteria: 
(a) significance in contributing new knowledge 
to the field, (b) technical adequacy, (c) appropri- 
ateness for the Journal of Applied Psychology, 
(d) clarity of presentation. We want to dis- 
courage articles of limited interest. We will not 
publish simple test validation studies, for ex- 
ample, unless some new concept or important 
methodological or substantive development is 
reported; nor is a study of the best hammer size 
for a given size nail likely to excite us. 

We recognize the communication sharing func- 
tion of the Journal and expect to have a Short 
Notes section for brief reports, such as replica- 
tion studies, methodological notes, presentation 
of ideas or theoretical discussion, especially inter- 
esting pilot studies, etc. These will normally be 
confined to two journal pages. 

I realize the above is a highly idealized state- 
ment of policy. As Darley (1968) has com- 
mented, “Editors may propose, but authors and 
the availability of manuscripts dispose.” An APA 
journal is Tepresentative of the research of APA 
members rather than of the aspirations of its 
editors. Authors and readers have justifiable 
2xpectations which place constraints on new 
departures. Any change is likely to be a gradual 
srocess and must reflect the changing nature of 
the field and many other social forces. We have 


tried to anticipate some of these. 


The reader may be interested in our experience 


ts submitted during the first 10 
0. The 286 manuscripts submitted 
during this period run heavily toward the area 
of job satisfaction—motivation, with one in every 
five manuscripts falling into this category. The 
second largest category of submissions has been 
in the area of human performance and engineer- 
ing psychology, followed by selection Studies, 
organizational studies, and research on group 


processes, consumer psychology, and management. 


The number of manuscripts submitted by if 
end of the year will undoubtedly exceed the 
total of 311 submitted last year. The ace 
rate is running about 27%. Since this inc uin 
acceptances made provisional on extensive At 
sions, it is likely that the “actual” acceptal fs 
rate will be lower. Of the first 200 ae 
received, only one was accepted “as is.” ld 
underscores our feeling that the Editor ee 
be more than a gatekeeper. Our aim is TONG t 
careful editorial feedback on each anuson 
including rejected as well as “accepted at 
revision” manuscripts. We have been ae oe 
the expressions of appreciation from our EP i, 
even from those to whom we must regre 
say, “No.” 4. 4 al 

‘An organized enterprise like a scientific jot | 
is possible because of the cooperakion, pg 
altruism of a great many people. One im 
most pleasant duties as Editor has bet. 
appoint the new Board of Consulting Editol 
As the new masthead indicates, we have scent 
tinguished Board and they have been see l 
during this period. All accepted appa thei! 
without qualification and have contribute 1 
time and talent generously. The person 
professional pleasure gained in my intera as 
with this network of reviewers and author rts: 
been one of the rewards of this first years av e 
The Board was chosen to represent the as the 
traditional areas of the Journal as well @ 7 
increased scope to which we aspire. Wa 5 of 
representation from many different ki 
institutions. ro 

I have also made use of a large nurs will 
special reviewers among our colleagues aill pe 
continue to do so. Their contributions 
acknowledged during the year. 

This is as good an opportunity as I Kenne 
to thank my illustrious predecessor, Dr: of mY 
E. Clark for his counsel at the time DF 
appointment. The Journal owes much o ye?" 
Clark for his stewardship over the past 1 ny ol 
and, of course, we will be continuing ™@ “orf 
the policies he established. At APA, Helen an 
Managing Editor of APA publications, ce 
Marlyn Kahn, our Technical Editor, have ical 
especially helpful. Finally, at the Ame thi 
Institutes for Research, I am assisted in hay’ 
task by Pat Smyke and Sally Stein, o an 
helped me get this operation organized 
launched during this transition period. 
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DEVELOPMENT OF FIRST-LEVEL BEHAVIORAL 
JOB CRITERIA 


LAWRENCE FOGLI, CHARLES L. HULIN, ax» MILTON R. BLOOD * 


University of California at Berkeley 


A procedure for the development of behaviorally anchored rating scales was 
carried out in order to establish reliable job criteria for grocery clerks. 
Examples of checker behavior were obtained by interviewing staff personnel 
of a large Western grocery chain. Dimensions of the job were defined by the 


researchers, and the grocery staff allocated each behavioral example to the 
dimension in which it belonged. Items with high agreement of dimension 
allocation were retained and then rated on a scale of desirability. Final scale 
reliabilities of items ranged above .97. These procedures provide information 
with implications for policy development and worker training throughout the 


organization. 


f The task of developing reliable and valid 
job criteria is one of the most challenging and 
desired objectives in contemporary industrial 
’ psychology. Dunnette (1966) has emphasized 
the need for developing job-behavior descrip- 
tions in order to establish relationships be- 
tween specific job behaviors and global di- 
mensions of success. At the same time, pre- 
dictor batteries can be developed which are 
aimed at predicting these specific behaviors 
rather than global measures (which are nor- 
è mally predicted with a consistent, albeit dis- 
couraging, lack of success). The relevance of 
the specific behaviors to more global criterion 
measures and predictors should be determined 
empirically, The present study systematically 
develops criteria for observing and evaluating 
the behavior of grocery Checkers, l 
con, and Kendall’s (1963) development 
=e of nursing performance provides a 
r Cure for obtaining dimensions and spe- 
ioe re of job behavior based on descrip- 
n actual behavior, The objective for 
those authors was to develop rating scales 
that are equivalent from rater to rater and 
from occasion to occasion in terms of dimen- 
a levels of job performance, even 
en the task was complex and varied. The 
Present study strives to establish criteria 
scales with high interrater reliability that can 
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be used by raters to judge the magnitudes of 
attributes of workers. 

The goal of such research is to develop 
synthetic criteria which may be substituted 
for actual specific behavior with little loss of 
information, Scales and items must indicate 
the same quality of performance among raters 
and among divisions of a company. If a sam- 
ple of potential raters judged the relative 
quality of performance indicated by each of a 
set of items, we would expect that the rank 
order, the location, and the spread of the 
items would be highly similar to the same 
statistics computed for the set of items from 
another sample of potential raters. Tdeally, 
this means that the regression equation re- 
lating the scale values of the items obtained 
from one sample to the scale values obtained 
from a second sample would have an intercept 
of 0.0 and a slope of 1.0. That is, 41; = Xai, 
where Xi; is the scale value of the ith item in 
the first sample and X»; is the scale value of 
the same item computed from the second sam- 


ple under the constraint that S*,,; = S°, 


METHODS 


The initial step was to conduct interviews with 
staff personnel of a grocery chain concerning the 
purpose of a grocery checker’s job at the same time 
critical incidents of good-poor behavior of checkers 
were obtained. The most frequent behaviors were 
selected and the grocer’s terminology was retained. 

Three groups of managers, assistant managers, and 
department heads were selected from three districts 
of a large grocery store chain in the San Francisco 
Bay area. The three groups consisted of staff per- 
sonnel from 10 stores in each district, which differed 
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on the basis of geographical location, as designated 
by company districts—A, B, C. The content of the 
items and job description is limited to behavior 
performed in the checkstand. S ; 

One hundred and twenty staff personnel were 
available for participation in the procedures for the 
development of rating scale criteria. Interviewing 
was terminated after a total of 43 interviews for 
obtaining behavioral examples and critical incidents 
of checkstand behavior because of redundancy in 
behavioral incidents in the job domain, 

The interviews were structured so that the inter- 
viewer recorded the verbatim responses to the struc- 
tured questions. The questionnaire commenced with 
the following discussion designed to convince the 
interviewees of the importance of the study and 
their cooperation: 


We are making a study of grocery checking 
trying to learn in detail just what successful 
as a grocery clerk includes. In order to facilitate 
the performance of grocery clerks for checkout 
stand procedures, we need to know what aspects 
of checking performance make an especially effec- 
tive or ineffective clerk, Managers, assistant mana- 
gers, and department heads such as yourselves are 
constantly observing and evaluating the work of 
the clerk. Therefore, you are in an unusually good 
Position to report effectiveness, 


and 
work 


Participation in the study w: 
convincing the respondents of the usefulness of both 
the expected results and procedures of the study. 
Interviewees were informed that the incidents re- 
ported would serve as characteristic levels of dif- 
ferent dimensions of the job, and that later they 
would be asked to allocate those behavioral 
dents to different dimensions, 

_ Questions about primary and specific aspects of 
job behavior were similar to the critical incident 
method of Flanagan (1954), Many respondents ga 


as further assured by 


inci- 


ve 


TABLE 1 


AGREEMENT OF ALLOCATION or Trems 


TO CHARACTERISTICS 


: , High Lower 
Characteristic agreement | agreement 
(Above60%)| (406 46-59%) 
Conscientiousness 10 | 5 
Knowledge and judgment 37 13 
lin human relations 36 5 
Skill in operation of i 
register 34 3 
Skill in bagging 36 1 
Organizational ability of 
checkstand work 13 11 
Skill in monetary 
transactions | 34 0 
Observational ability 11 2 
Total no. of items 211 40 
a n 


general behavioral examples, even to such E, 
ments as “describe specifically the behavior E 
would expect from the best checker you have eva 
known.” Global descriptions of “he is an aaa 
checker,” “he is friendly,” and “he is a Be 
checker” are too general and are unacceptable a 
behaviorally anchored scales. The pe onan 
able to provide more definite and well though e 
examples when the generality of their responses 
indicated to them. ; "x 

maar the questions designed to obtain specific 
behaviors were: 


1. Please describe the critical characteristics of 
the behavior of a grocery checker that make 
ffective—be specific. $ 
Paani caries the specific behavior sf ntl 
extremely ineffective checker. Remember poe 
in your store that displayed vividly poor perior a 
ance. It might help you to describe the are 
you would expect from the worst checker you ha 
ver known. í 
i 3. Now, think of the last time you saw pe 
the checkers do something that was very ra 
cant in displaying effective performance AP es- 
checkstand. How did this action result in a A 
ing the effectiveness of the clerk’s overa 
formance? 


an 


the 
; a s to 
After analyzing the verbatim response 


structured questionnaire, the first two authori i 
gorized the behavioral incidents into eight 
dimensions. Store personnel were not asked i 
the job dimensions so that demands on th 
ing time would not be excessive. dime 
Staff personnel were presented with place 
defined by the researchers and instruct? nsii for 
each behavioral incident into the dime incident 
which it was an example. Each behavior 
was judged by 15 independent raters. # oe 
was climinated if there was not majority #° 
as to the dimension to which it belonged- 


ort! 
J the ite? 

A final rating format composed of all et wae 
that were maintained in the eight dimens ae 


g 
sent to all 120 Ss, and 97 Ss completed i oe 
rated the degree to which each item ice allt 
poor performance on a 7-point scale. Large 5 n th 
deviations indicated a lack of agreement On 
meaning of the behavior, and items were arbit! 
eliminated if their standard deviations were iteh 
greater. If items had similar mean values, the } el 
with the smallest standard deviation was retain 
Items were selected for each categorical scale A 
these criteria. The mean scale positions of each F 
assigned by personnel in districts A, B, and C e 
intercorrelated to give estimates of scale reliabili : 


arta 
s 0 


This procedure assumes that the rater can m 
inferences from observations about the interrelati af 
of behaviors and that a group of HAMAR EES a 
sistant managers, and department heads will sa | 
well-differentiated and standardized judgments a 
checking performance. $ ii! 

The behavioral examples that were used in pe 
final rating format were reworded as expected ”, 


. 
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TABLE 2 
INTERCORRELATIONS, STANDARD DEVIATIONS, AND REGRESSION DATA OF MEAN SCALE 
Positions ASSIGNED BY THREE DISTRICTS FOR SEVEN SCALES 
| s | 
co ah, | ee | Intercepts* 
Characteristic No. |Dis-| Telatons g o £ 
items | trict =] a | 
A |B | A | B €E | Ape @ 
| | 
| 
A 1.08 1.14 | 1.03 | J —.33 -01 
Conscientiousness 13 B | .98 .93 89 | 84 .37 .50 
C | 97 | .97 1.01 | .92 | 1.06 | —.28 | —.08 
A | | 1.74 | 07 | 90 | 30| 66 
Knowledge and judgment 24 B | .98 | 1.76 99 | 92 | —AL A3 
C .99 | .99 1.90 | 1.08 | 1.07 | —.58 | —.36 
| | 
| | | 
A 4.88 | 1.57 1.04 OF 09 33 
Skillin human relations 16 B | .98 4.61 | 148 93 90 08 35 
C } .99 | 98 | 4.72 | 1.61 | 1.01 | 1.07 —.21 | —.21 
Skill in operation A 5.04 | 1.49 | 1.13 | 1.04 —.29 Ys) 
of register 26 | B | 98 4.71 | 1.32 | 86 92) 36 „| 30 
C | 98] 98 | 4.78 | 141 -93 | 1.05 12: | =.17 
A 4.27 | 1.93 | 1.07 | 1.01 | —.10 | —.10 
Skill in bagging 21 B 99 4.09 | 1.78 | .91 93 | .02 24 
C | .99 | .99 | 4.14 | 1.90 98 | 1.05 —.03 | —.14 
A 4.79 | 1.79 | 1.07 | 95 —.18} 39 
Organizational ability 30 B | .99 4.64 | 1.65 92 88 25) .22 
of checkstand work C | 99] .99 | 4.62 | 1.86 | 1.03 | 1.11 —.31 | —.53 
Skill in monetary A 4.36 | 1.72 1.07 95 | —.02 AL 
transactions 32 B | .99 4.09 | 1.58 | 1.03 91 | —.40 31 
C | .98 | .99 | 4.16 | 1.78 | 1.01 | 1.01 | —.24 | -03 


^ Regression coefficients and intercepts are given for predicting the variable d 


by the column. 


haviors rather than actual behaviors. This restate- 
ment of the items followed the procedures of the 
Smith and Kendall (1963) study. Instead of state- 
ments like “When everybody is checking, this 
checker leaves the checkstand to get the customer an 
item,” statements consisted of expectations such as 
“If everybody is checking, this checker can be ex- 
pected to leave the checkstand to get the customer 
an item.” According to Smith and Kendall the 
expected-behavior format will improve the final 
ratings by making the predictions 


(a) so concrete that, in view of previous agree- 
ment by the peer group, central tendency or judg- 
ing effects will be minimized; and (b) so verifiable 
that insight, judgment, values, etc. of the rater are 
potentially challenged if later behavior of the 
ratee would fail to confirm the prediction [p. 151]. 


RESULTS 
The dimensions that proved to be meaning- 
ful after the judges allocated items to dimen- 


gnated by the row by the variable designated 


sions are listed in Table 1. Even though the 
researchers, rather than job incumbents, de- 
fined the dimensions of the job, the total 
number of items (211) in the high agreement 
column of Table 1 is indicative of the useful- 
ness of those dimensions. The rating ques- 
tionnaire was composed of items of at least 
60% agreement except for Categories 1, 6, and 
8. These categories contained some items with 
only 50% agreement. 

Table 2 shows scale reliabilities of items 
retained in the final rating format. The mean 
score of each of the retained items for the 
three districts was intercorrelated to obtain 
scale reliabilities. The lowest scale reliability 


he reader should be aware that pooling item 
scores leads to more accurate estimates of scale 
values in this assessment of scale reliability. With 
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Extremely good 
performance 
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gal 


i ji e expected to 
By knowing the price of items, this checker would be expe 
look for mismarked and unmarked items. 


cS Whe antly 
$ You can expect this checker to be aware of items that constar 
Good performance fluctuate in price. 
3 si e cans—NO- 
—— You can expect this checker to know the various sizes of cans 
| 303, No.2, No. 23. 
5 i+ - if the item 
tl nay R is checker w ask the other clerk if t 
Slightly good W Bon in doubt, this checker would a ec 
performance | is taxable. ‘ 
: checker a dis- 
Aea checker can be expected to verify with annie: eT A 
| crepancy between the shelf and the marked price before g 
that item. 
Neither poor nor 5 
good performance 4 — 
' ing, this checker 
—— When operating the quick check, the lights are flashing, this € 
can be expected to check out a customer with 15 items. 
Slightly poor 3 = 
performance P a8 
—— You could expect this checker to ask the customer the price 
item that he does not know. a 
fy 4 2 jonehins, quae be expected 
—— In the daily course of personal relationships, may be eckel 
Poor performance | linger in long conversations with a customer or another ch 
y Lo 
ai k ofl 
x ploc 
=> In order to take a break, this checker can be expected t0 
the checkstand with people in line. 
Extremely poor 
performance 1 
Fic. 1. An example scale for the knowledge and judgment dimension. 
ales 
igs s t scal 
was 97. Means, standard deviations, and re- Figures 1 and 2 present sana of the 
ao Mforiation are also given for each made from items which represent nent an 
. z . A n 
istrict in Table 2. An examination of these dimensions—knowledge and judgment, @ 
data substantiate the similarity of rating be- 


median of .05. 


30 Ss per sample, this is the em 
of adjusting the average interrater 
Spearman-Brown formula with a 
tor of 30 (see Noble, 1955, for a 
scale reliability estimation). 


a 
pirical equivalent 
correlation by the 
multiplicative fac- 
discussion of such 


k. 
organizational ability of checkstand hol 
Category 8, observational ability, was dis’ 
nated as a useful scale because of poe 
persion of behavioral items along the #8 
poor continuum. 


nf 
*A complete list of items retained for each of A 
seven scales and the scale values of the itema 
been deposited with the National Auxiliary Pu mi 
tions Service. Order Document No. 01257 ie 
the National Auxiliary Publications Service of i 
American Society for Information Science, c/o Ca 
Information Sciences, Inc., 909 3rd Avenue, N í 
York, New York 10022. Remit in advance $5.00 J 
photocopies or $2.00 for microfiche and make da , 
payable to Research and Microfilm Publications, 


BEHAVIORAL JOB CRITERIA 7 


CONCLUSIONS 


The method of criterion scale development 
reported in this article, which involves staff 
members in each step of the development, 
provides behavioral scales that are specific 
and apparently nonambiguous. In the case of 
the specific job studied, seven major and dis- 
criminable dimensions were obtained: (a) 
conscientiousness, (b) knowledge and judg- 
ment, (c) skill in human relations, (d) skill in 
operation of register, (e) skill in bagging, (f) 
organizational ability of checkstand work, and 
(g) skill in monetary transactions. The pro- 
cedure provides carefully designated behav- 
ioral incidents for each scale. The examples of 
behavior are in the grocer’s terminology, and 
there is excellent agreement of the allocation 
of incidents, discrimination, and high scale 
reliability. The degree of colinearity among 
these scales, when used as rating devices, re- 
mains to be determined. 

This method eliminates ambiguity by hav- 
ing those who will eventually use the rating 
criteria develop the scales with their own ob- 


Extremely good 7 
performance 


servations about behavior. Furthermore, the 
staff allocates incidents to dimensions and 
makes final judgments about the level of per- 
formance each behavior indicates. Since high 
agreement is obtained in the three samples, 
the scales may serve as standards to evaluate 
specific behaviors that can be expected in the 
checkstand. The reliability of ratings of per- 
formance may increase since the dimensions 
and incidents are easily and accurately dis- 
tinguished from one another. Because there 
are multiple ratings, raters are not restricted 
to the gross generalizations which are neces- 
sary when global performance evaluations are 
made. The success of a job incumbent can be 
evaluated in terms of specific behaviors and 
dimensions. This behavioral information 
should prove more useful to both the worker 
and his immediate supervisor than general 
statements of overall performance level. 
There are many items with standard devi- 
ations less than 1.5 which were eliminated 
from the scales because their mean scores were 
similar to other items on the same dimension. 


6 


Good performance 


Slightly good 
performance 


an 


~~ This checker would organize the order when checking it out by 


placing all soft goods like bread, cake, etc. to one side of counter; all 
meats, produce, frozen foods, to the other side, thereby leaving the 
center of the counter for can foods, boxed goods, etc. 


When checking, this checker would separate strawberries, bananas, 
cookies, cakes, and breads, etc. 


You can expect this checker to grab more than one item at a time 
from the cart to the counter. 


After bagging the order and customer is still writing a check, you 
can expect this checker to proceed to the next order if it is a small 


This checker may be expected to put wet merchandise on the top of 


This checker can be expected to lay milk and by-product cartons on 
their sides on the counter top. 


This checker can be expected to damage fragile merchandise like soft 
goods, eggs, and light bulbs on the counter top. 


a a 
Neither poor nor 
good performance 

order. 

3 — 
Slightly poor 
performance the counter. 
Poor performance 2 = 
Extremely poor 
performance 1 


Fic. 2 


. An example scale for the organizational ability of checkstand work dimension. 
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These items still provide reliable job criteria. 
This company could use these items plus the 
behavioral scales for development of a train- 
ing program for checkers. A training program 
based upon the behavioral incidents and di- 
mensions would inform checkers about the ex- 
pected behaviors for their job. Since the level 
of the behaviors is highly agreed upon from 
store to store, a uniform training program 
could be developed for all of the stores. 

An additional use of the behavioral items 
would be to ask trainees and/or incumbent 
checkers to rate the items on the 7-point good— 
poor scale. Their ratings of items could be 
compared to the ratings made by managers. 
Discrepancies could be discussed to teach the 
checker what he should do. Training programs 
could be modified to take account of the re- 
sults of such item ratings, 


Another benefit of this procedure is that it 
provides information about specific behaviors 
that are not highly agreed upon among staff 
personnel. Items with very large variances 
can be investigated by the organization to 
change or develop policy about such behavior. 
The following are examples of behavioral inci- 
dents which had large variances on the good- 
poor continuum in this study: (a) When cus- 
tomer feels that the total bill was too high, 


you can expect this checker to rering the order 
(S? = 3.20). (b) When dealing with an out 
ring, this checker would subtract overcharged 
items from another item in the same depart- 
ment—a grocery item from a grocery item 
(S* = 3.02). (c) You can expect this checker 
not to separate wet produce from other ic 
chandise on the table top (S? = 2.53). (d) 
You can expect this checker to check items 
without looking at the prices because of apie 
orizing prices (S? = 3.50). (e) This moog = 
can be expected not to bother to change n 
partmental sales ring-up error but would igr 
nore the mistake (S? = 2.43). For these ber 
haviors and others with large variances, the — 
organization may wish to enunciate or com — 
municate its policy more clearly. 
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HUMAN FACTORS IN OPERATING SYSTEMS RELATED TO 
DELAY AND DISPLACEMENT OF RETINAL FEEDBACK * 


VERNON PUTZ axo KARL U. SMITH? 


Behavioral Cybernetics Laboratory, University of Wisconsin 


A central question of the theory of human factors in man-machine relation- 
ships concerns the way in which operators correct for and adapt to spatial 
noncompliances in feedback between eye, head and manual movements, and 
visual input. Therefore, the nature of response to displaced and delayed vision 
represents also a theoretical issue for human factors theory of machine opera- 
tion. Theories of response to altered retinal feedback—i.e., associative learning 
doctrines and the feedback-compensation hypothesis—were evaluated in relation 
to their application in defining human factors principles in machine and per- 
ceptual training designs. Using 12 Ss, controlled comparisons were made of the 
relative effects of reversed and delayed feedback of head and eye movements 
under conditions in which head movements could not compensate altered 
feedback of eye movements and vice versa. Accuracy of ocular tracking was 
affected more by reversal of retinal feedback of eye movements than head 
tracking was changed by reversal of visual effects of head movements, Eye 
tracking was reduced to highly irregular, unstable vision, whereas saccadic 
head movements retained a relatively high level of tracking accuracy. Head-eye 
tracking combined the effects of reversal of feedback that were found with 
each of the specific types of movement. Delay of retinal feedback of the 
movement systems at intervals of 0.0, 0.2, 0.4, 0.6, 0.8, and 1.5 sec. caused 
S progressive elimination of unstable ocular guidance, but also greatly increased 
the error of such tracking. Other results indicated that there was little or no 
learned adaptation to the reversed and delayed vision produced by head and 


eye movements. The findings give support to a behavioral cybernetic interpre- 
tation of the guidance factors in man-machine and perceptual systems relation- 


ships by showing that the effects 


of altered feedback in machine and systems 


operation are determined by movement capabilities in compensating displace- 


ments and delays in sensory input. The results 


also suggest that visual impair- 


ments may be produced by delays in the retinal feedback effects of eye and 
head movements and that these defects may require dynamic methods of 
optometric diagnosis and training for their measurement and correction. 


This experiment assesses the validity of the 
theories of adaptation to displaced and de- 
layed retinal feedback in behavior relative to 
the human factors problems of design of ma- 
chines and tools, and of optometric diagnostic 
and visual training and rehabilitation devices. 
Although the events of optical distortion and 
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displaced vision have been studied extensively 
in the attempt to justify the learning of space 
perception (Freedman, 1968), the relevance 
of the results of such research to the human 
factors problems of design and operation of 
devices and perceptual measurement and 
training systems has not been made clear. The 
purpose of this research is to evaluate the va- 
lidity of past theory of perceptual adaptation 
in relation to a behavioral interpretation of 
displaced and delayed vision and to suggest 
applications of the more accurate view to 
current theory of human systems design. 

In the behavioral feedback approach to be 
defined here, the core experimental problem 
of displaced and delayed vision concerns the 
relative effect of such feedback alteration on 
head and eye movements. The theoretical issue 
is whether adaptation to displacement and 
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delay of retinal feedback of these two move- 
ment mechanisms is determined by movement 
compensation of the altered feedback, or the 
result of learned association between visual 
and tactual experience. To test this specific 
question, the present experiment investigated 
the problem of response to reversal of the 
retinal image in a new way by dynamically 
isolating the retinal effects of delay and 
space displacement of eye and head move- 
ments. The purpose of the experiment was 
twofold: (a) to test the feasibility of de- 
velopment and use of controlled systems 
methods in the measurement and diagnosis of 
visual distortions; and (b) to measure the 
effects of retinal feedback delay and distortion 
on eye and head movements under controlled 
conditions in which it was not possible for 
eye movements to compensate feedback alter- 
ations of head movements, and vice versa. 
The methods aspect of the study has implica- 
tions for optometry and perceptual training 
by defining certain experimental foundations 
for dynamic feedback methods of optometric 
measurement and visual training in perceptual 
disorders. 

As noted above, two general theories have 
been formulated regarding adaptation to 
altered visual feedback: the classical theory 
of von Helmholtz (1856-66) and the move- 
Mment-compensation hypothesis of Smith and 
Smith (1962) and of Hay and Pick (1966). 
According to the classical view and its modern 
restatement (Freedman, 1968: Held & Hein, 
1958; Rock & Harris, 1967), normal space 
perception and adaptation to optical distor- 
tions is dependent on learned association 
between visual and tactual (or proprioceptive) 
processes. Proponents of this view assume 
that individuals can adapt more or less com- 
pletely to different conditions of displaced 
vision and that the adaptive change consists 
of a general process of perceptual learning 
The movement-compensation hypothesis pa 
serts that the capability of response and 
adaptation to displaced vision depends specif- 
ically on the movement mechanisms—body 
limb, head, or eye—affected by altered feed. 
back and the capacity of a second movement 
system to compensate the one that is affected 
by the visual distortion. Smith and Smith 
provided specific evidence for this hypothesis 
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by showing that the extent of adaptation to 
displaced vision varied with the type of mo- 
tion affected and the conditions of displace- 
ment. Hay (1968) showed that adaptation 
to altered feedback of head movements varied 
in relation to the degree that the eyes could 
correct for the head movement feedback dis- 
placement. Wallach, Kravitz, and Lindauer 
(1963) showed also that rapid adaptation to 
displaced visual direction depended on the 
mode of movement and position of S relative 
to the altered spatial environment. The pres- 
ent study extends these past investigations by 
developing controlled systems methods for 10 
lating the sensory effects of delayed and aisi 
placed retinal feedback of head and eye 
motions. 


METHOD 


The experimental vision station and hybrid-com 
puter system diagramed in Figure 1 was usec on 
yoke eye, head, and coordinate head-eye moven 
to a light target that was projected on a eyna 5 
viewing screen located 36 in. from S’s eyes- p 
task consisted of moving his eyes, head, or hea 
eyes together to keep the target centered OP 
screen, as it was displaced horizontally by the 
puter system. In addition, the computer syster i 
programmed to reverse the spatial relation be ; 
movement direction and target direction, one 
time machine to delay the action of the °. 
feedback signals of the different movement *” 
for predetermined periods of time. 
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ts 


com- 
was 
en 


justrates 
acad, ® 
l duce 


iomete” 
corded by a headgear that rotated a potentior 


ad- 

located at the top center of rotation of the pe 
gear. The two transducing systems were APP ally 
mately linear in their output and could be exet < 
equated electrically by the calibration proc 
used. yl 
The experiment was controlled and feedback-c¢ es 
variations made in the yoked eye and head mov J 
ments by the computer system. To effect this ue 
trol, the continuous analog signals of the transduce’ 
movements were amplified and converted to dig 5# 
form by the analog-digital converter, This converte" 
also measured the incoming signals on a nine-b} 
digital voltage scale. The digitized signals were then 
selected by the digital computer in which they wert 
programmed for calibration of the transducing SS oi 
tems, for experimental measurement, and for intro” 
duction of delays into the feedback outputs of te 
systems. After these programmed operations we! 


ay 


edures | 


= lod 
completed, the signals were reconverted to analoë 
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LIGHT 
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PROJECTION 


EXPERIMENTAL 
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ITINING AND DATA 
CONVERSION EQUIPMENT 


Fic. 1. Experimental vision station and hybrid computer laboratory system for yoking projected 


visual targets to eye and head motions, 
used in the experiment.) 


form by a digital-analog converter. Thereafter, they 
were used to activate a light galvanometer which 
threw a light spot of 1.5° in diameter on the 
projection screen. 

Details of hybrid computer control and calibration 
of the transducers of the experiment are illustrated 
in Figure 2, To measure head and eye tracki 
error, the zero point of the screen was established as 
a zero voltage value and voltage variations from that 
zero point, as produced by the transducers, measured 
by the computer system. This measured error was 
outputted to control movement of the target which 
S saw. Before each trial was run, S was instructed 
to view the center point of the screen and the com- 
puter established this fixation position of the par- 
ticular transducer as a zero value. 

Both the eye- and head-motion signals were sub- 
jected to a second calibration between experimental 
trials. As indicated in Figure 2a, targets of known 
deviation were projected on the screen, and S was 
required to fixate these targets by head or eye move- 
ment. A deviation from equivalence between target- 
voltage value and transducer-voltage value at the 
fixated positions was corrected by increasing or de- 
creasing the eye and/or head transducer signal with 
a pair of potentiometers which were wired into the 
control circuits. The relations between this linear 
function relating calibrated transducer position with 
respect to target position, oscillograph pen deflection, 


(Insert: Eye movement and head movement transducers 


and tracking error magnitude are given in Figure 2b. 

The sequence of automated computer control of 
different experimental operations is illustrated by the 
oscillograph record of tracking error shown in Figure 
2c. In tracking the yoked light spot on the screen, S 
observed all the phases of operation depicted by this 
record. The error record, which represented 55 sec. 
of performance, was seen as a deviation of the light 
spot from the center of the screen. The black marks, 
which represented warnings to Ss to fixate the center 
of the screen, were seen as rapid 5° oscillations of 
the projected light target. The specific operations 
labeled were all controlled as phases of experimental 
automation that were regulated by dynamic com- 
puter programming. 

In viewing the target on the screen during an 
experimental period, S saw a light spot about 1 in. 
in diameter which deviated from the center zero 
position. By moving his head or eyes in a direction 
opposite this deviation, the target could be brought 
back to center. Error voltage was measured as 3 
deviation in making this compensatory centering 
movement. To cause the target to deviate in the 
manner just indicated, the computer was programmed 
to induce a random sine-wave variation in target 
position which had a mean frequency of 4.31 cpm 
Also, in different trials, the computer was pro- 
grammed to hold incoming digitized eye- or head. 
motion signals in memory for specified periods o 
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OSCILLOGRAPH RECORD SHOWING PHASES OF EXPERIMENTAL 
AUTOMATION 
Fic. 2, Dynamic Programming techniques used in experimental calibration 
(a and b) and experimental a 


Sat k utomation (c). ([a] Curve showing how target 

oltage an = í voltage were equated and related to measured 
tracking error and oscillogra [b] Calibration records 
; : 3 ntrol and recording system. [c] 
Steps in experimental automati 


bY dynamic programming as indicated by 
e-tracking error.) 
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0.0, 0.2, 0.4, 0.6, 0.8, and 1.5 sec. before outputting 
these signals for delayed feedback control of the 
yoked target. Under the delayed feedback condi- 
tions, S experienced a lag effect in action of the 
target with respect to his head or eye movements 
which was observed as an optical error or displace- 
ment of the target. In the course of observations, 
the computer automatically measured the tracking 
error for each selected sample of the digitized input 
movement signal—that is, .128 times per second. 
These sample error measures were reduced on-line to 
5-sec. error means, 

A repeated-measures plan with five factors was 
used in which each of 12 Ss acted as his own con- 
trol in performing the three modes of tracking with 
six delay intervals. The design was three modes of 
movement or tracking of 18 trials each with each S, 
and nested the six delay magnitudes of 0.0, 0.2, 0.4, 
0.6, 0.8, and 1.5 sec. within each mode of trackinz. 
Conditions of delay were ordered according to a 
counterbalancing scheme. This scheme resulted in 
three trials of 55 sec. each for each delay. Within 
each trial there were ten 5-sec. error measurement 
periods. Between successive trials, Ss were rested for 
35 sec. and for periods of 5 min. after 15 min, of 
observation. Periods of observation lasted for 2 hr, 

The crux of experimental control in this study 
concerned the isolation of head and eye motion under 
particular modes of head, eye, and head-eye tracking. 
Under a particular mode of tracking, only the 
transducer or transducers peculiar to that mode 
would actuate the yoked target. In eye tracking, S’s 
head was held fixed, he was instructed to move only 
his eyes, and was informed that any head movement 
would only increase error. In head tracking, S was 


instructed to keep his eves fixated on the center point 
of the projection screen, and was told that any use 
of the eyes in tracking would serve only to increase 
error. In coordinated head-eye tracking, both the 
head and eye transducers were activated and S could 
control the yoked target with either head or eye 
movement equivalently. 


RESULTS 


The main results consist of comparisons of 
tracking accuracy between the three move- 
ment modes and between the different delay 
intervals. 


Effects of Retinal Feedback Reversal and 
Delay 


The three movement records shown in Fig- 
ure 3 will tell generally what happened in the 
experiment with the three modes of tracking 
control. The records shown are of the actual 
movements that the eye or head performed in 
tracking a computer-generated sine-wave tar- 
get variation like that shown to the left of 
the records. To track accurately, the eye or 
head had to oppose or compensate this wave 
pattern to keep the target centered on the 
projection screen. As the records show, the 
head performed the smoothest tracking, the 
eye movements were far less regular and ac- 
curate, and the head-eye motions were least 


Fic. 3. Qscillograph movement records from one § with reversed retinal 
feedback with the three modes of tracking. 


EYE HEAD-EYE 
TYPE OF TRACKING 


Fic, 4. Bar graph showing the differences in accuracy 
of eye, head, and head-eye tracking. 


accurate. In the case of all three movement 
modes, tracking motions, instead of being 
smooth pendular Pursuit movements as they 
should have been, were step-wise saccadic 


0.2 0.4 0.6 0.8 
DELAY MAGNITUDE (SECONDS 
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motions. These were smallest for head move- 
ments, which tracked the target most accu- 
g-i bar graph in Figure 4 describes the 
relative error of the compensatory eye, be 
and head-eye tracking for all experimental 
conditions combined. The graph indicates 
that head control gave the most aaora 
tracking, whereas accuracy of the eye A 
head-eye tracking were both considerably 
lower than the precision of head nie oe ! 

The functions relating tracking error in the 
three movement modes with feedback delay 
magnitude are given in Figure 5. The super a 
accuracy of head tracking was most penne 
with lower magnitudes of delay, but wa a 
duced progressively as the feedback ke 
interval was lengthened. Accuracy of eye a i 
head-eye tracking was almost identical n 
magnitudes of feedback delay. For all Peg 
movement modes, the effects of peren 
and head-retinal transmission lags were Ve 
nearly linear. 


1.6 
) 


delay for eye, head, 
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Head and Eye Adaptation to Reversed and 
Delayed Feedback 


Characteristics of adaptation of eye and 
head motions to the reversed and delayed 
feedback were analyzed in terms of oscillo- 
graph records of tracking movements and 
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tracking errors under the variable feedback 
conditions. Records of head, eye, and head-eye 
movements of three Ss are shown in Figure 6. 
Each series of records represents performance 
in one trial at each of the delay intervals. 
Aside from the seconds time line given to the 
left, each record contains three lines repre- 


HEAD-EYE TRACKING 


ae 
[Ve 


1.5 
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Fic. 6. Oscillograph records of head, eye, and head-eye tracking of three Ss in one trial 
at the six retinal feedback-delay intervals. 
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16. 6. Continued. 


senting the target wave, head or eye move- 
ment, and tracking error (to the right). In the 
case of the coordinate head-eye movements, 
each record has four lines representing target 
wave, eye movements, head movements, and 
coordinate head-eye tracking error, 


Overlooking some variations in the mag”, 
tude of tracking error, the three Ss showe 
quite similar head, eye, and coordinate he? 
eye movements under the six delay conditio™, 
At zero delay, the head movements were ver? 
accurate, composed of fine saccadic mov 
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HEAD TRACKING 
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Fic. 6. Continued. 


erratic; Sg did almost all the tracking in this 
situation with his eyes. 

The records showed that with increasing 
magnitudes of retinal feedback delay, track- 
ing error increased progressively. This in- 
creased error was manifest especially in 


Ments, and duplicated the pattern of the tar- 
Set wave, In contrast, eye movements at zero 
delay were highly erratic although also com- 
Posed of saccadic movements. Under these 
Same conditions of reversed feedback at zero 
delay, head-eye movements were also highly 
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marked change in irregularity 
of eye movements, In the ca 
movements, head motions bec 
nent in the coordinate action 
as the delay interval was incre 
especially true of Sẹ and So. 
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TABLE 1 
$ MULTIPLE-RANGE TESTS FOR DIFFERE 
RACKING ERROR FOR MOVEMENT MODES 
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TABLE 2 


DUNCAN MULTIPLE-RANGE TESTS FOR DIFFERENCES 
IN MEAN TRACKING ERROR FOR 


HEAD, AND HEAD-EYE TRACKING FEEDBACK-DELAY INTERVALS 
RR a pio eae S 
Level of | Level of | | | 
. A ar Pye A eye y z | 
significance | aeh g Eye | Head-eye signifi- | 0.0 | 0.2 | o4 | 0.6 | 08 | 1.5 
= -$ SO a aE = cance | 
3% | 14.80 20.40 19.82 | 
5 Š = soz 5.39 75 9.71 ¢ 
10 | t480 20.40 19.82 5% 11.60 15.32 16,00 17.51 1RN, 29.96 
- < | apie a 1% | 11.60 15.32 16.00 17.51 19.70 29.96 


dX’ Lines underlying common means are not significantly 
erent from one another. 


ticular delay interval. Some sequences of 
trials, such as those at 1.5-sec. delay, show 
Some indications of improvement over the 
three trials, while other sequences, such as 
those for the head and eye at 0.6-sec. delay, 
Sive evidence of increased irregularity with 
Practice. Overall, the changes with practice in 
the head, eye, and head-eye movements were 
deed quite limited. i 


Statistical A nalysis 


A five-way analysis of variance was carried 
Out on the error data. This analysis was based 
a mg 5-sec. error means within trials or ob- 
trials ae and encompassed the means for 
the es ‘movement modes, and Ss. All 
1.0% | in variables were significant at the 
terms a as tested by the appropriate error 
effect of ne trial variable, which indicated the 
evidenc practice, showed only the slightest 
reductio: of learning change. A similar slight 
sive 5. on in error level occurred over succes- 
tions sec. means during the trials or observa- 
rela termine the main sources of variance 
multi to modes of tracking, Duncan (1955) 

iple-range tests were computed at the 
wy 5% levels, as shown in Table L 
an seh ole, two or more means linked by 
tom ei ine are not significantly aaa 
Means ne another. The table indicates tha 
isien for modes were all significantly dif- 
but vias one another at the 5% level, 
racki at the means for eye and head-eye 
one ng Were not significantly different from 
anal another at the 1% level. A similar 
_ YSIS was made of the means for delays, as 


_Note.—Lines underlying common means are not significantly 
different from one another. 


given in Table 2. This table indicates that at 
both the 1% and 5% level, all means were 
significantly different from one another ex- 
cept those for the 0.2-sec. and 0.4-sec. delay 
conditions. 

Analysis of the significance of the differ- 
ences between mean tracking error for delay 
intervals with specific movement modes of 
tracking, as described earlier by the curves in 
Figure 6, are given in Table 3. The table, 
which is based on range tests at the 1% level 
of significance, indicates again that the effects 
of delay on the specific modes of eye and 
head tracking were highly reliable except for 
the 0.2- and 0.4-sec. delay levels. The relia- 
bility of the differences for the combined 
head and eye tracking was less than that for 
the specific movement modes. 


SUMMARY 


This experiment has evaluated the validity 
of the doctrines of response to distorted vi- 


TABLE 3 


DuNCAN Muttipte-RANcE Tests AT tHe 1% 
LEVEL OF THE SIGNIFICANCE € E DIFFERENCE 
BETWEEN MEANS FOR DELAYS WITH THE 
THREE Movement MODES OF 


TRACKING 
Movement gij | 0.2 | 04 | 0.6 | os |15 
mode | | 
| aK ESE | = 
Head S3 11.37 10.98 14.89 
Eye 52 95 19.38 18.66 


Head-eye | 13.93 18.63 17.53 18.6: 


Note.—Lines, including bi 
common means are not signific 


~antly different from one another, 
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sion relative to their application to specifica- 
tion of the human factors in design of ma- 
chines and visual measurement and training 
procedures. Observations confirmed prior work 
(Smith & Molitor, 1968) in showing that re- 
versal of eye-moyement-retinal feedback 
caused marked rapid unstable vision, to which 
S made little or no adaptation. Such reversal 
of vision eliminated smooth pursuit move- 
ments of the head and eyes and degraded vi- 
sion to use of saccadic movements. Restriction 
of vision and visual tracking to reversal of 
the feedback effects of eye, head, and head- 
eye motions impaired eye-movement functions 
more than head movements. Head movements 
with compensatory visual feedback were also 
reduced to saccadic movements, but these head 
saccades were very small and still made pos- 
sible accurate head tracking. 
The experiment specifically tested the hy- 
pothesis that the effects of revers 
movement feedback are distinct fro 
of head movement control of visual tracking 
and localization, as the latter has been studied 
in inversion and reversal of the visual field by 
prisms and telescopes mounted in experi- 
mental spectacles. The results indicate that 
reversal of eye-movement-retinal feedback is 
table vision and in- 
on than anything 
with displaced vision 
spectacles mounted on 


al of eye- 
m reversal 
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ments between two mechanisms—that is, be- 
tween the eye and head or between the head 
and eye. Under such controlled systems condi- 
tions, it was found that the capability of the 
eye system to correct for reversed feedback of 
ocular movements was next to impossible be- 
cause the built-in directional specificity of 
ocular movements in controlling positioning of 
the retinal image cannot be altered. In con- 
trast, the head movement system could com- 
pensate reversals of feedb 
because the head does not possess the in- 
grained directional feedback relations between 
eye motion and retinal image positioning, The 
findings indicate that the role of feedback 
relations between ocular dynamics and retinal 
stimulation in determining space perception 
and visual guidance cannot be assessed by 
tests of effects of optical distortions on head 
and body movements, 

The experiment ev 
the movement-com 
the visual tactual learnin 
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combining the transduced signals of head and 
eye motions into a yoked feedback control of 
a visual target, combined the characteristics 
of movements observed in the isolated pat- 
terns of head and eye tracking. 

The results give added meaning to the be- 
havioral feedback interpretation of the critical 
human factors in defining man-machine and 
man-systems relationships. According to this 
interpretation, the effectiveness of machines 
and devices for human use and the require- 
ments for learning in operating them depend 
on the degree of compliance in the space, 
time, and force factors between living motion 
and the action of machines. Tools and ma- 
chines introduce an instrumental feedback 
factor into behavior which occurs as an ad- 
Junct to the feedback effects from movement 
itself and from operational control of the en- 
vironment which occurs in unaided behavior. 
These different sources of feedback must cor- 
relate with one another in terms of their 
Space, time, and force variables. The greater 
the discrepancy in feedback between these 
different sources of feedback, the greater the 
demands for learning the use of tools and 
machines and the more limited the effective- 
ness for adaptation in their operation. Results 
of the present experiment suggest that the 
more tools and machines alter the critical 
retinal feedbacks of the eye itself, the less 
effective the device for human use and learn- 
ing. 

As indicated by the results on delayed eye- 
movement-retinal feedback, visual disturb- 
ances can result from alterations in feedback 
timing which are continuously involved in 
ocular and head control of retinal input. The 
Conclusion may be made that visual percep- 
tual disabilities, such as those found in edu- 
cational skill, may be based on asynchronism 
of control of retinal stimulation by eye and 
head motion. Inasmuch as several levels of 
dynamic control of retinal stimulation are 
always involved in vision, many possibilities 
exist for feedback asynchronism in use of the 
eyes and in producing feedback disturbances 


of perception. The systems methods of dy- 
namically isolating response variables in 
vision for the study of delay effects may rep- 
resent a solution to the problems of measur- 
ing visual functions of the eye in action. The 
results on delayed feedback indicate that such 
dynamic methods of visual diagnosis may rep- 
resent a crucial need in understanding per- 
ceptual deficiencies which cannot be reduced 
to optometric or ophthalmologic defects. 
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; PAY AND UNDERPAY 
FECT OF HOURLY OVER 
2 INEQUITY WHEN TESTED WITH A NEW 

3 INDUCTION PROCEDURE: 


E. R. VALENZI anv I. R. ANDREWS? 


Bowling Green State University 


This study tested Adams’ inequity theory with an induction which did not 
challenge the workers’ job qualifications, eliminating devalued sel aig os i 
confounding variable. Thirty-one workers were hired for an hourly pay ietical 
task in a real job situation and assigned to one of three groups overpay, 
underpay, and control—in a before-after design. Contrary to inequity theory 
predictions and to previous inequity theory experiments, there were no signifi- 
cant work performance differences among the three groups, but 3 of 11 under- 
pay Ss quit the job while no other Ss quit. It was concluded that the results 
of this experiment combined with the results of other experiments strongly 


suggest that inequity effects previously reported are probably due mainly to 


the self-esteem variable. It was sug 
focus more on variables such as tur: 
than solely on work performance, 


Five recent articles (Andrews & Valenzi, 
1970; Friedman & Goodman, 1967; Gordon 
& Lowin, 1965; Lawler, 1968a; Moore, 1969; 
Pritchard, 1969) have suggested that when 
perception of overpay inequity is created by 
challenging the worker’s qualifications to per- 
form the job, results are seriously confounded 
by the simultaneous arousal in the worker of 
motives to protect self-esteem, The purpose 
of this study was to test inequity theory 
(Adams, 1965) with an induction procedure 
which did not challenge the worker’s job 
qualifications, eliminating devalued sel f-esteem 
as a confounding variable. 

For a variety of reasons, this study used 
hourly paid workers rather than piece-rate 
workers. In his review of wage inequity re- 
search, Lawler (1968b) concluded that re- 
sults from studies that paid workers on a 
piece-rate basis can be explained by expect- 
ancy theory (Vroom, 1964) with only a slight 
modification based on inequity theory. By 
using hourly paid workers, the authors 
avoided this alternate explanation since ex- 
pectancy theory predicts no difference þe- 


tween hourly paid inequity groups and equity 
groups. 
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ggested that research on wage inequity should 
nover, satisfaction, and recruitment rather 


Second, hourly pay is a more stringent 
test of inequity theory because of the differen- 
tial effort required to restore equity. Under 
the piece-rate condition, perceived overpay 
inequity can be corrected by limiting quantity 
of work, thereby reducing outcomes, Even an 
attempt to improve work quality wouldn’t 
require much effort since the work pace 15 
very slow. With hourly overpay, on the other 
hand, the worker can influence only the input 
side of the outcomes/input ratio. Therefore; 
to correct a perceived inequity, he must in- 
crease his work effort rather substantially. 


In the hourly underpay condition, the worker 
can decrease his inputs by doing 


and/or poorer quality work. This st 
quires a deliberate intent to perfo 
and has a kind of revenge or get e 
associated with it. It may be psy 
more difficult to rationalize sy 
than it is to correct piece-rate 


increased work quantity with 
work quality. 
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Third, no experimental 


study has been re- 
ported that tested 


inequity theory for the 
effect of hourly underpayment. Thus, the 


present study extends Previous work 
inequity theory in this respect, 

Fourth, hourly Payment is far more com- 
mon in the industrial settin 
payment and therefore is o 
interest. 
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The present study is the first experimental 
test of inequity theory in the hourly over- 
Payment and underpayment condition that: 
(a) Created overpay and underpay inequity 
Without complications from job security and 
Self-esteem as confounding variables, (b) gave 
Ss inequity relative to self and to other salient 
Workers, (c) allowed each S to come in con- 
tact with other Ss in his work group, and 
(d) manipulated outcomes instead of inputs. 

Stated in its most general form, the in- 
equity theory hypothesis states that if Person 
ae inequity, the resulting dissonance 

Motivate him to reduce the inequity. 
Adams (1965) defines inequity as follows: 


a ge exists for Person whenever he perceives 
a the ratio of his outcomes to inputs and the 
“to of Other’s outcomes to inputs are unequal. 
ae 5 may happen either (a) when he and Other 

e in a direct exchange relationship or (b) when 


both are in a direct relationship with a third 
Tp a80n Person compares himself to Other 


eee the modes of inequity reduction 
Cites : r erson may engage in, Adams (1965) 
cotnbe tering inputs, altering outcomes, or a 
era: of both as possible alternatives. 

ment a hourly pay system of this experi- 
Were: e more specific hypotheses tested 
Paid on Tf Person perceives himself as over- 
is ts . hourly basis, then he will increase 

and/or. effort leading to better quantity 
Ceives ne of work; (b) if Person per- 
asis, t Imself as underpaid on an hourly 

> “fen he will decrease his work effort 


€ading : c 
of work, Poorer quantity and/or quality 


Sub jects MrTHOD 


Thirty- 
yr. ay p oile Ss between the ages of 18 and 


Wi ie 
Work Were ob 


the Fi 
inancial As 1 0 
Versity ancial Aids Office of a large midwestern unl- 


on other job opportunities by accepting 

on this project, 

ag signifi, the experimental manipulation, there were 

the thre cant dependent variable differences among 
€ groups to which Ss were randomly assigned. 


Procedur ë 


Eac ` 
s a 79 
her S was contacted by telephone and told that 


® Was obtained from the Financial Aids 


Office. The E explained that he was a research 
assistant on the survey research Project on campus 
and that he was presently hiring part-time help to 
prepare data for computer analysis. The S was asked 
if she were still interested in part-time work. If she 
was interested, she was asked to come to a group 
meeting with other girls who would be working 
on the project. In this way, E met with six Ss 
at a time. The purpose of the group meeting was to 
make each S aware of all the other workers on the 
project and of the wage they were to receive. At 
the group meeting, E again explained that he was 
the research assistant on the survey research project 
and that he was in charge of processing the data. 
The office was set up to look like a legitimate re- 
search operation with piles of questionnaires, IBM 
cards, and boxes scattered about. At the group meet- 
ing, E obtained class schedules from each S, briefly 
explained the nature of the project that they would 
be working on, showed them the nature of the task 
they were being hired for, and had each S complete 
a “request-to-hire-student-employee” form obtained 
from the Financial Aids Office. The Ss were told 
the job would last at least 6 wk. It was also made 
clear that the project was financed by a grant from 
the National Institute of Mental Health (NIMH) 
and was therefore independent from the Financial 
Aids Office as far as pay was concerned. This was 
necessary since the Financial Aids Office determined 
the pay rate for all student jobs. The Ss were also 
told that they would receive their pay in cash upon 
the completion of each work period and that they 
would merely sign a receipt for the money they 
received. This method of payment was justified as 
a matter of convenience to eliminate a lot of paper- 


work. 
After the hourly rate for the job was given as $1.40 


per hr., Æ asked, “Do you feel that this rate is fair 
for this job?” All Ss agreed that this was a fair 
wage for the job. Finally each S was scheduled for 


two, 2-hr. work sessions with 2 or 3 days between 
the two work sessions. i 
The task required S to transfer X marks from 


a three-page questionnaire consisting of 72 semantic 
differential scales to IBM answer sheets. It was 
explained that this transformation prepared the ques- 
tionnaires for machine scoring. The pattern of Xs 
on the questionnaire was determined by using a table 
of random numbers. Ten different patterns were used 
so that every tenth questionnaire was identical. Post- 
experimental interviews indicated that no S was 
aware of scoring the same questionnaire more than 
once. This task permitted the collection of both 
quality data (average number of errors per question- 
naire) and quantity data (number of questionnaires 
scored). 

The corrected split-half reliability for work quality 
in Work Session 1 was .50 and for Work Session 2 
it was .64. Both estimates were Spearman rank-order 
correlations and both were significant at the .01 level. 
To obtain the correlations, the number of errors 
made on the even-numbered questionnaires was cor- 
related with the number of errors made on the odd- 
numbered questionnaires within each work session. 
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It was not possible to obtain the split-half zei 
ability for the work quantity measure since me on! : 
data available for quantity was the total number or 
questionnaires completed in each work session. A 
test-retest reliability was computed for the quantity 
measure by correlating the number of questionnaires 
completed in Session 1 with the number completed 
in Session 2. The Spearman rank-order correlation 
was .88, significant at the .01 level. , = 

Unlike the tasks previous! y used in equity studies, 
work quality and work quantity were not negatively 
correlated. Difference scores (Session 2 minus 
Session 1) for work quality correlated 06 with 
difference scores for quantity of work, essentially à 
zero correlation even after a correction for 
attenuation. 

The sensitivity of work quantity on this task as 
a dependent variable was demonstrated in a pretest 
study with 28 introductory psychology students. In 
their first 20-min. work session, students were given 
a standard experimental set designed to elicit a high 
level of motivation. In their second 20-min. work 
session, 8 small cash prizes were offered to those 
students who could score the most questionnaires 
without losing accuracy. Roughly speaking, work 
quantity doubled in the second work session. More- 
over, there was no reduction in average work quality. 


Design 


A before-after design with a control group was 
used. The Ss were randomly assigned to one of three 
conditions (underpay, overpay, or control) by use 
of a table of random numbers with the restriction 
that from each group of 6 Ss that met together, two 
were assigned to each experimental condition. The 
Ss worked individually in an office down the hall 
from E’s, 

At the conclusion of the first day’s work period, 
S was paid $2.80, required to sign a receipt for the 
money, and reminded of her next scheduled work 
period. Upon reporting to the office for her next 
day’s work, an overpay S was told the following: 


Good morning (afternoon) Miss Brown. 

_As you know we are financed by a grant from 
NIMH. Well, when we applied for the grant we 
had to submit a budget to account for expendi- 
tures of the funds requested. At the time we 
submitted the budget, we had need for a statistical 
clerk. As it turned out the new 360 computer does 
more work than the old 1620 so we don’t need 
the statistical clerk to do some of the computa- 
tions. We took the money allocated for that func- 
tion and divided it up among the clerical workers 
to increase their wage rate. Well, the business 
office controls the grant money and this is what we 
got from them (yesterday, day before). 


At this point, S was shown a memo typed on uni- 
versity stationery from the office of the controller 


to the project director. The S read the memo which 
said: 


It is felt that eliminating the statistical clerk 
from the budget and appropriating funds allocated 
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for his function and distributing them among the 
general clerical help involves the taking of too 
much liberty with the budget as approved by the 
office of NIMH. 

We advise as close adherence as possible to the 
original budget as submitted. A change such as 
that proposed by you should be effected by a 
formal application to NIMH for budget revision. 


After S finished reading the memo, E continued: 


By the time we submitted the request for a 
budget change and got approval for it, the project 
is likely to be done. Besides we are paying you 
as you work which complicates matters. So what 
the project director did was to choose a name at 
random to receive the pay of the statistical clerk 
so as to conform to the original budget. Now the 
pay of the statistical clerk is $2.00 per hr. and your 
name was chosen. It has nothing to do with you 
or your work, it’s just a matter of chance that you 
were picked. It’s important that you don’t tell 
anyone else about your pay change. We don’t 
want to cause any bad feelings among other girls 
working on the project. 


After the induction, E answered any questions S 
might have and then said, “Now you understand 
what this means, you will be receiving $2.00 per hr. 
while everyone else still gets the $1.40 per hr.” 

The underpay induction was similar to the over- 
pay, except that a messenger was substituted for 
statistical clerk and the wage of the messenger was 
$1.20 per hr. The underpay induction finished with 
the question, “Now you understand what this means, 
you will be receiving $1.20 per hr. while everyone 
else still gets the $1.40 per hr.” The control Ss were 
engaged in routine conversation for a comparable 
length of time. 

It should be noted that this induction procedure 
was designed to produce wage inequity relative to 
all other workers in S’s work group, relative to self 
in the first work session and relative to the going 
rate ($1.40 per hr.) for other student workers hold- 
ing jobs on campus. It was felt that this manipula- 
tion was much stronger than that given Lawler’s 
(1968a) overpay-by-circumstance group since his 
workers had equity relative to other workers on the 
same project. 

After the workers had completed the second work 
session, the true nature of the research project was 
explained in general terms. Experimental Ss were 
— as ek two open-ended questions 

hey ha a o the announcement of the 
wage increase or decrease. In addition, experimental 
and control Ss were asked to rate 
7-point wage inequity scale 
of 1, very underpaid, to a value of 7, very overpaid. 


The group mean rati 

g an ratings for und 1 
erpay, co and 
overpay were 2.3, 4.3, and pay, control, 


~ 5.8, ively n 
differences between the nee » respectively. Mea 


sneer rol group and each of 
the experimental groups were significant at the 001 


level. Additional evidenci 
pee evidence for the success of the 


al manipulation i a 
* z n is presented i Jis- 
cussion section. presented in the I 


Before leaving, each S was thanked for his par- 


the job on a 
» ranging from a value 


i 


D 
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aS pay Was paid $5.00 for the second work session 
and was asked to refrain from talking about the 
Project with other students, 


RESULTS 
A quantity difference score was obtained 
for each S by subtracting the number of ques- 
tionnaires completed in Session 1 from the 
number completed in Session 2. The difference 
Scores for the overpay group and the under- 
pay group were compared separately against 
the difference scores for the control group by 
the Mann-Whitney U test (see Table 1). The 
value of y obtained for the overpay versus 
control comparison was 48; for the underpay 
versus control comparison, U was 47. A U of 
id or less would have been significant at the 
wl a for each comparison. Since neither 
„| Parison was significant, the null hypothe- 
Sis (that the control group represented the 
Same population distribution on quantity as 
the overpay and underpay groups) was not 
rejected, 
A comparison of the extreme groups 
Kunderpay vs. overpay) produced a U value 
$ IS with a value of 23 or less needed for 
nificance at the .05 level. Thus, not even 
„© Xtreme groups comparison produced a 
Significant difference. 
wee difference scores were obtained for 
i Y subtracting the quality score for 
oO 1 from the quality score for Session 2 
oF aie able 1). The quality difference scores 
compar, oe and underpay groups were 
erenc red separately against the quality dif- 
neither scores for the control group. Again, 
level The Parisai was significant at the .05 
and th e overpay versus control U was 45 
€ underpay versus control U was 41. A 
ee of 26 or less would have been signifi- 
nt at the .05 level, 
Pay ve Parison of the extreme groups (over- 
` Underpay) on quality yielded a U 


TABLE 1 

MEAN : 
Mean Quantity anp Quatity DIFFERENCE 
Scores ror tim ‘Tiree GROUPS 


Deinen tne erin 
Teatment group Quantity Quality 
Overpay 4.40 —.376 
ontrol 4.30 - .090 
Underpay 5.20 282 


TABLE 2 


NUMBER OF PERSONS WHO WERE ABOVE oR 
BeLow THE MEDIAN on Bota Work 
QUALITY AND Quantity 


Treatment Above | Below Total 
group 

Overpay 3 0 3 

Control 3 3 6 

Underpay 2 2 3 
Total | 8 5 | iy 


value of 34 with a value of 23 or less required 
for significance at the .05 level. 

The quantity and quality measures were 
considered simultaneously in the following 
way. The median quantity difference score 
and the median quality difference score were 
found for the entire sample. For each group, 
the number of Ss above the median on both 
dependent measures and the number below 
the median on both dependent measures was 
determined (see Table 2). To evaluate the 
statistical significance of these frequencies, 
they were entered into three 2 X 2 contin- 
gency tables—one for overpay versus control 
comparison, one for the underpay versus con- 
trol comparison, one for the underpay versus 
overpay comparison—and analyzed by the 
Fisher exact probability test (Siegel, 1956). 
None of the comparisons were significant: 
overpay versus control, p= .24; underpay 
versus control, p = .50; overpay versus under- 
pay, p= .35. 

Discussion 


Hourly Overpayment 

Since these results, no differences between 
overpaid and equitably paid workers, are con- 
trary to those of previous hourly overpay 
studies (Adams & Rosenbaum, 1962; Ar- 
rowood, as cited in Adams, 1965: Goodman 
& Friedman, 1958), it is essential to estab- 
lish that overpay inequity was in fact per- 
ceived. Confidence in the success of the ex- 
perimental induction is based on four pieces 
of supporting evidence. First, on a postwork 
questionnaire, workers in the overpaid group 
rated themselves on a 7-point wage equity 
scale as overpaid relative to other workers 
on the project. Second, on an open-ended 
question which asked, “What were you think- 
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ing about immediately after the an aa 
told you of the change in pay rate and A 
left you to start work?” 7 out of 10 wor ers 
mentioned overpay inequity as part of thrig 
response. Third, when the experimental work 
situation was described in detail and the crit- 
ical part of the experimental manipulation 
was acted out in front of 43 students from 
an introductory psychology class, 28 of the 
43 students responded to the open-ended 
question described above in terms of wage 
inequity. (For a fuller explanation of this role 
projection technique, see Andrews & Valenzi, 
1970.) Confirming those results, out of 9 
structured paragraphs describing possible re- 
actions of the worker to the induction, a 
paragraph depicting wage inequity relative 
to Other was ranked by the 43 student ob- 
servers as the most plausible reaction to the 
treatment. Last, one can consider the face va- 
lidity of the manipulation. Inequity relative 
to self was fostered by manipulating the wage 
rate from an equitable level in the first work 
session to a much higher rate in the second 
work session. Inequity relative to Other was 
encouraged in two ways: first each overpaid 
worker knew that she was making 60¢ per 
hr. more than the other workers who were 
familiar to herself on relevant input charac- 
teristics (age, education, sex). Each overpaid 
worker met all the other workers in the group 
meeting at the time of hire. Second, $2.00 
per hr. was considerably more than the stan- 
dard campus wage ($1.40 per hr.) for that 
type of work. Taken together, the authors be- 


lieve that the above data show that the over- 
paid workers did in fact 


perceive overpay 
inequity, 


The lack of a performance difference be- 
tween overpaid and equitably paid workers 
can be interpreted as evidence for the posi- 
tion that previous overpay inequity studies 


were confounded by nonequity motivations 
aroused by the challenge to job qualifications 


induction (Andrews & Valenzi, 1970: Lawler 
1968a, 1968b; Moore, 1960; Pritehacy’ 
1969). Apparently, when these honequity 
motivations are eliminated, the Perception of 
hourly overpay inequity does Not lead to 
performance improvement, 

This finding is consistent 
(1968a) who found no performa 
between unchallenged overpaid 


with Lawler 
nce difference 
workers and 


equitably paid workers while his challenged 
overpaid group did do significantly more 
work than the equitably paid group. Lawler 
concluded that his results, combined with 
those of Gordon and Lowin (1965), argue 
strongly that it is the challenge to qualifica- 
tions, and not the feelings of overpayment, 
that accounts for the performance differences 
in the hourly overpay condition. 


Hourly Underpayment 


This was the first experiment designed to 
test inequity theory hypoptheses in the hourly 
underpayment condition. The results did not 
support inequity theory predictions concern- 
ing the effect of underpayment on work 
performance and, in a sense, were contrary 
to previous hourly pay studies (overpay) 
in that perceived inequity did not affect 
performance. 

Since the induction for underpayment was 
basically the same as the induction for over- 
payment, evidence for the success of the 
underpayment induction is essentially the 
same as the evidence offered above in support 
of the overpay induction. Additional evidence 
is the fact that 3 out of 11 underpay Ss quit 
the job, 1 immediately upon being informed 
of the pay change and 2 after working theiy 
scheduled work session. No persons in tife 
equitable paid or overpaid groups quit the 
job. Thus, it is difficult. to argue that the 
lack of an underpayment performance dif- 
ference was due to a failure of the induction. 

It is of interest to note that 
perception of underpay 
workers indicated that 
during the second work s 
the pay cut made them 
both the work performan 
naire responses indicat 
and the resulting dissati 


despite the 
inequity, only 1 of 11 


work harder, Thus, 
ce data and question- 
e that underpayment 
sfaction did not affect 


e. sult is consistent with 
current thinking on the relationship among 


satisfaction, rewards, and performance (Smith 
& Cranny, 1968) which states that rewards 
and satisfaction can influence performance 
only if mediated by an effort or intention 
variable, 

On the other hand, the fact that 3 of 11 
underpay workers quit the job and 5 of the 
remaining 6 said they thought about quitting, 
demonstrates that underpayment does have 


she decreased effort | 
ession and 2 said that 


a 


i 


ba 
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an effect. These turnover data suggest that 
perhaps attention has been focused on the 
wrong dependent variables in wage inequity 
research. The primary effect of underpay in- 
r be on turnover and on other forms 
aae the field such as absenteeism and 
age requests to more equitable paying 
hs miaj the primary effect of overpay 
i I on ease of recruitment and the labor 

rom which you are able to select 
workers, É 


Limitations 


ead manipulation might have been 
Workers : the workers thought that their co- 
possible oor about the pay increase. It is 
a t nat overpayment relative to co- 
Notke not particularly distressing if the 
ignora zers do not know about it. Co-worker 
es a neg combined with Person’s knowledge 
be e ee required one worker to 
waa could have enabled Person to 
alize the overpayment. 
oe underpay Ss were upset by the 
take i they may have been reluctant to 
Project ei on the research assistant or the 
edian fee since they were merely inter- 
real cues. in the pay change. After all, the 
“hi prit was someone in Washington, D. C. 
theme a was especially apparent in the 
La a to Ss during the induction. 
restrict Mer pay: Ss may have been hesitant to 
Work (seas or turn in poor quality 
a reven cause such behavior has overtones of 
with it K or a get even motive associated 
felt th Se ne college coeds may have 
their eal such behavior was inconsistent with 
addition tural norms of acceptable conduct. In 
naires a it was obvious how many question- 
numbered = completed since they were pre- 
tion in consecutively, A purposive aed 
Aten a would have been immediately 
Caused § and a fear of reprimand could have 
Gene: S to maintain their quantity level. 
Neralization of these hourly pay results 
with ere condition should be done 
o dalif To the extent that the challenge 
e hou ron iS a confounding variable in 
Blece rare” condition, it also confounds De 
at oar: studies that used it. It is Possible 
o ere E differences under piece-rate 
Oundi E might disappear if self-esteem con- 
ng is eliminated. But it is also possible 


co 


that in the piece-rate situation, performance 
differences might still occur because workers 
are more attentive to the relationship between 
pay and productivity. Also, as explained ear- 
lier in the paper, it is much easier to correct 
a perceived wage inequity under piece-rate 
than under the hourly pay condition. 

Last, it should be noted that the behavior 
of full-time permanent employees might be 
quite different from the behavior of our 
temporary, part-time employees. 
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AFFECTIVE RESPONSE TOWARD USES 
j OF RECLAIMED WATER 
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Thurstone-type affective attitude scales were developed greets 
n public reaction to high contact uses of reclaimed waste water. The scales 
assess 


were evi 


aluated by field survey research conducted in northern and southern 


California. Test-retest reliability coefficients ranged from 82 to vt a 
results met expectations regarding the influence of degree of = ad Aa ae 
vironment upon attitudes toward waste water reuse. The pre a ve I sn 
should be useful in future applied and theoretical research involving waste 


water reclamation. 


With a rapidly increasing population, there 

is currently in the United States an urgent 
need to reclaim domestic and industrial waste 
water in order to combat further degradation 
of the environment and to provide sorely 
needed new sources of water for beneficial 
uses (Metzler & Russelman, 1968). With 
recent advances in technology, it is possible 
to renovate waste water to such a degree that 
it meets existing domestic water quality 
standards. However, even though the tech- 
nical capabilities for thorough waste water 
renovation are available, there are serious 
questions regarding public acceptance of re- 
claimed water for higher contact uses. The 
idea of consuming water recovered from do- 
mestic sewage may be extremely repugnant 
to many in spite of needs for new sources of 
water, the imperative to combat degradation 
of the environment, and current technological 
expertise. 

The controversies created by proposals to 
fluoridate domestic water (Kirscht & Knut- 
son, 1961) have emphasized the role of 
public attitudes in water resources matters. 
Awareness of the importance of public atti- 
tudes has led to the initiation of a program 
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of research which will aid water utilities and 
involved public agencies to deal rationally 
with these attitudes when planning and initi- 
ating new uses of reclaimed water. This re- 
search is intended to aid the maximum devel- 
opment of new uses of reclaimed water with 
a minimum of damaging public controversy. 
It is believed that these goals can best be at- 
tained by anticipatory concern on the part 
of water utilities and public agencies regard- 


ing the attitudes and desires of the consumer 
population. 


One very useful means of obtaining infor- 
mation about public reaction toward specific 
water reclamation projects and proposals is to 
assess attitudes toward various uses of re- 
claimed water. Such data, along with other 
input regarding public reaction obtained by 
other procedures, can be of great benefit in 
helping utilities and agencies to plan proper 
uses of reclaimed water. The aims of the 
research reported here were: (a) to develop, 
scales assessing the affective component of 
attitude toward the use of reclaimed water 
for drinking (primary domestice use), laundry 
(secondary domestic use), and swimming 
(primary recreational use); and (b) to in- 
vestigate the reliability and validity of these 
scales by actual public surveys, à 


METHOD 
Scale Construction 


_Ttems. Thurstone (1931) has emphasized the affec- 
tive component of attitu 


attitude scaling. A conte: 
component fits into oy 
by Dillehay (1965). 


de in defending differential 
mporary view of how this 
crall attitude structure is given 
Torgerson (1958) has warned 
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against difficulties involved in attitude measurement 
using the judgment approach to scale construction 
and the response method of measurement. Specif- 
ically, Torgerson’s comments can be interpreted to 
mean that attitude items scaled for affectivity may 
be responded to on bases other than affect pro- 
ducing an obfuscated result. Thus, in gathering 
Potential attitude items. every effort was made to 
include only those that clearly and directly expressed 
affect toward a specific use of reclaimed water 
without injecting other attitudinal dimensions such 
as beliefs about disease producing potential or the 
lack of need for new water resources. 

With these and other well known criteria (Ed- 
Wards, 1957) as guides, 83, 58, and 82 items were 
developed dealing specifically with the use of re- 
Claimed water for drinking, swimming, and laundry, 
respectively, 

, Subjects. Fifty individuals volunteered to serve as 
Judges for the item scaling procedure. Twenty-seven 
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students, 17 were engineers, and 15 were otherwise 
employed. All Ss resided in the San Francisco Bay 
area. Ages of Ss ranged from 17 to 57 yr. 

Procedure. Items were typed, one to a plain white 
3X 5 in. card, and labeled at random with a one- 
letter three-digit number combination which ap- 
peared in the upper right corner of each card. Order 
of items within sets, and order of the sets them- 
selves, were randomly arranged before each indi- 
vidual judging session. Actual judgment procedures 
» described by the instructions read to each S by 
E. These instructions are reproduced below. 


Each card in the following group bears a state- 
ment describing an attitude about reclaimed 
sewage water. Consider the statement on each card 
in turn and decide how favorable or unfavorable 
it is toward reclaimed water. You are asked to 
judge the favorableness or unfavorableness ex- 
pressed by each statement in terms of the follow- 
ing 11 category scale, The distance between integers 


vere males and 23 were females. Eighteen were is equal in the scale shown below. 
1 2 3 4 5 6 7 8 9 10 11 
l | | | | | | | | | 
extremely neutral extremely 
unfavorable favorable 


Do not respond in terms of your own agree- 
ment or disagreement with the statements, but 
in terms of how favorable you judge each state- 
ment to be. Record the number of each statement 
and your rating of the statement on the appropri- 
ate line of the following data sheets. We are not 
interested in your personal attitude toward re- 
claimed water but where these statements fall on 
the continuum described above. The idea is that 
you are to act as neutral transducer, that is, you 
are to set your own attitude aside and judge each 
Statement on content only. 


Pied receiving the instructions, each judge rated, 
states only, under the observation of E, five 
for tec dealing with the use of reclaimed water 
dlize © ing. Questions regarding judgment proce- 
jud Were fully discussed before S proceeded to 
to Se the major item sets. Each S worked alone 
te Pe the major judgment task. Integers from 
ee ler for each item were entered in a specially 
aboy, red data booklet. The 11-point scale shown 
Wete dunn the last two sentences of the instructions 
data b uplicated at the top of each page of the 
a booklet. 
cag wat. Median and Ọ values were obtained for 
ated zat) using standard methods of analysis associ- 
dw. vith the equal appearing intervals procedure 
Other ards, 1957). On the basis of these values and 
Scales usual considerations (Edwards, 1957), three 
Select Containing 26 items each were constructed. 
Show td items, and their scale and Ọ values, are 
Own in Table 1. 


Survey Work 

Towns, Two towns were chosen for the survey 
work between which significant differences in public 
attitudes toward uses of reclaimed water were ex- 
pected. Town 1, an eastern suburb of San Francisco, 
had experienced the use of reclaimed water only for 
golf course irrigation, and this use had been under- 
way for some 2 yr. at the time of the survey. 
Town 2, an eastern suburb of San Diego, was 
roughly equal to Town 1 in size, socio-economic 
level, and type of dwelling unit. However, Town 2 
had experienced considerable reuse of reclaimed 
water for some 8 yr, including golf course irrigation, 
recreational lakes (boating, fishing), and swimming. 
Further, Town 2 was located in an arid area having 
a general scarcity of water. while Town 1 was located 
in a fertile agricultural valley having no general 
scarcity of water. 

Respondents. Twenty-five respondents were se- 
lected from each of the two towns by quota 
sampling. First, five square blocks were selected in 
order to be geographically representative of each 
town. Households were then contacted by direct 
visit until five individuals from each block agreed 
to participate in the survey. Not more than three 
women were selected from any one block. There 
were 10 male and 15 female respondents in Town 1, 
and 11 male and 14 female respondents in Town 2. 
Ages of the respondents ranged from 19 to 61 yr. 

Procedure. Interviewing was conducted in the 
respondent’s home at his convenience by the writer 
and two interviewers working under the auspices of 
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TABLE 1 
SELECTED ĪTEMS AND THEIR SCALE AND Q VALUES FoR Scares 1-3 
Item | Scale} @Q | Item 
no. |value | value ay 
Scale 1 
| 
1 | 10.57] 1.41 | Iam enthusiastic about the idea of drinking reclaimed water. 
2 | 10.27} 1.32 | Lam decidedly pleased with the idea of drinking reclaimed water. 
3 9.78 | 1.60 | I would be happy to drink reclaimed water. 
4 9.55 | 1.46 | I would be glad to drink reclaimed water. 
5 8.65 | 1.85 | Drinking reclaimed water is agreeable to me. 
6 8.28 | 1.88 | I would have no objections to drinking reclaimed water. 
a: 8.04 | 1.29 | Drinking reclaimed water seems like a reasonable idea to me. 
8 7.92 | 1.81 | I don’t mind drinking reclaimed water. 
9 7.70 | 1.22 | I feel that reclaimed water could be used for drinking. 
10 7.10 | 0.71 | I am slightly in favor of drinking reclaimed water. 
11 6.91 | 1.25 | I suppose it would be okay to drink reclaimed water. 
12 6.54 | 1.79 | While I would rather not drink reclaimed water, I think that I might be able to get used to 
the idea. 
13 6.20 | 1.01 | It wouldn’t make any difference to me if I were drinking reclaimed water or not. 
14 5.95 | 0.56 | Pm neither greatly in favor of nor greatly opposed to drinking reclaimed water, 
15 5.13 | 1.38 | I don’t know whether I could accept the idea of drinking reclaimed water or not. 
16 4.62 | 1.21 | I would be slightly disappointed if I had to drink reclaimed water. 
a te i ya a yy enthusiastic about the idea of drinking reclaimed water, 
1 7 ` would prefer not to drink reclaimed water. 
19 3.62 1.86 I'm not in sympathy with the idea of using reclaimed water for drinking. 
20 3.22 1.54 Drinking reclaimed water is a rather disagreeable idea to me. 
21 2.83 | 1.52 | Drinking reclaimed water is too radical an idea for me. 
22 2.44 | 1.88 | I think drinking reclaimed water is fundamentally unsound. 
23 2.00 | 1.08 It will be a sad day when we have to drink reclaimed water. 
Ae ee me The notion of actually drinking reclaimed water is ridiculous to me 
5 F .05 | The idea of actually drinking reclaimed water must be discarded i amediately 
26 1.17 | 0.71 | I would never drink reclaimed water, Y 
Scale 2 E d a a 
] ] — ee = 
1 | 10.44) 1.18 | Using reclaimed water for swimming i a i sbi ; 
2 |10.34| 1.24 | I am enthusiastic about using telained. ioe = tagot 
A ee ise I would be happy to use reclaimed water for swimming. i 
o as | Ls ean cane iA ema vie 
8. ` vould hav toblems swimming in reclaimed water. 
f po 1.52 Swimming in reclaimed water is satisfactory to me. 
4 Fas 1.50 | I see no reason why one could not swim in reclaimed water, 
Š i Ye Swimming in reclaimed water would be agreeable to me. 
Y he rel It wouldn’t bother me to swim in reclaimed water, 
: .97 | Lam somewhat in favor of using reclaimed water for swimmi 
11 | 6.97] 1.48 | T really can’t critici i n 
12 | 655| 163 | lanes ae icize the use of reclaimed water for swimming. 
A 4 not wild a i Amming i ; 
13 622| 1.08 | lewa bal about the idea of sw imming in reclaimed water, but I could get used to it- 
n’t make any diff . H g 
14 6.04] 0.54 | Tanna on sa i anes to me if I were to swim in reclaimed water. 
a a2 a greatly in r i . bape r 
16 Lu SA I have no feeling one aye DE Ges me gs ‘he ae water for swimming- 
J i ae ray v in recla; P 
i isr 106 es Bother me a little to swim in reclaimed water. aerma 
18 | 431| 117 nthe toe? tot sates the idea of swimming in reclaimed water. 
Ps ally swi ing i i ri Ly = 
19 | 3.71) 171 | I would never get enthusiastic abeat ene water makes me feel a little uncomfortable. 
20 3.43 | 1.66 | T wouldn't feel usiastic about swimming in reclaimed wter, 
21 | 3.17| 134 | Personally, T vould Taaa ang În reclaimed water. l 
2 AR nally, I wou d have problems swimming in reclaimed w 
z é 88 | I don’t need to swim bad enough to swim in reclaimed water. 
24 an T : sn mae what they say, I just couldn’t get "Had to ss i 
«l 17 would certainly hate ene Pathe: rimming ji i / k 
25 1.59 | 1.14 | I cant idk cera i Rn, ee Meanie’ 
26 | 139] 1.08 | Usi ; tung much more unpleasant than swimming į ; 
E d r $ meee n 7 
ng reclaimed water for Swimming is a terrible idea, Bi seeldimed watt: 


y 
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Table 1— (Continued) 


Item | Scale Q 


no. | value | value Item 


Scale 3 


re 12 Using reclaimed water for laundry is an advanced idea which I highly favor. 
3 116 1.29 | Tam enthusiastic about the idea of using reclaimed water for laundry. 
H oF 1.23 I am definitely pleased with the idea of using reclaimed water for laundry. 
5 he 1.76 | I would be very satisfied to use reclaimed water for laundry. 
x 9.09 | 1.21 | It would be a good idea to use reclaimed water for laundry. 
7 8.50 | 1.36 | Using reclaimed water for laundry is agreeable to me. 
$ 8.21 | 1.23 | Using reclaimed water for the laundry seems like a reasonable idea to me. 
5 8.00 1.09 | I feel that reclaimed water could be used for laundry. 
7.71 | 1.20 | I wouldn’t mind using reclaimed water for laundry. 


H Lar 0.99 | I am somewhat in favor of the idea of using reclaimed water for laundry. 
12 -70 | 1.82 Maybe I could get used to reclaimed water for laundry. 
6.65| 1.91 | W hile pam not wild about the idea of reclaimed water in general, maybe I could use it for 
laundry. 
A 6.01 0.53 The idea of using reclaimed water for the laundry doesn’t affect me one way or the other. 
15 5.98 0.53 | I’m neither greatly in favor of nor greatly opposed to using reclaimed water for laundry. 
16 5.50 | 1.28 | I don’t know whether I could accept the idea of using reclaimed water for laundry or not. 
4.87 | 0.67 | The idea of using reclaimed water for the laundry is a little more unfavorable than favorable 
to me. 
Hs 4.64] 1.19 | I would be slightly disappointed if I had to use reclaimed water for laundry. 
19 4.14 1.74 | I would feel somewhat concerned if I had to use reclaimed water for the laundry. 
20 3.79 | 1.54 | I would prefer not to use reclaimed water for laundry. 
21 3.43 | 1.44 | I have reasons why I would rather not use reclaimed water for the laundry. 
2 2.40] 1.40 | Ihave plenty of objections to the use of reclaimed water for laundry. 
2.38 | 1.66 | I don’t think reclaimed water would even be good enough for the laundry. 


I will be most unhappy if I have to use reclaimed water for my laundry. 


44 1.81 | 0.96 Using reclaimed water for laundry is very disagreeable to me. 
As far as I’m concerned, reclaimed water should definitely not be used for the laundry. 


y j 
26 1.12 | 0.62 | Reclaimed water for laundry disgusts me. 


the California State Department of Public Health. careful instruction to read all items first, and then 
© first phase of the survey was completed late in to mark only those items closest to the respondent’s 
5 z 1969. The interview form employed in this personal attitude. The same random item arrange- 
te ^se Of the work consisted of a series of questions ment was used for each scale. The order of scale 
Barding water resources matters and the three administration was randomized separately for each 


Scal 5 
S described herein, Scales were administered after respondent. 
The second phase of the survey was conducted late 


in July 1969. The 50 respondents were mailed letters 

TABLE 2 preceding the follow-up study thanking them for 
Mrans Anp STANDARD DEVIATIONS FOR THE their earlier participation and requesting their co- 
FIELD Survey DATA . operation in the second and final phase of the 

research. All 50 participated in the follow-up study. 


Replication 2 


Replication 1 
TABLE 3 


TEST-RETEST RELIABILITY COEFFICIENTS 
FOR THE THREE SCALES 


Scale | Scale = 


Scale Coeficients 
1 .82 
2 80 
3 79 
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TABLE 4 


ANALYSIS OF VARIANCE FOR THE 
FELD Survey Data 


Source | SS df MS F 
ee 77.00 1 77.00 3:65" 
SwG 1013.14 48 1.11 
oe 12.79 1 12.79 6.77** 
12 2,33 1 2.33 1.23 
2 X SwG 90.54 | 48 1.89 
ig 130.64 2 65.32 | 35.12*#* 
13 62 2 3i — 
3 X SwG 178.43 96 1.86 
23 39 2 .20 zn 
123 -59 2 .30 — 
23 X SwG 48.93 96 cou 


Note.—SwG = Ss within groups. 
p <.075. 
$ <.025. 
b< 


* 
ee 
head 001, 


Interviewing was performed in the respondent’s home 
by the two State Health interviewers. A different 


random order of item arrangement wa: 
in the follow- 


tration again 
dent. A few b: 
matters prece 


s employed 
up study, with order of scale adminis- 
randomized separately for each respon- 
rief questions regarding water resources 
ded scale administration, 


RESULTS 


The respondents score for each scale ad- 
ministration was taken as the median of all 
checked items. Means and standard deviations 
for each scale, replication, and town are 
shown in Table 2. Table 3 records the test- 
retest reliability coefficients for each scale. 
The coefficients of Table 3 are each based 
upon 50 pairs of scores, 

Statistical analysis of differences between 
the means of Table 2 posed two problems, 
The first involved the many possible pairwise 
comparisons and the second the legitimacy of 
comparisons between different scales. The first 
problem is well handled by an analysis of 
variance approach. Also, since the three scales 
were constructed by the same judgment pro- 
cedure on a common quantitative base giving 
highly similar, if not identical, Meaning to 
response scores on the affective continuum as 
here optionally defined, the decision was made 
to use analysis of variance as a defensible 


expedient to efficiently assess 
between all the means of Table 2. 
The selection of two towns, the use of 
three scales of common quantitative base, and 
the two replications made possible a three- 
factor analysis with repeated measures on the 
last two factors listed (Winer, 1962). The 
results of this analysis are shown in Table 4. 


differences 


Discussion 


The reliability coefficients of Table 3 show 
that individual scores obtained from the two 


problem than reli- 
ment. The lack of 
results of affective 
alidated seems to 
of some form of @ 


wn 2 respondents, and 
attitudes toward reuse of 


Support for both expectations was provided 


= 


Ç 


AFFECTIVE Response Toward Uses or RECLAIMED WATER 


by the present study as evidenced by results 
of the analysis of variance and the means of 
ag The fact that the F ratio for towns 
x quite meet the customary level re- 
e for significance is ameliorated by the 
“ne n size and by the consistency of ex- 
ed and observed mean differences between 

towns, 
Tel thg results of this study have shown 
Te Scales developed can perform ade- 
püblic a. field research with the general 
ot A herefore, the scales should be useful 
nie les designed to aid planning for 
Ses of reclaimed water and in theo- 


reti F m : 
fee Oriented research designed to assess 
a lons of the affective component of atti- 


ud i : 
a with beliefs, knowledge, and behavior 
Sarding use of reclaimed water. 
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STABILITY OF THE STRONG VOCATIONAL 
INTEREST BLANK FOR MEN’ 


CHARLES B. JOHANSSON * 
Macalester College 


AND 


DAVID P. CAMPBELL 
University of Minnesota 


Inspection of test-retest reliability studies for the Strong Vocational Interest 
Blank (SVIB) revealed that two factors were imporant when considering the 
stability of this instrument: Ss’ initial test ages and length of test-retest inter- 
vals. To further investigate these two factors, median test-retest correlation 
coefficients were computed for over 1,400 college educated males who had 
been tested with the SVIB at various ages and retested at varying intervals. 
Greater stability resulted the older the S at initial testing and the shorter 


the time lapse to retesting. Stability decreased as the test-retest interval length- 
ened or the initial test age of S decreased. 


In the field of interest measurement, nu- 
merous studies have been done attesting to the 
high test-retest reliability of the Strong Vo- 
cational Interest Blank (SVIB). Inspection 
of the reliability coefficients presented by 
Campbell (1966) indicates that two factors 
are important when considering the stability 
of the SVIB—age at first testing, and the 
length of time between test and retest. 

This study looks more closely at the influ- 


ence of these two factors on the test-retest 
stability of the SVIB. 


Subjects 


Item responses for the SVIB were available 
for six male groups. Each group was initially 
tested just prior to, during, or shortly after 
attending college, and retested at various in- 
tervals ranging from 2 wk. to 23 yr. 

A 2-wk. test-retest sample (N = 140) of 
students from an introductory psychology 
class at the University of Minnesota consisted 
mostly of sophomores, ages 19, 20, or 21 yr., 
from the College of Liberal Arts. 

A 3-yr. test-retest group (N = 334) con- 
sisted of students attending Harvard Univer- 
sity who were tested late in their freshmen 
year (1960 or 1961) and retested as gradu- 
ating seniors. These Ss were mostly 17, 18, or 
19 yr. old at initial testing. They form part of 
a longitudinal study currently being con- 


1 This article has been accepted for publication by 
the outgoing Editor, Dr. Kenneth E. Clark. 

* Requests for reprints should be sent to Charles 
Johansson, 301 Old Main, Macalester College, St. 
Paul, Minnesota 53101, 


ducted by Stanley King and Bruce Finnie at 
Harvard. 

An 8-yr. test-retest sample (N = 171) 
were graduates of the University of Minne- 
sota in several professional areas. They took 
the test initially as high school seniors (early 
1950s) and all were retested 4 yr, after col- 
lege by Vera Schletzer (1963). Initial test 
ages were 17 and 18 yr. and retest intervals 
were from 8 to 10 yr. 

The remaining three groups were tested and 
retested by E. K. Strong (1955). One group 
consisted of students tested while in graduate 
school at Stanford University in 1927 and 
retested in 1932 and again in 1949, Initial 


test ages were mostly 21, 22, and 23 yr. Test 
distribution was as follows: 


Taking all three tests 


N 
Taking only two tests 


N 


162 
94 


Total N = 256 


The second sample tested by Strong from 
Stanford University were college seniors 
tested in 1927 and retested three times: 1932, 


1939, and 1949. Test distributions were as 
follows: 


Taking all four tests 


l N = 143 
Taking three tests N= 82 
Taking only two tests N= 37 

Total N = 262 


The last group tested by Strong consisted of 
Stanford University students tested as fresh 


| 
| 


à 
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2 r 2 3 € 3 6 
r. i 


yr. yr. yr. yr. yt. yr 


33 | | | | 
| J | 
32 | E A 
- Ba 81.80 
20 | | St = 
29 | | 
ig Ki |.78 | | | 
g 2% la | 
z 25 7 Err e | — a 
a | -79 howd 73 | .72 
| i | 78 | | | 73 | .65 
N, 22 | 75 | | 69 70 | £63 
21 | 92 | | 71 | D | 64 | 63 | .66 
20 | 92 [77 | 7 [ fa] | |} ss | 
19 20 “80 70| | 75| iú 6| 69 o | -64 69 | 54 a 
18 82 co} | | | .65 | .66 68 | 60 | .64 | .70 
17 78 70 | | 64 | 56 64 | 63 67 67 Pl 
ig E z 9 10 11 12 18 19 20 21 22 23 


Retest Interval 


Fic. 1. Median test-retest correlation. 


Men į 
2 rhe 1930 and retested in 1931, 1939, and 
n + Test distributions were as follows: 


Taking all four tests N = 124 
zo iig three tests N= 92 
aking only two tests N = 67 


— 


Total WV = 283 


= ts Ove six groups contained over 1,400 
male most 3,800 tests. All tests were scored 
1966) PT for the SVIB (Campbell, 
Coefficie earson product-moment correlation 
and ae were computed between the test 
Next St for each of the 59 SVIB scales. 
> all test-retest correlation coefficients 
ued in terms of a bivariate distribu- 

test age versus retest interval. Figure 


S 


Were 
à} tion 


1 represents the median correlation, based on 
sample sizes greater than 20, as a function of 
test age and retest interval. 

For very short intervals (2 wk.) the test- 
retest reliability of the SVIB is about the 
same as the split-half reliability (rs in the 
low .90s). Test-retest intervals of 1-5 yr. indi- 
cate reliability coefficients in the high .70s for 
most test ages. Long intervals, 10 yr. and 
greater, show that the initial test age was the 
more important; the following coefficients 
would be expected for long test-retest inter- 
vals: .65 if tested during the late teens, .70 if 
tested in the early twenties, .75 if tested in the 
late twenties, and .80 if tested after age 30. 

Table 1 collapses the initial test ages and 
the test-retest intervals into a more readable 
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TABLE 1 correlations increase as one reads down the 

MEDIAN TEST-RETEST CORRELATIONS columns. Also, for a given age of S, the median 

reliability is lower the longer the interval be- 

Test-retest interval tween test and retest, that is, the correlations 

AR - - 7 decrease as one works across the table. 
| a 
2 wk. pee yr.{6-10 yr. wae oe REFERENCES 

| ee | 25 CAMPBELL, D. P. Manual for the Strong Vocational 

16-19 90 42 64 | 64 | 38 Interest Blank for Men and Women. Stanford: 
20-22 92 73 | 70 | 8 | .67 

a = E ‘81 | D | als Scuretzer, V. M. A study of the predictive effective- 


| ness of the Strong Vocational Interest Blank for 
job satisfaction. Unpublished doctoral dissertation, 


a University of Minnesota, 1963. 
form. The trend was quite clear that when  Srronc, E. K. Jr. Vocational interest 18 years after 


the test-retest interval is held constant, the college. Minneapolis: University of Minnesota 
median reliability is higher in direct relation Press, 1955. 
to the age of S, that is, the median test-retest 
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READABILITY OF TECHNICAL TRAINING MATERIALS 
PRESENTED ON MICROFICHE VERSUS OFFSET COPY * 


THOMAS S. BALDWIN anp LARRY J. BAILEY? 


Department of Vocational and Technical Education, University of Illinois 


The purpose of the study 
reliability and content va 


vas to develop psychometric instruments of known 
y for measuring an individual's ability to process 


each of several types of information through the visual modality, and to 
conduct an experimental study in which the instruments developed were used as 
the dependent variable. Twelve reading tests were constructed using each of 
four types of technical training materials. Three forms of each test master 
were reproduced: offset copy, positive image microfiche, and negative image 
microfiche. A control group received the offset copy form, and positive and 
negative image forms were administered to the two experimental groups. An 
analysis of variance revealed statistically significant differences in favor of 


offset copy for 3 of the 12 subtests. 


nan existing widespread use of microfilm 
wee growth of this method of 
Major a “n storage are well recognized. 
mediur reasons for the wide acceptance of this 
age y a the practical advantages of stor- 
NOt tez evel, and economy. A very real, but 
the ah ily apparent, problem has to do with 
the ee the medium itself has on 
vid ormation processing ability of indi- 
i uals who use microfilm as a major reference 
aterial, 

to A comprehensive review of literature related 
compl a applications of microfilm has been 
view ci by Field (1968). Following the re- 
recent a author concluded that “very Tittle 
P. 1 » definitive work has been published 

in the He No attempts have been reported 
tive die erature to compare positive and nega- 
same Crofiche with an offset copy form of the 
tion gee materials. A possible explana- 
or the lack of such studies is the neces- 


sit 5 
Y for conducting well-designed experimental 


1 The 

research i i 

du ch reported in this pap! 

fen ae the Department of Vocational and Tech- 
ducation, University of Illinois under Con- 

F33615-68-C-1522 with the Behavioral 


aboratory, Wright-Patterson Air Force 
he study re- 


erein was submitted by the authors as a 
ements 


PEOIEEL report in fulfillment of the requir 
his research contract. 
e oale has been accepted for publica 
2 fe Editor, Dr. Kenneth E. Clark. 
lley sche for reprints should be sent to Larry J. 
is now Assistant Professor, Technical 
is Uni- 


er was con- 


ase, Ohi 

Ported te An expanded version of t 
nal 
of th 


tion by 


alley, who 


Versig sta Education, Southern Illino 
» Carbondale, Tllinois 62901. 
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investigations which employ adequate psycho- 
metric techniques. 

The purposes of this study were to: (a) 
develop psychometric instruments of known 
reliability and content validity for measuring 
an individual’s ability to process each of sev- 
eral types of information through the visual 
modality, and (b) conduct an experimental 
study in which the instruments developed 
were used as the dependent variable. The 
study was designed to determine the influence 
of each of three methods of presentation (in- 


dependent variable) on information processing 


ability. 
The null hypothesis states that ability of 


readers to assimilate and utilize information 
contained in technical training materials is the 
e for each of the following methods of 
presentation: (a) material presented as black 
on white offset copy, (b) material presented 
via microfiche with positive image, and (c) 
material presented via microfiche with nega- 
tive image. From the null hypothesis, two 
research hypotheses are derived. First, reader 
performance using black and white offset copy 
will be significantly better than performance 
on either positive or negative microfiche. The 
second research hypothesis is nondirectional 
and states that reader performance will not 
be significantly different for positive versus 
negative microfiche. 


sami 


METHOD 


In order to test the hypothesis of no significant 
differences in the ability of readers to assimilate and 
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utilize information contained in technical = 

materials presented by three ee ERT S 
zi ometric 

was necessary to develop psychom ner 

which were representative of Air Force training 

materials. 


Development of Psychometric Instruments 


sy etric instruments were developed to evalu- 
Be ee to perform reading tasks utilizing each 
of four types of technical training materials: (a) 
materials requiring the reading of continuous prose 
and short paragraphs; (b) material requiring the 
identiñcation of and discrimination among precise 
symbols and figures; (c) material utilizing the skills 
employed in reading charts, tables, graphs, maps, and 
isometric and orthographic drawings; and (d) ma- 
terial requiring the skills utilized in reading sche- 
matics, block diagrams, wiring diagrams, flow charts, 
and nomenclature verification. Following is an elab- 
oration of the four categories with a description of 
the representative tasks for each type of material.3 


Type 1 


Narrative-continuous prose (Tests 1 and 2). Ma- 
terial for this section was selected from Air Force 
student study guides concerning the subject of sur- 
vival training. The content was rewritten as con- 
tinuous, uninterrupted prose by eliminating figures, 
headings, questions, etc. After the text had been 
edited and typed, the manuscript was modified by 
deleting three words on each page and replacing the 
words with blanks. Deleted words were selected 
subjectively on the basis of their relevance to con- 
text and assumed ease of identification. The task is 
to read the material and supply the missing words 
(Part 1-50 min., Part 2-25 min.). 

Narrative-short paragraph (Test 3). The same con- 
tent material was used for this part as was used for 
the continuous prose exercises. Two and three sen- 
tence paragraphs were isolated from the text. The 
short paragraphs were then modified by changing a 
word in each paragraph. This technique is similar to 
that used for the Chapman-Cook Speed of Reading 
Test (Chapman, 1924). The Ss are to read the para- 
graph and identify the word which is inconsistent 
with the meaning of the paragraph (25 min.). 


Type 2 


Figure identification (Test 4 
checks which of five numbered geometrical figures or 


pictures in a row is identical to the given figure at 
the left end of the row (5 min.). 


). For each item, S 


2 Several of the test instruments utilized for this 
research were reproduced or adapted from the Kit 
of Reference Tests for Cognitive Factors, copyrighted 
by Educational Testing Service, Princeton, New 
Jersey. The reference kit was developed under 
Office of Naval Research Contract Nonr-2214 (00), 
Project Designation NR 151-174. Reproduction of 
these test instruments is permitted by and for 
United States Government use, 
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Length estimation (Test 5). Each item consists of 
two parts. Three curved or angular lines are drawn 
between these two points, The task of the examinee 
is to select the shortest of these lines (3 min.). : 

Symbol translation (Test 6). This is a test of Ss 
ability to translate symbols into alphabetic charac- 
ters. A legend illustrates alphabetic letters and their 
corresponding symbols (typewriter characters). The 
test items consist of short series of symbols. The task 
is to identify the alphabetic letter that corresponds 
to each of the given symbols (10 min.). 


Type 3 


Graphs (Test 7). Speed and accuracy in reading a 
graph is measured by this test. Minutes are marked 
on the horizontal scale of the graph and tempera- 
tures are shown on the vertical scale. Four curves 
plot the relationship of temperature to time under 
certain experimental conditions, For each item, the 
minutes are given. The task is to find the tempera- 
ture corresponding to the given number of minutes 
for each of the four curves (7 min.). 

Tables (Test 8). This is a measure of the ability to 
locate and record values found on a table, A table is 
presented which contains drill bit sizes and their 
corresponding decimal equivalents. The task is to 
refer to the table and furnish the decimal equivalent 
for each of the given drill sizes (4 min.). 

Mechanical drawing (Test 9). A pictorial drawing 
(three dimensional) of an object is presented along 
with three separate views of the object. Each surface 
of the pictorial is represented by an alphabetic 
letter. Lines and surfaces on the two dimensional 
views are represented by numbers. For each lettered 
surface in the pictorial view, S is to find the corre- 
sponding numbered line or surface in each of the 
three separate views (5 min.), 


Type 4 


Schematics (Test 10). Each 


i item consists of @ 
network of lines as in 


; an electrical-circuit diagram 
having Many intersecting and intermeshing wires 
with several sets of terminals. The task is to trace 
the lines and to determine for which pair of termi- 
nals, marked S (start) and F (finish), there is 4 
a circuit through a circle at the top qs 
min.). 


Flow diagrams (Test 11). Th 


d s e examinee sees dia- 
grammatic sections 


(Test 12), The S inspects 
mbers and indicates whethe! 
ach pair are the same or dif- 


k 


a 
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READABILITY OF MICROFICHE vs. Orrset Copy 


E e fie nea ary fiee validity to represent Air 
wee a N, Preliminary forms of all instruments 
on a ministered to a sample of students drawn 
testine Da population as those used for the final 
career m pretesting provided data from which to 
Time fe he final forms of the test instruments. 
ae ieee = for each test were established that were 

ast two standard deviations above the mean 


time f $ 
data, or completion, as computed from the pretest 


Test Forms and Equipment 


ee of each test was developed from which 
fon o i orms were reproduced. Following comple- 
and the bl foot tests, Itek plates were produced 
same i ae on white offset copy was printed. The 
tive ane oe tts were then used to film the posi- 
te mite oe image microfiche forms according 
the Ma y Speciheations Microfiche readers used for 
were Ped wee selected by the contract monitor and 
use th ned to be representative of the readers in 
roughout the Air Force. 


Reliability Procedure 


acer anal ifarma of all test instruments were char- 
reliability o primarily speed tests. Therefore, the 
retest 4 of each test was determined by a test- 
on Me Half of each test was administered 
copy oe days to a sample of 42 Ss, The ofiset 

fees only, was used for the reliability testing. 
of respo Were scored by counting the correct number 
computed. 1 for each exercise. After reliabilities were 
Was us a the Spearman-Brown prophecy formula 
test jenn to estimate the reliabilities for the total 

gth. 
able 1 presents reliabilities for the 12 testing 
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instruments. It can be seen that reliabilities are gen- 
erally quite high with the exception of Test 10, fow 
diagrams, which has a reliability of only .59. It 
would be necessary to increase this test to 2.78 times 
its present length to achieve a reliability of .80. 
Reliabilities for the remaining 11 tests range from 
.76 to .93, all acceptable for use in studying compari- 
sons between groups of individuals. 


Selection of Experimental Subjects 


The sample for the experimental study was se- 
lected from students in Personnel Awaiting Training 
Status (PATS) at Chanute Air Force Base, Illinois. 
The total sample was drawn from the population of 
PATS by a quasi-random procedure. Each Com- 
mander of 13 student squadrons was asked to ran- 
domly select $ students per day for testing through- 
out the duration of the study. The Ss were then 
randomly assigned to one of three sample groups. A 
total of 133 Ss were tested according to the follow- 
ing assignments: 43-positive image sample; 45- 
negative image sample; and 45-offset copy sample. 
To test the hypothesis that samples were equivalent 
with respect to general intellectual ability, an analy- 
sis of variance of Armed Forces Qualification Test 
(AFQT) scores was computed. The AFQT was de- 
signed to evaluate enlistees or draftees prior to their 
entering the Armed Forces and is a composite index 
of general learning ability. Results of the analysis of 
variance showed that there were no significant dif- 
ferences among the three groups in terms of AFQT 
= 692, df= 2/130, p> .05). 


scores (F 
Test Procedures 


The total test battery required approximately 4 hr. 


to administer. Project personnel administered all 


TABLE 1 
— Test MEANS AND STANDARD DEVIATIONS : 
ies Offset copy Positive Negative 
Subtest =| m g 
xa SD rea Xe SD Xa SD 
1. Narrative (50 min.) 37 15.2 92 38 19.1 40 an 
- Narrative (25 min.) 19 7.5 .85 19 8.7 20 11.2 
pri; Narrative—short paragraph 27 11 8 4 A ma z E 
5 “gure identification 61 12.5 5 a . 2 . 
>. Length estimation 30 6.0 90 28 8.8 27 6.2 
6. Symbol translation 141 28.4 79 114 28.3 117 30.7 
7. Graphs k 34 10.8 93 28 9.6 29 9.9 
8. Tables 17 5.7 76 16 5.7 15 6.0 
- Mechanical drawing 27 16.4 90 33 18.3 36 20.4 
10 Mow Ginmiame ing 19 5.5 59 18 4.1 20 5.1 
i chematics 10 5.7 88 11 5.7 12 6.4 
2. Number verification 30 6.9 ot 29 6.9 29 73 
N = 42. 


st, corrected to full length using Spearman” 


Brown prophecy formula. 
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instruments using standardized test Procedures and 
instructions. To counterbalance the effect of fatigue, 
the instruments were administered in three different 
sequences within each group. In addition, Ss were 
permitted to take a 10-min. break at the end of the 
first hour, 20 min. at the end of the second hour, and 
an additional 10 min. following the third hour of 
testing. 


RESULTS 


To determine the accuracy of the a priori 
groupings of the tests into the four categories 
established at the beginning of the study, the 
intercorrelations among the 12 tests were 
submitted to a principal axis factor analysis 
and extracted factors were rotated using a 
Varimax rotation. By comparing the com- 
munalities obtained from the factor analysis 
to the reliabilities for each of the 12 tests, it 
was evident that a number of tests in the bat- 
tery contained considerabl 
For these reasons, 
variance was comput 
than for test scores 


e specific variance. 
a separate analysis of 
ed for each test rather 
combined according to 
either the a priori categories or the empirical 
categories determined by the factor structure. 

Means and standard deviations for each 
group on each test are Presented in Table 1. 


the ability of readers 
ze information con- 


.48, ; and at the .01 level 
for Test 6, sym ion (F = 11.29, df 
). For these three tests the research 
hypothesis, which states that the method of 


Presentation does affect the readabili 
ial. į abilit 
material, is accepteq y of the 


Since the null 
Tests 4, 6, and 
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than for negative microfiche. One-tailed tests 
of significance were used in testing differences 
between offset copy and microfiche, since it 
was hypothesized that any advantage in read- 
ability should favor offset copy. While there 
was some reason to believe that positive image 
microfiche would be more readable than nega- 
tive image microfiche, the evidence was not 
considered to be sufficient to warrant a direc- 
tional hypothesis. Hence, a two-tailed test of 
significance was used. 

The ¢ tests for the significance of the dif- 
ferences between the means presented in Table 
1 indicate that the mean score for offset copy 
is significantly larger than the mean for either 
Positive or negative microfiche for Test 4, 
figure identification: Test 6, symbol transla- 
tion; and Test 7, graphs. In each case, the 
difference was significant at the .02 level. 
None of the differences between means for 


Positive and negative microfiche were signifi- 
cant, 


SUMMARY AND Co. 


The Primary purpose of this research was 
to determine the inf 


sented on microf 
Force Ss to read S 


NCLUSIONS 


ng materials. A search of 
aining materials was con- 


ere administered to three 
Air Force Ss, One group 
erial presented on offset copy: 
8roup received the identical ma- 
Positive image microfiche, and the 


3 SToup received the material on negative 
Image microfiche, 


Statistical anal 


ing 9 tests tl 


| 
| 


à 
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12 tests were mean scores for positive image 
microfiche significantly different from mean 
Scores for negative image microfiche. 


Implications for Further Research 


This study dealt with one problem bearing 
on the feasability of using microfilmed ma- 
terials for instructional purposes. Further re- 
search should be directed toward two prob- 
lem areas: (a) investigations of methods for 
Increasing the reading efficiency of microfiche 
materials and equipment, and (b) investigat- 
mg applications of microfiche materials for 
education and training purposes. Specifically, 
further research is needed to determine the in- 
fluence of the following variables on efficiency 
of reading: image sharpness, ambient lighting, 
Screen image illumination, surround pattern of 
image, screen glare, screen angle, color of 
image, type size, and type form. Also, com- 
Parative studies should be made among dif- 
ferent types of microfiche readers to identify 
hardware characteristics that contribute to 
reading efficiency. Research should also be di- 


rected toward determining the relationship of 
Ss’ preference for film polarity to reading effi- 
ciency and reader satisfaction. An informal 
observation during the experimental testing 
suggested that Ss reading the microfiche test 
forms were more highly motivated than Ss 
reading the offset copy. This suggests a study 
covering a significantly greater length of 
time, during which S is required to use micro- 
fiche, to investigate the level and stability of 
motivation. Finally, research is needed to 
compare the learning rates of students using 
microfilmed materials with students using 
materials reproduced on offset copy. 
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SIMULATION STUDY OF COMPETITION 
IN AN “OPEN WORLD”? 


JOHN L. KENNEDY ? 
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Five three-man teams competed in a business game in an eain E 
behavioral access to the simultaneous behavior of a competing team by means 
of closed-circuit TV and sound could be purchased. Switching to other teams 
made possible nearly continuous surveillance of all the competitors. Prelimi- 
nary results describe the various adaptations of the teams to this unusual 
technological capability. The article also contains speculations on the implica- 
tions of such research for the worlds of business and international relations. 


This article describes a preliminary study or 
exploration of the consequences of making 
available low-cost “behavioral access” as a 
regular condition of competition. Behavioral 
access, in this case, consisted of a closed- 
circuit TV picture and associated sound of the 
simultaneous behavior of a selected competi- 
tor. Only one other team could be observed 
at a given time, but the observing team could, 
for a fee, switch from one to another team at 
any time. The competing entities were five 
three-man teams of undergraduate students in 
a course entitled “Human Problems in Indus- 
try,” the 3-hr. laboratory period of which was 
devoted to team competition in a “business 
game,” the Research and Development Game. 
Twelve weekly 3-hr. sessions were conducted 
during the fall term of 1962-63. At the end 
of the first six sessions, a winner (the team 
that made the most fictitious money) was 
declared and the game was played a second 
time starting with the initial conditions of 
financial equality. 

Several important concepts have been men- 
tioned and need elaboration at this point. The 
first is behavioral access. We wish to differen- 
tiate this concept from the concept of inspec- 
tion. Behavioral access means access to the 
actual process of making plans, reaching deci- 
sions, and taking actions. It is obtained by 
unobtrusively getting inside the on-going be- 


1 This article has been accepted for publication by 
the outgoing Editor, Dr. Kenneth E. Clari $ 

* Requests for reprints should be sent to the 
author, Department of Management, University of 
Southern California, University Park, Los Angeles 
California 90007. à 
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havior of a group or team, in this case by 
fixed-installation television camera and micro- 
phone. Inspection, on the other hand, is usu- 
ally confined to the notion of counting identi- 
fiable products, such as number of missile 
bases, airfields, aircraft by type, etc. The in- 
ference of intent from inspection of products 
is a risky business—the same product may be 
planned for and utilized for a variety of pur- 
poses. Access to the plans and strategies re- 
sulting from the differing ways information is 
processed may be a more powerful tool for es- 
timating intent, and, when used in a mutual 
or interacting fashion, may create a new form 
of competition, which is based upon compara- 
tive productivity rather than secrecy. Behav- 
ioral access is a tool for mutual education. It 
sets a lower limit on competition. Tt represents 
a minimum amount of cooperation to establish 
an arena of international, cultural, or business 
competition which might exhibit self-correc- 
tive properties, 
eae important concept is that of the 
r y simulation study. We shal] mean 
y be the process of studying the be- 
people in a real-world environment 


by creating an operating model of the real- 
world reference envi 


; The study reported het® 
did not require a co dy reporte! 


mputer, 


ey 
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The third concept is that of the team. By 
team, we shall mean a group of people who 
expect to work together toward a common goal 
for an extended period of time. We view our 
three-man teams as miniature organizations. 
They were self-organizing in the sense that, 
although Es assembled the teams on the basis 
of certain criteria, the team members them- 
selves determined how they should organize to 
cope with the environment. 


Laboratory Environment 


; The study reported here is part of a con- 
tinuing program of research on cooperation 
and competition with simulation methods. It 
was conducted in a special laboratory which 
was designed and instrumented to gain access 
to the internal processes of competing teams 
of people. The principal tools were a closed- 
circuit television network, a voice-listening and 
recording network, a telephone switchboard 
and associated telephone and intercom net- 
work, and a variety of human observers. This 
study involved arranging the closed-circuit 
television and sound network so that a team 
could request picture and sound of one of the 
other four teams to be displayed on a monitor 
in its own game room and to have the associ- 
ated verbal behavior of the observed team 
switched to earphones in its game room. This 
was accomplished by calling a central switch- 
board, at which the telephone operator 
switched the appropriate picture and sound as 
requested. A nominal charge of $1,000 was as- 
sessed against the accumulated earnings of the 
requesting team whenever a request was proc- 
essed by the telephone operator. The charge 
Was nominal since a team could make a profit 
of up to around $500,000 per session, o 
pending upon how it fared in winning Con- 
tracts through competitive bidding. 


. A 
Task Environment 


The Research and Development Game was 
played by five three-man teams, which com- 
peted to earn the most money by winning con- 
tracts. In each of twelve 3-hr. periods, the 
teams bid on contracts awarded for products 
consisting of 5-letter word-pairs. The research 
phase of the research and development process 
was introduced by converting the first word 
(e.g., judge) into the second word (e.g, final) 


according to a set of rules specifying legal and | 
illegal intermediate five-letter words and the 
costs of the conversion processes. The total 
cost of the conversion processes thus became 
the manufacturing cost of the item, and the 
team devising the least expensive solution was 
in the most advantageous position for bidding 
on production contracts involving that item. 
The team would then arrive at a bid by add- 
ing to its manufacturing cost the profit per 
unit it hoped to make. A ceiling of twice the 
manufacturing cost was imposed to eliminate 
the fantastic profit resulting from a situation 
in which a team knew that it was the only 
bidder. Contracts were awarded to the lowest 
bidder. 

At the beginning of each game year (a year 
was played in one 3-hr. laboratory period), 
all the teams received an identical annual 
contracts list of products (word-pairs) for 
which contracts were awarded during the next 
12 mo. The list represented the possible prod- 
ucts the team could manufacture in its hypo- 
thetical factory if it was successful in win- 
ning a bid. Each team was given an identical 
product specification sheet for each word-pair, 
listing one possible word-chain and its cost of 
production. A team could elect to bid on a 
contract for this product using the given pro- 
duction cost, or it could elect to try to find a 
ve solution to the word-pair. The 
products of the different teams were assumed 
to be of equal quality and different only in the 
cost of production. 

Winning a contract scheduled a team’s 
hypothetical factory for a 3-mo. period of 
compressed game time and there were limits 
as to how far in advance a team could schedule 
its factory. Since the number and size of con- 
tracts was not enough to keep all factories 
fully occupied during a game year, a limited 
number of fixed-fee contracts were available. 
The award of a fixed-fee contract gave the 
team a modest profit, but tied up the factory 
for a 4-mo. period. These contracts were 
awarded on a first-come, first-served basis to 
requesting teams until the sẹpply for the 
year was used up. 

All teams were provided with telephones, by 
means of which they could submit bids, re- 
quest information about the bids or scheduling 
status of other teams, or request TV and 


less expensi 
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sound access to another team. All telephone 
calls were charged at the rate of $1,000 per 
call. These charges, when added to a fixed 
yearly overhead cost, represented the yearly 
costs of operations, to be offset by profits from 
taking fixed-fee contracts or successful bidding 
on competitive contracts. Starting the game 
with no capital, the teams’ profits and losses 
accumulated over each year until the game 
was completed. Thus, a performance ranking 
was available in dollars year by year, and in 
terms of net accumulated gain or loss. 

The problem faced by the team was a 
rather complex one because of the many inter- 
actions. It was essentially the problem of opti- 
mal allocation of limited resources (time, skill, 
effort) over the full range of information 
processing and decision making in a rapidly 
changing environment, In addition, since these 
were self-organizing teams, they confronted 
the problem of devising an effective internal 


organization for utilizing their human re- 
sources, 


RESULTS 
This article concentrates 
quences of opening up a microcosm of typical 
business competition by means of providing 
a technology for gaining mutual behavioral 
access. The preliminary results of this study 


are presented by attempting to answer a 
series of questions, 


on the conse- 


few sessions of the first ga 


d one team, 


mer, ignored this access com- 
the second 


poorest perfor 


teams, all the teams 


developed antiaccess strategies for keeping 
what they considered to be important infor- 
mation from their competitors. In some cases, 
rather elaborate codes were designed to ex- 
change information within the team. A com- 
mon device was to speak in a whisper on the 
telephone when sending in a bid or to disap- 
pear under the table with the telephone, to 
get away from the microphone. Noisemakers 
were used to mask out communications at 
crucial times. A second kind of antiaccess 
strategy consisted of holding team meetings 
outside the laboratory periods to permit the 
discussion of plans, strategies, and tactics in a 
place secure from the technological spy. The 
teams varied in their reaction to mutual ac- 
cess, ranging all the way from excessive con- 
cern with secrecy on the part of the teams in 
the middle range of performance, to less con- 


cern on the part of the winning team and the 
consistently losin 


ing team decided 


rying to obtain 
hains, The con- 
first game devel- 
of utilizing their substan- 


casting the improved 
n the expectation of 
her teams when that 
ding again, In sum- 
al behavioral access 
ved as an intolerable 
to conduct typical 
he attitudes toward 
Ms seemed to assume 
plans, strategies, and 
treatment, the teams 
hether they were be- 
) n the two-way case 
described above) and his uncertainty un- 


c the general level of 
paranoid behavior exhibited, 


k Se, for exam le, on T 
tape of the behavioral data onld remit mo 
tual access with delay, 


Š A progressively short! 
delay might be utilized in a learning design 8$ 
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the participants progressively adapted to the 
condition, We propose to conduct investiga- 
tions on the consequences of making available 
mutual behavioral access to the previous ses- 
sion during a given period of operations and to 
search for other variables which might lead to 
a progressive elimination of the strategies of 
secrecy, 

4. What are some possible implications of 
experimentation on mutual behavioral access? 
It is possible that mutual behavioral access, 
as a condition for organizational, national, 
and international development, may become 
a powerful tool for finding the minimum level 
of cooperation required to stabilize competi- 
tion during periods of explosive cultural 
change. The technology for obtaining, stor- 


ing, and disseminating the essential behavioral 
data is now well developed and should be ap- 
plied to a variety of competitive situations. 
In particular, for example, we could start to 
store high-level planning and decision-making 
meetings of our national government elite and 
encourage other nations to do so. With these 
essential behavioral documents in storage, we 
could engage in negotiations for exchange, 
with whatever delay would be mutually 
agreeable. Mutual behavioral access and the 
feedback from observing how plans and strate- 
gies are followed thus become the basis for the 
gradual development of a minimal level of 
trust and understanding between competing 
entities. 
(Received November 5, 1969) 
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i e BETWEEN TEAM COMPOSITION AND FINANCIAL 
PERFORMANCE IN BUSINESS GAMES? 
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Sixteen 3-5-man self-organizing teams, composed to vary on a dimension of 
conceptual complexity of the individual members, competed in a laboratory- 
based business game. Competitive performance was measured in terms of 
profit or loss accumulated over a 12-wk. period of operations. Variation in 


team size (3-5 members) was not related to performance. T] 
sition variable, conceptual complexity, as measured b: 
Test, was significantly related to financial performa 


A previous article (Kennedy, 1962) de- 
scribed several laboratory studies of miniature 
organizations ranging in size from 30-40 
members (Air Defense crews) to three-man 
management teams. These interrelated sys- 
tems of people may be referred to as organi- 
zations or teams as opposed to groups, be- 
cause they worked together on a common 
task over many sessions and achieved a rela- 
tively stable structure through the processes 
of organizational development. 

The study of development of the manage- 
ment teams was made possible by offering a 
regular undergraduate course, “Human Prob- 
lems in Industry,” the laboratory portion of 
which consisted of weekly periods of competi- 
tion between teams in a simulated business 
environment or game during a 3-mo. organiza- 
tional life. The present article reports the 
results of two such studies, the first conducted 
during the fall term, 1964, and the second dur- 
ing the fall term, 1965, in which the compo- 
sition of the miniature organizations was 
varied systematically and the effects of the 
several types of compositions were evaluated 


on the basis of team Profit or loss as the 
measure of performance, 


1 This article has been accepted for publication by 
the outgoing Editor, Dr. Kenneth E, Clark. 

* Requests for reprints should be sent to the au- 
thor, Department of Management, University of 


Southern California, University Park, Los A 
California 90007, on 


he primary compo- 
y a Paragraph Completion 
nce. 


Team Composition Variables 


The teams in both years were composed of 
Princeton students, 35 in 1964 and 26° in 
1965. The three composition variables were: 
(a) score on a Paragraph Completion Test 
(Schroder, Driver, & Streufert, 1967), a pro- 
jective test of cognitive complexity; (b) num- 
ber of participants in each team, and (c) col- 
lege class. 

The variable of cognitive com 
measure of the level of maturi 
information Processing capabilit 
graph Completion T 
0-14, which purpor 
vidual’s characteristi 
uli, that is, how he d 
uli to create classe 
how he 


plexity is one 
ty of human 
y. The Para- 
est yields a score, from 
ts to measure the indi- 
c ways of perceiving stim- 
ifferentiates between stim- 
t S of people and objects, and 
integrates these experiences into 
der concepts. At the low end of the 
scale will be found the people with “black- 
white, I like-I don’t like” kinds of differen- 
tiations and simple integrative concepts. At 
the high end of the scale are the more sophis- 
ticated differentiators and 


those whose atti- 
tudes and concepts are mo 


Eee re complex, Table 
1 shows the distribution of scores of partici- 


pants in the 1964 and 1965 studies. 
As indicated in Table 1, the distributions 


from this older group. 
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TEAM COMPOSITION AND FINANCIAL PERFORMAN 


were arbitrarily dichotomized (between Scores 
5 and 6) into abstract individuals and con- 
crete individuals and teams were composed by 
varying the percentage of abstract members. 
The eight 1964 teams consisted of three five- 
man teams and five four-man teams; the 
eight 1965 teams consisted of two four-man 
teams and six three-man teams. The variable 
of college class (a rough measure of chrono- 
logical age and maturity) was used to attempt 
to put at least one senior on each team to help 
the team solve the leadership problem; the 
annie was also used to distribute team mem- 
Pership over the college classes as evenly as 
Possible. Since there were 11 seniors, 13 
Juniors, 7 sophomores, and 4 freshmen in the 
1964 class, and 6 seniors, 6 juniors, 5 sopho- 
Mores, and 4 freshmen in the 1965 class, ex- 
actly even balancing could not be achieved. 

, Table 2 describes the compositions achieved 
In 1964 and shows the financial performance 
of the teams in terms of profit or loss over the 
first five 2-hr. periods of game operations. 
The data on the eight teams are presented in 
two groups or sections of four teams (orange 


TABLE 1 
DISTRIBUTIONS OF SCORES ON 
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TABLE 2 


COMPOSITION AND FINANCIAL 
PERFORMANCE—1964 TEAMS 


% ab- | Profit or loss 
ats: i loss |Profit or loss 
Team | y | College | stract | “for first 5 | for 11 ses- 
a bees sessions sions 
Black section 
€ fs 1 60 1,945,674 6,244,525 
2 
1 
1 
D 4 1 75 —3,244,232 2,998,329 
1 
1 
1 
A tji o | =3,939,550 | —9,967,752 
1 
2 
B 5 2 0 —5,169,864 | —5,140,701 
1 
1 
Orange section 
| 
N | 4 1 Sr. 75 | 2,623,681 | 689,620 
2 Jr. 
| 1 Fresh 
s a | 3Sr. o | 4,523,507 | —6,168,780 
1 Soph. 
R 5 | 1Sr. 0 4,585,509 | —6,270,209 
Lih 
1 Soph. 
1 Fresh. 
M Jajish 25 | —10,401,654 | —8,354,613 
oon 
1 Soph. 
Note.—Si senior, Jr. = junior, Soph. = sophomore, 


PARAGRAPH COMPLETION TEST 
a 
Paragraph p 
ompletion 1964 1965 ; 
est score participants participants 
Concrete 
0 
1 
2 2 
s 5 
7 10 5 
8 10 6 
Abstract 
6 9 
6 
T 3 2 
8 2 2 
+ 1 
10 
11 
12 
13 
14 


shman. 


Fresh. = 


section and black section) since only four 
teams played at any one time. Table 3 pre- 
sents the same information for the first five 
sessions of the 1965 teams. After the first five 
sessions of the 1965 game, some personnel 
were exchanged between teams so that the 11- 
session data of 1965 teams are not comparable 


to similar data of 1964 teams. 


Synthetic Environments 


The present studies were carried out in the 
Social Psychology laboratory of the Depart- 
ment of Psychology at Princeton University. 
A section of four teams played a business 
game‘ for a 2-hr. period, once a week, while 


4 DATA was the code name for the 1965 business 
game and BIG was the 1964 game. The major dif- 
ference between these games lay in the size of the 
initial resources made available to each team: 
$8,000,000 for DATA teams, and $4,000,000 for BIG 


the members of the other section observed and 
listened to the verbal interaction through one- 
way mirrors, On a later afternoon in the 
week, the first observing section played and 


the initial Playing section observed, 
At the first meeting of the course, each par- 


n a copy of the player’s man- 
sting with the 


teams. These amounts were subtracted from the net 
accumulated gain or loss of the teams to Produce 
the more comparable profit and Joss figures of the 
criterion Measure of performance, 
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TABLE 4 
nea M Size RELATED TO 
SITION AND FINANCIAL SCIAL PERFORMANCE 
eo ee TEAMS NAY AES nn 
Average 
| T ab- Profit or loss profit or 
x 5 Average aior 1i 
College: | steet] ristis z Average „| loss for 
Team | N däs mem- a h No. off aro, of ees profit or loss | cessions of 
| bers MEM aang ake in first five | BIG & 5 ses- 
ies = Sessions® sions of 
Black section | DATA’ 
x 5 500 
; 33 | 22,031,000 3 6 2.83 3,624,523 3,625,500 
TEDE rag 4 | 7 | 214 | 552084 4,936,485 
1 Fresk. 5 3 | 267 | —2603,233 | —1'722128 
| = — 
f 00 | — 2,004,400 - 
a ot eau ; 7 è See Tables 2 and 3. 
D 3 | 1Adult 33 | —10,819,000 
i X 5 with the game. Thereafter, each section played 
Fresi, | 
3 is 33 | —23,974,260 &2-hr. game “year” per week, 
B À 23,974, 
RESULTS 
Orange section , ; 
The principal results of these studies 
M 4 | 25r. 100 57,363,000 exhibited in Tables 2 and 3. By reference : 
2 Jr. | the composition and performance data con 
R 3 | isr 0 28,804,400 tained in these tables, it is possible to 2 
1 Jr. termine the consequences of financial pe 
1'Sopli. formance of (a) varying the size of the tn 
N 3} ET 67 20,487,400 and (b) varying the percentage of abstrac 
eae members of the teams. 
1 Fresh. 
s $ |i isk O | ~14,782,400 Team Size 
2 Soph. š le i 
E - Table 4 shows the average section rank in 
aA Sorot = senior, Jr, = junior, Soph, = sophomore, 
resh, = Teshman, 


profit or loss and the average profit or loss for 


the first five sessions for teams with threes 
four, and five members, 


TABLE 5 


Cocnirive Comprexrry RELATED 
TO TEAM PERFORMANCE 


Se 
Average 
rofit or 
Average ike for 1 
Composition | No. of Profit or loss sessions © 
teams in first 5 BIG and 
sessions sessions 0 
DATA’ 
Above 50%, 
abstract 12 
members 6 10,681,908 14,296,4 
Below 50% aa 
abstract 4 
members 10 ~2,736,043 —3,464,23 
= ae 


eee, 
a See Tables 2 and 3, 


| 


? 


| 


af 


Team Composition AND FINANCIAL PERFORMANCE 49 


A Kruskal-Wallis one-way analysis of vari- 
ance test (Siegel, 1956) applied to the data of 
Table 4 recommends accepting the null hy- 
Pothesis of no difference in financial perform- 
ance related to team size. 


Percentage of Abstract Members 


eon 4 summarizes the financial perform- 
b ce of the teams when the composition varia- 
le of bercentage of abstract members is used. 
reed _Kolmogorov-Smirnov test (Siegel, 
sina Ps ields a nonsignificant chi-square be- 
fete 6 e performance distributions of differ- 
Sane paged teams after the first five ses- 
els ut a chi-square significant at the -5 
conti etween the distributions obtained by 
ning the full 11-session performance of 
teams and first five sessions of DATA 
teams, 
At the end of the first five sessions in both 
and DATA, a winner was declared and 
Se teams were returned to initial conditions to 
hid a second game. In DATA, changes in 
fore bom of some of the teams were made be- 
fs € start of the second game by exchang- 
Soa ibers between teams. Thus, it is not 
e ble to compare the 11-session perform- 
in as BIG with the I1-session performance 
fa But it is possible to compare the 
ha ‘mance of the new teams in the second 
of DATA, with the results shown in Ta- 
Table 6 makes the same breakdown of 


ferent p St Ibutions are not significantly dif- 
differen ecause of the small N, but the average 
Cogniti H between teams composed on the 
same ie complexity variable is about the 
he rder of magnitude as in Table 5 and in 
Same direction, 


Discusston 


T i n . . 
diff, a“ discovery of a statistically significant 


e 
arg; ce between the performance of teams 
ving in Percentage of abstract members 1s 


Tathe 
© remarkable for several reasons: 


TABLE 6 
COGNITIVE COMPLEXITY RELATED 
TO TEAM PERFORMANCE 


Average profit or 


| 
Composition | oa or loss for second 6 
ams sessions of DATA 
Above 50% ab- 
stract members 3 —3,983,500 
Below 50% ab- 
stract members 5 — 12,799,040 


1. The arbitrary assignment of the partici- 
pants to concrete and abstract categories was 
based on small differences. Persons scoring 5 
and below were classified concrete and those 
scoring 6 and above were classified abstract. A 
replication of this study should utilize par- 
ticipants scoring at more extreme levels on the 
cognitive complexity scale. 

2. The small number of teams and the high 
variability of team performance is also a fac- 
tor. Within the limitations of a preliminary 
study, however, the data reported here sup- 
port the conclusion that the composition 
variable of cognitive complexity, as measured 
by the Paragraph Completion Test, is related 
to team performance when team compositions 
of below 50% abstract members are compared 
with above 50% abstract members. In addi- 
tion, there is some suggestion that team per- 
formance differences become larger as the 
number of sessions increases, that is, the effect 
of the composition variable increases over 


time and sessions. 
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COMPARABILITY OF OLD AND REVISED FORMS OF 


THE STRONG VOCATIONAL INTEREST BLANK 
FOR WOMEN ' 


RICHARD W. JOHNSON 2 
Counseling Center, University of Wisconsin 
The old and revised forms of the Strong Vocational Interest Blank for Women 


(SVIB-W) were administered to a volunteer sample of 124 female Ss. The 
scores on the 29 occupational scales common to both forms of the SVIB-W 


were highly related (Mdn. r=.83) for most of the scales. 
on some of the scales changed markedly, however, largely 


year of sampling of the origin 
latter, correlations between in 
moderately high (Mdn. r = .71 


Interest Scales, scorable on the ol 
the old and new SVIB-W 


The 1969 revision (Form TW398) of the 
Strong Vocational Interest Blank for Women 
(SVIB-W) contains a number of major 
changes (Campbell, 1969). The number of 
scales has been greatly increased, many of the 
items have been changed, new criterion sam- 
ples have been obtained for most of the oc- 
cupational scales, and a new women-in-gen- 
eral (WIG) reference group has been estab- 
lished. In each instance, the changes are more 
extensive than those made with the 1966 re- 
vision of the men’s (SVIB-M) form (Camp- 
bell, 1966). 

Most of the information provided by the 
1946 version (Form W) of the SVIB-W 
(Strong, 1946) can still be obtained with the 
revised form. All but 4 of the 33 occupational 
scales and both of the nonoccupational scales 
found on the old form have been restandard- 
ized. In addition, 29 new occupational scales, 
2 new nonoccupational scales, 19 new basic 
interest scales, and 6 new administrative in- 
dexes are now listed on the profile sheet. 

Because of the widespread changes, the test 
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author, Counseling Center, Universit 
415 West Gilman Street, Madison, 


the Wisconsin 
the University 
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al occupational group members. Because of the 
dividual profiles for the two forms were only 
). Conclusions based upon profile analyses of the 
old form should not be generalized to the new form 
d SVIB-W, may serve as a bridge between 
for use in longitudinal research projects. 
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The level of scoring 
dependent upon the 


- A short form of the Basic 


user will want to know to what extent scores 
on the new form are comparable to scores on 
the old form. Can research findings or clinical 
skills based on the old SVIB-W be generalized 
to the new form? The purpose of this study 
was to compare test results obtained in testing 


a common group of Ss with the old and re- 
vised forms of the SVIB-W. 


METHOD 

Subjects, A total of 
forms of the SVIB-W. 
to 66 (X= 25.5, SD 
42 gainfully employe 


124 female Ss completed bors 
The Ss ranged in age from 1° 
= 8.0). Of the 124 Ss, there were 
d adults, 7 housewives, 16 gradut- 
ate students, 58 undergraduate college students, 2" 
1 high school student. The S 
wide variety of fields. 
were in the field of edu 
field, 12 in Secretarial o; 
fields. The students we 
(N = 33) or education 
maining students Were enrolled i 
Letters and Sciences. All Ss wi 
Procedures, The tw 
administered in count! 
instructed to take th 
within 24 hr. of e 


s were drawn from ^ 
Among the employed Ss; 
cation, 10 in a health-relate® 
r office work, and 6 in othe 
re concentrated in pharmacy 
(N = 19), Most of the T° 
n the College ° 
ere volunteers. e 
° forms of the SVIB-W wer? 
erbalanced order, The Ss. wer 


tration centers 
1965). The Ss w 


P ro 
d copies of their test P 
files together with a bri 


hlet 
brief interpretive pamp 
at the conclusion of the study, 

Statistical analysis. Scale comparabilities for 
two forms of the SVIB- Were determined bY 
relating cores on the old for™ 


the 
cor” 
of 

all of the Profile s 


| 
i 
4 


| 
| 


\ 
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Comparagitity or Orp AND Revisep Forms or SVIB-\W : 


J DATE OF TESTING 
ROUP| OCCUPATIONAL SCALE onmo aINAL e 
1 MUSIC TEACHER 1952° 
MUSIC PERFORMER 1952" 
n ARTIST 1934 1941 
AUTHOR 
LIBRARIAN 1934 
+ 
ENGLISH TEACHER 1934 83 | 24 | 33 |14 | 15 
m SOCIAL SCIENCE TEACHER 1934, 1941 =] 70 | 20 | 33 Pz] 12 
YWCA SECRETARY 1934 63 | 1a | 3a | 10 | 13 
D SOCIAL WORKER 1983" HOE | 25 fia] 13 
PSYCHOLOGIST 0 1942 zn a7 | 30 | 2a | 12 [1 i 
LAWYER 1934, 1941 | 89 j 28 | 28 | 1 | 12 
Vv LIFE INSUR SALESWOMAN 1937 1941 | 86] 29 i 24110 | 44 
y BUYER 1942" 
BUSINESS EDUCATION TEACHER c 1941 je 
STENOGRAPHER. SECRETARY 1934 1947 
|| OFFICE WORKER 
vi ELEMENTARY TEACHER ¢ 1939 
HOME ECONOMICS TEACHER 1941 
DIETITIAN 1941 
vu PHYS ED TEACHER (HS) 1939 
PHYS ED TEACHER (COLL) 
OCCUPATIONAL THERAPIST 1942 
PHYSICAL THERAPIST | 1957 
a NURSE C 1934 
w | PHYSICIAN = 1934 1981 
DENTIST 1934 1942" 
[| | txsonarony TECHNICIAN 1941 
x MATH-SCIENCE TEACHER | opa en 
ENGINEER L aae 
SUPPLEMENTAL SCALES 
| | SPEECH PATHOLOGIST T w J Fal a7 | a2] 4 | 9 o 
| COMPUTER PROGRAMMER 1964, 1966" ea ja [sl | 
CATHOLIC SISTER TEACHER 19617 70. E TELT _ > 
NON-OCCUPATIONAL SCALES X i m 
[_mascuuniry -= FEMININITY 1934 aaa Spi 
alata s- —o New SVIB-W 
fp | ACADEMIC ACHIEVEMENT 1961° 35 | 48 i sed 
* Original criterion group used in 1969 revision All other criterion groups used in the revision were tested in 1965, 1966 or 1967(Campbell, in press) 


Fic. 1. Comparison of scores on old and nel 
as they appeared on the old version of the inv 
changed slightly in the revision, e.g. Nurse has 


echnician has been changed to Medical Techno’ 


from the revised form.) 


t 
aor ety With all of the profile scores on he 
tions bees Of particular interest were the correta- 
Seeuna an the 29 occupational and 2 non- 
on the ional scales with identical or similar names 
Broke and new forms. i 
each g r comparability was determined separately a 
Similan’ Lhe Scores obtained by each S on the 29 
ar scales on the old form were correlated with 


w 
entory. The names of some of the scales have been 


SVIB-W scales (N = 124). (The scales are listed 


been changed to Registered Nurse and Laboratory 
logist. The four scales with no data were dropped 


the scores obtained on these same 29 scales on the 


new form. 
Scores on the new Basic Interest Scales were cor- 


related with scores on a short form of the Basic 
Interest Scales available for the 1946 SVIB-W.3 The 


3 Information on procedures for scoring old 
SVIB-W (Form W) answer sheets (pre-1969) on 
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TABLE 1 
E z FF. STS BETWEEN SCALES 
IVE CORRELATION COEFFICIENTS BETWEEN OLD 
DORREA ao nae From SVIB-W axp New SCALES 
Old scale x SD New scale xX SD r 
S — 

z > 84 

33 11 Artist 28 12 # 
Anthos Newswoman 33 n = 
fotke! 29 11 Instrument Assembler 23 3 86 
See EES Bankwoman 25 11 85 
Telephone Operator 20 14 | 35 
Ag 31 9 Saleswoman 19 12 ae 
ee Telephone Operator 20 14 k 

ional Therapis 2 7 

si of tion Teacher 24 9 Occupational Therapist 36 12 è 
suai ee Physical Therapist 34 14 „64 


Note.—N = 124. 


Basic Interest Scales on the revised form include all 
of the items found on the old form plus new items 
which are included only in the revised form of the 
SVIB-W booklet. 

AJl correlation coefñcients were Pearson product- 
moment rs. All statistical analyses were performed 
on the University of Wisconsin, Control Data 
Corporation 3600 computer. 


RESULTS 


The median correlation coefficient between 
the old and revised occupational scales was 
-83 (see Figure 1). The intercorrelations 
ranged from a low of .63 (YWCA Secretary) 
to a high of .92 (Computer Programmer). 

Scores for 20 of the 29 new scales were 
based on new occupational criterion groups. 
The median old-new intercorrelation for the 
20 scales based on new criterion groups was 
identical (.83 in both cases) to the median r 
for the 9 occupational scales based on old 
criterion groups. 

The academic achievement (AACH) scale 
was only slightly modified by dropping 8 of 
the 55 items which were scored on this scale. 
The high intercorrelation (r = .95) between 
the old and revised AACH scale suggests that 
effect of this change was negligible. The new 
femininity-masculinity scale (FMII), how- 
ever, has been substantially changed. This 
scale now contrasts the interests of equal 
samples of men and women employed in 18 
different occupations, Unemployed women are 


the short form of the Basic Interest Scales may be 
obtained from David P. Campbell, Director, Center 
for Interest Measurement Research, University of 
Minnesota, Minneapolis, Minnesota 55455. 


no longer included in the sample. Only items 
common to the men’s and women’s forms of 
the SVIB are scored. As a result of these 
changes, the correlation between the old and 
new forms was only moderately high (7 = 
70). 

The intercorrelations were almost always 
higher with a similar scale than with any 
other scale. There were only a few cases 1? 
which a key was more highly correlated (P € 
05) with a nonsimilar key than a similar key- 
The old YWCA Secretary key was more 
highly correlated with the new Guidance 
Counselor key (r= .76) than with the new 
YWCA Staff member key (r = 63), The new 
YWCA Staff member key, on the other hana; 
was more closely related to the old Soci@ 
Worker key (r = .77). The new Librarian keY 
was more highly correlated with the old 
English Teacher scale (r= 80) than with 
the old Librarian scale (r= .67), 

Despite the relatively high intercorrelatio™ 
of the old and new occupational scales, t” 
level of scoring on some of the scales varie’ 
considerably. Of the 20 similar scales base 
upon new occupational grou iffered P? 
at least 5 standard scores ar. ve Dail put 
2 of these 10 cases (Artist Psychologist 
the scores were higher on the new form. The 
differences were most notable on the ver 
old scales. Of the four scales standardized a 
1934, three (YWCA Secretary, Nurse, ” ¢ 
glish Teacher) showed increases in score’ A 
nearly one standard deviation or greate 
Scores on the YWCA key moved por 
lowest (standard score — 14) for the grouP 


ee 


` 
] 


y 
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the old form to tied for second highest (stan- 
dard score = 34) on the new form. In con- 
trast with these results, Ss scored lower (or 
equal) on the new form on all nine occupa- 
tional scales based upon old criterion groups. 
In three cases (Social Worker, Catholic Sister 
Teacher, and Speech Pathologist), the differ- 
ence was equal to or greater than 5 standard 
score points. 

Of the four old SVIB-W scales which have 
not been replaced on the new form, all are 
highly correlated with a new scale except 
Physical Education Teacher (College) (see 
Table 1). Inspection of both correlation co- 
eficients and mean scores indicates that the 
Newswoman scale is a reasonable substitute 
for the old Author scale and that the Bank- 
Woman scale closely resembles the old Office 


Worker scale. Relative scores on the House- 
wife key may be estimated with a fair degree 
of accuracy by scores on some of the non- 
professional keys, particularly Saleswoman 
and Telephone Operator. However, Ss in this 
study scored quite a bit lower on these scales 
than on the old Housewife scale. No new 
scale corresponds closely with the old Physical 
Education Teacher (College) scale. The new 
Physical Education Teacher scale, based upon 
the responses of high school physical educa- 
tion teachers, does not replace the old scale 
for college teachers (7 = .47). 

Correlations between the new nonsimilar 
occupational keys and old SVIB scales are 
shown in Table 2. Only the highest positive 
correlation coefficients are reported. The fol- 
lowing scales appear to be most successful in 


TABLE 2 


Hicuest Positive CORRELATION Co 


Appep to SVIB-W 


SFFICIENTS BETWEEN NEW SCALES 


AND OLD SCALES 


— 
New scale x SD Old scale x SD r 
Entertainer 30 14 Music Performer 35 13 81 
Model 31 12 Life Insurance Saleswoman 21 10 63 
Art Teacher 24 16 English Teacher 24 14 62 
Interior Decorator 20 14 Author 33 11 .66 
Newswoman 33 15 Author 33 11 82 
Language Teacher 35 15 English Teacher 24 14 -79 
Recreation Leader 34 11 Music Teacher 25 14 59 
Director, Christian Education 21 13 Music Teacher 25 14 Ti 
Guidance Counselor 29 12 Social Worker 35 13 79 
Translator 33 13 Artist 34 10 J1 
Chemist 14 15 Physician 29 12 78 
Mathematician 18 14 Physici e A 12 ay 
tmy-Enlisted 27 9 Math-Science Teacher 24 13 53 
Navy-Enlisted 31 9 Laboratory Technician 27 14 68 
Army-Officer 32 10 Lawyer 28 11 80 
Navy-Officer 38 8 Lawyer 28 11 .59 
Accountant 21 11 Computer Programmer 2 A 57 
Bankwoma: 25 11 Office Worker 7 2 1 85 
i Business Education Teacher 21 10 85 
Public Health Nurse 32 12 Nurse ae 10 51 
Licensed Practical Nurse 20 12 — A A ee 
Radiologic ‘Te. š 15 Laboratory Technician 27 14 86 
Poulter e | | oe | See fi aie 
Executi Anes 11 jetitian 23 78 
Seleswoman Te e 12 Office waker A n g3 
‘Telepho j 14 Office Worker 29 85 
e e 20 | 13. | Office Worker 2 | u | 86 
Sewing Machine Operator 13 13 Office Worker 29 11 83 
Beautician 30 10 Buyer 20 12 76 
Airline Stewardess 31 13 Stenographer-Secretary 33 10 52 


Note.—N = 124, 
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TABLE 3 


JCT-. NT 
FREQUENCY DISTRIBUTION OF PronucT — 
CORRELATION COEFFICIENTS BETWEEN LD 
AND NEw SVIB-W PROFILES 


Cumulative 
ž f af % 
3 124 100 
35-89 a | 12 = 
80-84 21 107 FH 
75-.79 14 30 ? 
70-.74 15 12 = 
65-.69 9 57 ao 
‘60-.64 9 48 3 
.55-.59 12 39 31 
"50-54 7 27 4 
45-.49 7 20 16 
‘40-.44 7 13 10 
135-.39 3 6 5 
30-.34 2 3 2 
.25-.29 1 1 1 


tapping interests not adequately measured by 
the old SVIB (r < .80): Public Health Nurse, 
Airline Stewardess, Army-Enlisted, Account- 
ant, Navy Officer, Recreation Leader, Art 
Teacher, Model, Interior Decorator, Navy- 
Enlisted, Translator, Licensed Practical 
Nurse, Beautician, Director of Christian Edu- 
cation, Chemist, Dental Assistant, Executive 
Housekeeper, Guidance Counselor, and Lan- 
guage Teacher. All of the other new scales 
correlated highly with at least one old 
SVIB-W scale. The new nonprofessional keys 
appear to be most closely related to the old 
business keys, 

The median intercorrelation between the 
old and new individual Profiles for the 29 


similar occupational scales was .71, Although 
some of the inter 


0 profile correlations were rela- 
tively low (see Table 3), all but five of the 
7S were significantly different from chance 
(p< 05), 


The relationship between the old and new 
group profiles (as shoy 


ae ale: wn in Figure 1) was 
statistically insignificant (r= 31, ps 05) 
The group profiles we abe 


Te less highly intercor- 
related than most of the individual profiles 
presumably due to the restricted range of 


scores on the group profiles. The standard 
deviation of scores for most individual Profiles 
was 10-15 points; the standard deviation 


of scores for the group profiles was less than 
6 points in both cases. 

The Basic Interest Scales for the two forms 
were more closely related than the occupa- 
tional scales. As shown in Table 4, the median 
intercorrelation was .90 with a range from 
86 to .95. 


Discussion 


The median 7 of .83 between the old and 
new similar occupational scales was surpris- 
ingly high. This r is virtually identical with 
the r of .84 obtained between the old and new 
occupational scales on the SVIB-M (Camp- 
bell, 1966, p. 55; Williams, Kirk, & Frank, 
1968). Because of the use of a completely 
new women-in-general group, new occupa- 
tional groups, and new items, the relationship 
between the scales was expected to be lower- 
Furthermore, because new items were used; 
the test scores were based upon two separate 
administrations of the interest inventories. 
In both of the SVIB-M comparability studies, 
the test scores for the old and new scales 
were based upon a single administration of 
the old form of the SVIB (which included 
all of the items scored on the new form). 

For the majority of the interest scales, 5’S 
relative performance on the new scales may 


TABLE 4 
CORRELATIONS BETWE 
(Lonc) Versr 


v OLD (Snort) ann NEW 
S or BASIC INTEREST 


SCALES FOR SVIB-W 
= = 
Old form New form 
Scale o 
No. > No. 
of | X | sp} of | x }sp| r 

= items items e 
Public speakin; 12 2 To a 
s g 5| 9 "87 
Law/Polit 9/53) 11 | 33 33 i | 8 
Merchand ng 12 49 9 13 4 9 8) 
fice Practices 1/47) ala | 48 | 3 | 192 
Numbers 7 |49] of 3 | 32 | ao | 23 
Muziical Science | 10 | 49] 12 | 48 | 48 i | 33 
Mechanical 10 | 48 | 10 | 16 | 22 | i0 30 

utdoors 6 | 49 $ 
Biological Sciences | 6 | $ i0 7 $2 it | 24 
Medical Service 19 | 51 j 10) a7 | 80 | 10 | -22 
eaching 11 | 50 | 10 | 49 50 | 10 36 
Social Service 6) 51} 8] 8} 30] 9 | 39 
Sports 0 | 52{ 10] 12 | 30 | 10 | -25 
fomemaking 10) 49) 81 1 | 30] 9 | 8 
‘Cligious Activities |» 5 44 | 10 9 | 44 | 9 | G0 
ame I 3j st} to] 6 | st 10 | (86 
fitforming Arts 11 3 10 13 3 10 SE 

ing M2 | 52 | 10 | 14 | 32 | 10 | - 
ee. ee 


Note—N = 124, eee 
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be fairly accurately estimated by her perform- 
ance on the old scales. Relative scores on the 
new YWCA Staff Member, Physical Educa- 
tion Teacher, Librarian, and Occupational 
Therapist scales are most divergent from 
relative scores on the old scales. 

Test users will want to consider absolute 
level of scoring on the new scales as well as 
relative level of scoring. In general, there 
was a tendency for Ss to score higher on new 
scales which formerly had been based upon 
old criterion groups (1934 vintage) and lower 
on new scales which were based on the same 
Criterion groups as before. The levels of 
Scoring on the YWCA Staff Member, Social 
Science Teacher, Nurse, Social Worker, En- 
glish Teacher, and Occupational Therapist 
keys were most affected. 

A good part of the difference in level of 
Scoring may be attributed to the changing 
Popularity of interest items (Campbell, 1968). 
As items change in popularity, scores on 
Interest scales will go up or down depending 
upon year of testing of the criterion group 
and the WIG group. Ideally, the occupational 
groups and the WIG group should both 
represent the current interests of females. If 
either one is out of step with the times, the 
Scores will be somewhat distorted. 

The relatively low intercorrelations between 
the old and new individual profiles were some- 
What surprising in view of the high correla- 
tions for the individual scales. The correla- 
tion coefficients between the individual scales, 
however, are not sensitive to differences in 
level of scoring on the two forms. If changes 
in level of scoring had been uniform from 
scale to scale, the interprofile correlations 
Would also have run high. Because some of 
the scales showed a much greater variation 
in level of scoring than other scales, the rela- 
tive positions of the occupational scores on 
the two forms were significantly altered in 
Many cases, The greater the change in profile 
Shape, the lower will be the interprofile 
Correlation coefficient. 

How do these results affect the use of the 
test in counseling and research? Based upon 
Hoyt’s (1960) study, clinical judgments of 
Profile comparability are closely related to 


Statistical judgments of profile comparability. 


He found that a Spearman rank-order cor- 
relation coefficient (a close approximation of 
Pearson’s r) of .65 or less between SVIB-M 
profiles indicated “at least one important 
difference” in interpretation. Using this cri- 
terion, nearly two-fifths (39%) of the Ss 
tested in this study produced profiles with 
important differences between them. Presum- 
ably, counseling based upon the new profile 
would significantly vary from counseling based 
upon the old profile for a substantial number 
of Ss. 

Research studies based on profile analysis 
of the old SVIB-W may be particularly mis- 
leading. The lack of a significant correlation 
between the old and new group profiles sug- 
gests that conclusions based on profile analy- 
ses for groups tested in recent years with the 
old form of the SVIB-W should not be 
generalized to the new form. 

The correlations between the abbreviated 
Basic Interest Scales found on the old form 
and the full-length Basic Interest Scales found 
on the new form are high despite the fact 
that relatively few items (5-12) are scored 
on the scales in the short version. The magni- 
tude of the relationship suggests that the 
Basic Interest Scales on the old form of the 
SVIB-W may possibly serve as an adequate 
measure of interests for investigators con- 
ducting longitudinal studies with the old form 


of the SVIB-W. 
CONCLUSION 


Findings based upon the relative order of 
scores on individual scales on the old SVIB-W 
may be generalized to the new SVIB-W with 
some degree of accuracy for most scales. It 
is important to note, however, that the level 
of scoring on some of the scales changed 
markedly. The scores on some scales, espe- 
cially those scales originally based upon very 
old criterion groups, ran considerably higher 
on the new form. Scores on other scales, par- 
ticularly scales based upon the same criterion 
groups used in the 1946 form, were lower on 
the new form. 

Because of the varying change in level 
of performance on the two forms, findings 
based upon profile analysis will be signifi- 
cantly altered. Clinical insights or research 
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results based upon intraprofile comparisons 
for the old form should not be generalized to 
the new form of the SVIB-W. Counseling 
experience with the new form will have to be 
acquired and new research data will have to 
be collected. 

For researchers midway in a longitudinal 
research project, the short form of the Basic 
Interest Scales will provide a fairly complete 
map of an S’s interests which should conform 
closely with the results obtained with the 
Basic Interest Scales on the new form. 

Tn conclusion, the new form of the SVIB-\V 
yields information sufficiently disparate from 
the old form to suggest that the two forms are 
not equivalent for most purposes. While the 
validity of the new form is still relatively 
unknown, most users will prefer the new form 
because of the use of current occupational 
criterion groups in most cases and the addi- 
tion of new scales tapping new fields of 
interest. The recency of the development of 
the scales appears to be critical. It was the 
scales originally based on the oldest criterion 


groups which produced the most discrepant 
results when revised. 


RicHArp W. JOHNSON 
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JOB CONTENT AND CONTEXT FACTORS RELATED TO 
SATISFACTION FOR DIFFERENT OCCUPATIONAL 
LEVELS * 


THOMAS B. ARMSTRONG ? 


Teachers College, Columbia University 


Herzberg’s theory was tested using Darley and Hagenah’s rationale relative to 
occupational level. Ratings of satisfaction and importance for the job content and 
context factors and overall job satisfaction were obtained from 200 engineers 


and 153 assemblers. The proposed job 


factor dichotomy was not supported. 


However, satisfaction with the content factors made the greatest contribution to 
overall job satisfaction, regardless of occupational level. Conversely, ratings of 


job factor importance were a function 
Were most important for engineers, and 


of occupational level; content aspects 
context for assemblers. Several demo- 


graphic variables failed to influence the findings. The conclusion is that both 


theories represent oversimplifications. 


Recent job satisfaction research has been 
strongly influenced by the theory proposed by 
Herzberg, Mausner, and Snyderman (1959), 
which is based on a content-context job factor 
dichotomy. The main hypothesis states that 
favorable feelings toward content factors, like 
achievement, contribute primarily to overall 
Job satisfaction, but do not generally con- 
tribute to job dissatisfaction per se. Hence, 
Content factors act mainly as satisfiers. Con- 
versely, positive feelings toward context as- 
pects, like job security, contribute to neutral- 
izing dissatisfaction, but do not generally con- 
tribute to job satisfaction per se. Thus, con- 
text factors act mainly as dissatisfiers. More- 
Over, these diverse and curvilinear relation- 
ships are asserted to hold regardless of occu- 
Pational level. 

The results of research directed at the two- 
factor theory have been mixed. In essence, 
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when Herzberg’s method involving recall, or 
the critical incident technique, has been em- 
ployed, the theory has generally been sup- 
ported (Myers, 1964; Schwartz, Jenusaitis, & 
Stark, 1963). Conversely, when other meth- 
ods have been used, the theory has not typi- 
cally been upheld (Ewen, 1964; Hinrichs & 
Mischkind, 1967; Lindsay, Marks, & Gorlow, 
1967). 

Regardless of method, the effect of occupa- 
tional level is evident in the research and has 
been documented by Burke (1966) and 
House & Wigdor (1967). Specifically, Myers 
(1964) followed Herzberg’s method and the 
results showed that the frequency of appear- 
ance for the two types of job factors differed 
for his professionals (engineers) and semi- 
skilled workers (assemblers). Malinovsky and 
Barry (1965), using a correlation approach 
with a satisfaction questionnaire for a low- 
level occupation, maintenance and watchmen, 
reported that the two types of factors ex- 
hibited equivalent relationships with general 
job satisfaction. Friedlander (1965a) factor 
analyzed an importance questionnaire and 
found that the value of the two kinds of fac- 
tors differed as a function of occupational level 
and subgroup. 

On the other hand, it is surprising to find 
that a related, but complex, question has been 
relatively neglected in surveying this litera- 
ture. To what extent, if any, were the results 
of studies using high- and low-level occupa- 
tions affected by the dimension used to assess 
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the job factors, satisfaction versus impor- 
tance? Furthermore, if there was an effect, 
what are the theoretical and practical impli- 
cations? n 
The relationship between ratings of satisfac- 
tion and of importance for various job factors 
has been studied only infrequently in job 
satisfaction research. However, available data 
indicate that the two sets of ratings represent 
two different attitudinal dimensions (Fried- 
lander, 1965b; Froehlich, 1958). Relative to 
Herzberg’s theory, the bulk of studies that 
used satisfaction ratings for the factors (Hal- 
pern, 1966; Hulin & Smith, 1967; Werni- 
mont, 1966) revealed that the job content 
type generally exhibited higher relationships 
with overall job satisfaction than did the job 
context kind. However, as noted earlier 
(Malinovsky & Barry, 1965), these results 
may have been a function of the high-level 
occupations examined. Conversely, no such 
consistent findings stemmed from studies 
which used importance ratings for the job 
factors (Centers & Bugental, 1966; Fine & 
Dickman, 1962; Friedlander, 1965a). More- 
over, the results of two of these studies were 
contradictory, and both used high- and low- 
level occupations. Specifically, Fine and Dick- 
man found that, regardless of occupational 
level, the job content factors were judged 
most important in terms of both Job satisfac- 
tion and performance. In contrast, Centers 
and Bugental reported that the value of the 
two types of factors was a function of occu- 
pational level in terms of keeping one’s job. 
That is, content factors were most valued at 


higher occupational levels, and context kinds 
at lower levels, 


The goal of the present Study was to help 
fill this research gap. In so doing, the Tationale 
of Darley and Hagenah (1955) relative to 
occupational level was used to test the main 
hypothesis of Herzberg’s theory, Darley’s 
more traditional two-factor theory was drawn 


upon because it stipulates that occupational 
level indicates the type of job factor involved 
in job satisfaction. Simply stated, feelings 
about the job are derived from content fac- 
tors for high-level occupations, and from 
context aspects for low-level groups. How- 
ever, it should be noted that the number and, 
in some cases, the meanings of the specific job 
factors differ in the two theories, Also, con- 
trary to Herzberg, advancement is classed as 
a context factor by Darley. 

In the present study, the hypotheses con- 
cerned the relationships between satisfaction 
for the two kinds of job factors and overall 
job satisfaction. In addition, importance rat- 
ings for these same job factors were examined 
to shed added light on the problem. 


Hypotheses 


Two specific hypotheses were formulated: 
(a) For the high-level occupational group, the 
job content factors make a relatively greater 
contribution to overall job satisfaction than 
do the job context factors. (b) For the low- 
level occupational group, the job context fac- 


tors make a relatively greater contribution to 
overall job satisfaction th 
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status, for which Super’s (1957, pp. 6-8) definition 
relative to prestige and importance was used. 

Selection of the final 52 pairs of positively and 
negatively phrased items was based primarily on the 
judgmental agreement of eight social scientists. An 
administration of the JFSS to the personnel sample, 
followed by an item intercorrelation analysis, was 
also used as an aid in item selection. The final edi- 
tion of the JFSS contained 13 subscales, each of 
which was composed of four pairs of items ex- 
pressed in a five-step, Likert-type format; and two 
major scales (Job Content, Job Context) which 
yielded a summed score for each set of specific fac- 
lors. Two types of reliability estimates were high 
for both kinds of scales. For the subscales used to 
test the hypotheses, a mean test-retest coefficient of 
‘84 was obtained for a 2-wk. interval, and a mean of 
r= 85 using the split-half method. 
2-item questionnaire called the Job Factor 
Importance Scales (JFIS), consisting of the posi- 
tively phrased items from the JFSS, assessed the 
relative importance of the same 13 specific job 
factors in terms of job satisfaction. The JFIS con- 
tained 13 subscales, cach of which was composed of 
four items expressed in a five-step importance 
format; and two major scales, (Job Content and 
Job Context) which yielded summed scores for each 
set of specific factors. Internal consistency esti- 
mates were high for both kinds of scales. For the 
subscales, the mean correlation was .85. However, 
test-retest reliability for a 2-wk. period was, in 
general, only moderate. The mean correlation for 
the subscales was .66. 

4. A group of items was included that concerned 
the demographic characteristics of age, time with 
company, and for the high-level group (professional 
engineers), job level and time on present job. 

The total questionnaire, along with a cover letter 
explaining its purpose, was mailed to the profes- 
sionals in June 1966, and administered on company 
time to the low-level group (semiskilled assemblers) 
in December 1966. Anonymity was guaranteed and 
Participation voluntary. 


Sample 


The sample consisted of enginee 
because both occupations are classified in the tech- 
nology field (Moser, Dubin, & Shelsky, 1956), with 
engineers as professionals and assemblers at the 
Semiskilled level, Hence, occupationally, the field was 
Controlled while the influence of level was examined. 

Iso, engineers were chosen because they represent 
One profession which exhibits a relatively high degree 
of job dissatisfaction (Deutsch & Shea, Inc., 1961). 
Both occupational groups were employed at a large 
electronic company’s engineering and manufacturing 
facility in New York state. 

A group of 285 employees was obtained from the 
laboratory population of 309 male, nonsupervisory 
engineers. Of this population, 79% were college 
Sraduates, Of the 285 engineers sampled. 200 (70%) 
returned sufficiently complete questionnaires to serve 
as Ss. Of these 200 engineering Ss, 8676 were betigecn 


rs and assemblers 


TABLE 1 


OCCUPATIONAL LEVEL COMPARISON OF Jos FACTOR 
SATISFACTION SCALES SCORES FOR THE ENGINEERS 
AND ASSEMBLERS 


Engineers | Assemblers 
Factor t 
| x sp | X | sp 
Recognition 28.0 | 
Responsibility 28.7 
Achievement 29.3 
Advancement 26.8 
Work itself 29.1 
Job content 
score 141.9 
Salary 26.3 
Job security 33.5 
Status 24.9 
Supervision—Tech- 
nical 26.6| 6.80} 27.0) 4.20} .68 
Supervision—Inter- 
personal relations | 31.2) 5.32) 29.2) 5.41] 3.45* 
Interpersonal rela- 
tions—Peers 30.3] 5.24} 28.4] 5.30} 3.35* 
Company policy 
and administra- 
tion 27.6| 5.10} 28.4] 4.85} 1.50 
Working conditions} 28.7} 5.09] 23.6] 4.24] 10.24* 
Job context 
score 229.3 | 27.65 | 209.4 | 24.72) 7.11* 


Note.—N = 200 for engineers, N = 153 for assemblers. 
+p < .01 (two-tailed test). 


20 and 39 yr. old; 50% had 1-7 yr. company 
service. Also, by job level, they ranged from junior 


to staff engineer. 
A group of 185 assemblers was obtained from a 


n of 251 males assigned to the lowest level 


populatio t v 
assembly jobs at the plant. Of this population, only 
42% were high school graduates. Of the 185 as- ( 


semblers sampled, 153 (76%) handed in sufficiently 
complete questionnaires to serve as Ss. Of these 153 
assembly Ss, 73% were between 30 and 49 yr. of 
age, and 84% had between 10 and 15 yr. company 


service. 
RESULTS 


An initial analysis of the data was con- 
cerned with two points; the degree and varia- 
bility of satisfaction, and the independence of 
the satisfaction ratings on the two types of 
factors. In regard to the first point, the OJSI 
data indicated that the engineers were gen- 
erally only moderately satisfied with the job, 
but significantly (p< .01) more satisfied 
than the assemblers as shown by a é test (t = 
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TABLE 2 


INTERCORRELATION MATRIX For FIFTEEN SCALES OF THE JoB FACTOR SATISFACTION SCALES 
WITH ENGINEERS 


Variable Lele llall psf ]o]je]e]s]e]s 


1. Recognition 
2. Responsibility 
3. Achievement 
4. Advancement 
5. Work itself 


Sector 1 


6. Job content score Sector 3 


7. Salary 

8. Job security 

9. Status 
10. Supervision-Technical 

11. Supervision—Interpersonal relations 
12. Interpersonal relations—Peers 

13. Company policy and administration 
14. Working conditions 


Sector 2 


15. Job context score 


67 72 64 67 55 52 


i 
05; 7 = .18, p =.01;N = 200, 


Note.—Decimal points are omitted; r = .14, p= 


| 
3.47). The OJSI mean score for the engineers was shown for the job factors. (Exceptions | 
was 16.4 and for the assemblers 15.1, In both were supervision-technical and company policy 
cases, the standard deviation was 3.5, In ref- and administration on which the two occupa | 
erence to the specific job factors, the data are tions were equivalently satisfied ) | 
Presented in Table 1. From this table it will In regard to the ind ] è f the satis- 
be noted that a similar satisfaction pattern Spone ence o j 


faction ratings, the intercorrelations of thé 
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WITH ÅSSEMBLERS OR SATISFACTION Scares 
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g 
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|? [soft fa2 fsa fos 
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. Salary 
. Job security 
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el 
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job factors of the JESS for the engineers are 
shown in Table 2, and for the assemblers in 
Table 3. 

Examination of Tables 2 and 3 indicates 
that the difference in the proportions of posi- 
tively significant correlations (p < .05) within 
the two types of specific factors (Sectors 1 
and 2 of the matrices) and between the two 
types (Sector 3) was not statistically signifi- 
cant for either occupation; respectively, 92% 
and 85% for the engineers, and 89% and 
80% for assemblers. Hence, the proposed con- 
tent-context dichotomy for the job factors was 
not empirically supported at either occupa- 
tional level. On the other hand, the specific 
Job content factors clustered together to a 
higher degree than did the job context type 
within both occupations as measured by a 
median test: for the engineers, x? = 10.72 ($ 
< .01), and for the assemblers x? = 4.29 (p 
< .05). 

In regard to the study’s two predictions, the 
first hypothesis was accepted and the second 
rejected since the job content factors made a 
relatively greater contribution to overall job 
Satisfaction than did the job context type, 
regardless of occupational level. Intraoccupa- 
tionally, the differences in the mean correla- 
tions for the two types of specific job factors 
of the JFSS with the OJSI were as follows: in 
the predicted direction and statistically sig- 
nificant (p < .01) for the engineers (a mean 
r of .63 for content versus an 7 of .32 for con- 
text), and in the opposite direction than pre- 
dicted for the assemblers (a mean 7 of .53 for 
Content versus an r of .38 for context). 

. Since it was evident that the two occupa- 
tions also differed on age and time with com- 
pany, the possible effects of the demographic 
variables on the results were examined. Cor- 
relation and one-way analysis of variance 
showed that age, time with the company and, 
m addition for engineers, job level and time 
On present job, failed to individually influence 

e findings for either hypothesis. . 

For both occupations, a multiple-regression 
analysis, based upon a backward elimination 
Procedure (Draper & Smith, 1967), revealed 
that composites of 3 job factors yielded about 
as good an estimate of overall job satisfaction 
as did using all 13 specific factors. For the 
initial and reduced regression equations, the 


TABLE 4 


OccuPaTIONAL LEVEL COMPARISON OF THE CORRELA- 
TIONS OF THE SPECIFIC JOB FACTORS OF THE JOB 
Factor SATISFACTION SCALES WITH THE 
OVERALL JoB SATISFACTION INDEX 
FOR THE ENGINEERS 
AND ASSEMBLERS 


Correlations | 
with OJSI 
Factor 7 z value 
Engi- | Assem- 
neers | blers 
Recognition 40 | 46 |— .68 
Responsibility 59 24 3.99** 
Achievement 76 | 64 2.20* 
Advancement 46 64 | —2.41* 
Work itself 19 61 3.34" 
Job content score .76 69 1.37 
Salary 18 .37 | —1.90 
Job security 18 08 4 
Status -63 .59 259 
Supervision—Technical 46 470 | — 12 
Supervision— 
Interpersonal rela- 
tions 36 AT | —1.23 
Interpersonal relations- 
Peers 37 wl 63 
Company policy and | 
administration 16 38 | —2.20* 
Working conditions 13 .29 | —1.55 
Job context score 52 -60 | —1.08 
Note.—N = 200 for engineers; N = 153 for assemblers. 


* p < .05 (two-tailed test). 
Dp <.01 (two-tailed test). 


multiple correlations were, respectively, .81 
and .80 for the engineers, and .76 and .74 for 
assemblers. Furthermore, in both reduced 
composites, the two specific factors (desig- 
nated by Herzberg as content) made a rela- 
tively greater contribution to the prediction of 
job satisfaction than did the one context fac- 
tor. This is shown by the second-order correla- 
tion for each specific factor with the OJSI: 
work itself-.45, achievement-.38, and super- 
vision-interpersonal relations-.16 for the en- 
gineers; and advancement-.40, achievement- 
.39, and supervision-interpersonal relations-.22 
for the assemblers. 

Table 4 reveals that the individual rela- 
tionships for 5 of the 13 specific job factors 
with the OJSI differed significantly as a func- 
tion of occupational level. 

Generally, the content factors were more 
potent in overall job satisfaction for the en- 
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gineers than for the assemblers. However, ad- 
vancement followed an opposite pattern since 
it was more salient for the assemblers. For 
the context factors, with the exception of com- 
pany policy and administration, the relation- 
ships were basically similar across the two 
occupational levels. 

For the satisfied and dissatisfied subsam- 
ples, the correlations between the job factors 
of the JFSS and the OJSI are reported in Ta- 
ble 5 along with those previously found for 
the two total samples. The Ss who scored 
OJSI 14 or above composed the satisfied sub- 
samples; those who scored OJSI 12 or below 
comprised the dissatisfied. The Ss who scored 
exactly 13 (the neutral point) on the OJsI 
were excluded from analysis. Within an occu- 
pation, the correlations obtained for the sub- 
samples were, except for their reduced mag- 
nitude, generally similar to those previously 
found for the total sample. In job satisfac- 
tion, both types of factors made significant 
contributions, but generally only those for 
content were statistically significant in job 
dissatisfaction. Moreover, in both job satis- 
faction and dissatisfaction, the typical contri- 
bution for the content type was relatively 
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greater than context, regardless of occupa- 
tional level. 

Table 6 presents data on the judged impor- 
tance of the job factors for the two occupa- 
tional levels. It was evident that the ratings 
were generally restricted to the high impor- 
tance end of the JFIS scale. However, the 
difference in the mean scores of the two major 
scales (Job Content and Job Context) was 
statistically significant (p < .01) within each 
occupation as examined by paired ¢ tests (¢ 
= 12.17 for engineers and £= 9.25 for as- 
semblers). Hence, the judged importance of 
the factors was a function of occupational 
level; content factors were more important 
for engineers and context for assemblers. 
Specifically, achievement, work itself, and 
advancement were most important for the en- 
gineers, and job security, company policy and 
administration, and salary for assemblers. 

Correlational analysis showed that the 
demographic variables cited earlier failed to 
exert any major influence on these findings 
regardless of occupational level, 

Also, occupationally, the satisfied and dis- 
satisfied subsamples generally followed the 
Same pattern of job factor importance. The 
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iti = = 153) | (W=10 y = 34 
R | 1)| (N 
Rents 40** 22 | y 
Meese So | zane 32 a | a 28 
mi “ 2 s 
Kayancemen, .70** 55% 36" ite 10 —.01 
Work itself as | agen | _ oy | eae sot 
Job content score 79 | Sass ‘sae | ORF | som ar 
cs 76" | 5ga ae | ONE | ase 30" 
Salary i -09%* .53%* 50** 
b i -18** i 
oo .18** a E 37 36%" 24 
Supervision—Technical ie .43** 35 a 01 25 
Oe " F. A 
aloe Interpersonal relations pots .24** 22 ‘Ari Pee 2 
interpersonal relations—Peers Te .19* 24 ‘aye 46 23 
Company Policy and administration i -09 18 Bie -48** 35 
Working conditions 16 .18* ‘Ot Sgk 35** 27 
Job context score 13 13 —.10 3 is .30** 23 
.52** .31** 26 à oe .28** —.08 3 
* b< .05. | -60 Tt 44" 
“tp S Ol, 
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4 TABLE 6 
Means, STANDAR ren " j 
ANS, STANDARD DEVIATIONS, AND RANKS OF THE JoB FACTOR TMPORTANCE SCALES SCORES FOR THE ENGINEERS 
zz AND THE ASSEMBLERS 
Engineers Assemblers 
Factor — = 
x | SD Rank x SD Rank 
Ri an | 
ee ten 14.3 10.0 a5 | 3.65) ad 
e neihi 14.8 8.0 13.3 | 3.68 12 
ieee 168 10 153 3.71 8 
Fe vancemient 16.6 3.0 15.2 4.05 9 
TX itself 16.7 2.0 14.7 3.71 10 
Job content score 79.2 73.1 15.73 
Salary 
> : 15.1 AS 16.9 3.61 3 
{ob security 13.9 11.0 18.2 230 1 
Sie ease 12.2 13.0 12.3 4.12 13 
a ervision—Technical 15.1 | 45 16.4 5 
a aime Interpersonal relations 15.0 6.5 16.6 2.74 4 
ior rpersonal relations—Peers 14.5 9.0 15.5 3.42 7 
a eny policy and administration 15.0 | 6.5 17:1 2.48 2 
orking conditions 13.2 12.0 15.8 2.97 6 
Job context score 114.1 128.8 17.45 


ee 


Note.—N = 200 for engineers, N = 153 for assemblers, 


oo between the ranks of the specific 
factors for these subsamples is shown by 


a . . 

3 Spearman rank correlation of .94 with the 
o] i 
Sineers, and .81 for the assemblers. 


DISCUSSION 


a lack of support found for the content- 
E a dichotomy was generally in line with 
S research, On the other hand, con- 
degr actors clustered together to a higher 
on than the context type. One possible 
Acterts ation for this finding involves the char- 
cific Ta of the two kinds of variables. Spe- 
itself y content aspects stem from the work 
Work. Thus, it logically follows that when the 
itself is challenging and interesting, 

ility ae increased opportunities for responsi- 

sume” achievement, and recognition. This, in 
ne enhances the likelihood of advancement. 

mat, context aspects do not spring 
be ioe common denominator, but appear to 
ment ependent bedfellows. Hence, enhance- 
com of job security is not necessarily ac- 
ee by increased satisfaction with other 
ing tn aspects, for example, salary oF work- 
actor: nero. In some instances, the specific 
Verse S mean different things to the two di- 
groups of workers, Recognition gen- 


erally clustered together with responsibility to 
a higher degree with the engineers than as- 
semblers. Conversely, recognition clustered to- 
gether with salary and company policy and 
administration to a higher degree for the as- 
semblers than engineers. 

Confirmation of the frst hypothesis pro- 
vided some support for both theories. More- 
correlations found with the engineers 
pecific job factors and overall job 
satisfaction were strikingly similar to those 
shown in research which used high-level 
groups (Halpern, 1966; Wernimont, 1966). 
Finally, the estimate of overall job satisfac- 
tion obtained using the reduced statistical 
model was also similar to Halpern’s results. 

For the rejection of the second hypothesis, 
the findings for the assemblers were generally 
in line with those of Malinovsky and Barry 
(1965). Thus, additional evidence was pro- 
vided for the saliency of the content factors 
regardless of occupational level. The results 
also argue against the common stereotype held 
for semiskilled, blue-collar workers, that is, 
that they merely trade their time at work and 
seek security and economic features in order 
to cope with life’s basic needs. While these 
data appear contradictory to past occupa- 


over, the 
for the s 
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tional level research, Centers’ (1948) early 
study also revealed that some of the most 
liked job factors were of an intrinsic nature 
for his semiskilled Ss. In terms of the two 
theories, doubt was cast on the validity of 
Darley and Hagenah’s (1955) model, and 
some support provided for Herzberg’s theory. 
Lack of support for Darley’s rationale could 
not be explained as a function of the sample, 
since these blue-collar assemblers were a 
poorly educated and moderately paid group of 
semiskilled workers. Second, it appeared that 
they had chosen their jobs more as a means 
for survival or entry into the organization, 
than for intrinsic factors. Finally, despite the 
fact that their assignments typically involved 
repetitive tasks, the majority were relatively 
satisfied with their jobs. 

On the other hand, because advancement is 
classed as a context factor by Darley and 
Hagenah (1955), it could not be stated that 
their theory was completely refuted, This was 
because in the reduced job factor composites, 
the relative contributions of the two kinds of 
specific factors to overall job satisfaction dif- 
fered as a function of occupational level. 
Specifically, for the assemblers, the two con- 
text factors (as designated by Darley) of ad- 


-interpersonal rela- 


Moreover, these findings were also contrary to 
Darley and Hagenah (1955) because occupa- 
tional level failed to be the main indicator of 
the kind of factor involved in job satisfaction 
and dissatisfaction. 

Since the judged importance of the two 
types of factors was found to be a function of 
occupational level, the predictions of the Dar- 
ley and Hagenah (1955) theory were sup- 
ported, but those derived from Herzberg’s 
model were refuted. In the latter theory, it 
logically follows that content aspects should 
be judged most important because they are 
asserted to be the primary contributors to job 
satisfaction, independent of occupational level. 
(Content factors are stipulated to act mainly 
as satisfiers.) 

On one hand, this set of results was di- 
rectly in line with the bulk of research on 
occupational level (Herzberg, Mausner, Peter- 
son, & Capwell, 1957), and similar to the 
findings of Centers and Bugental (1966) and 
Friedlander (1965a). 

Conversely, the present findings were con- 
trary to those of Fine and Dickman (1962). 
Although the discrepancy could be a function 
of a number of variables, the use of the tw0- 
dimensional occupational classification syste™ 
by Moser, Dubin, and Shelsky (1956), aP- 
Pears to shed some light on the problem. I” 
tan this system, which includes both 

nd field of work, the lower groups use 


by Fine and Dickman (electronic technicians 
and aides, secretaries, 
be of mixed levels and 


i Satisfaction versus importance rat 
ings). Neither theory could accommodate poth 


work. 


Despite the fact that the two theories failed 
to represent valid models, job satisfactio" 
remains a vital concept ot future researc?” 

he importance of the concept is noted 
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Super’s (1957) work where it is viewed as an 

Í internal aspect of adjustment to the world of 

) Rork. Also, inasmuch as a job provides outlets 

tor personality characteristics and basic needs, 

Job satisfaction is conceived as having an im- 

e ant effect upon success and satisfaction in 
njob areas of life (Thompson, 1963). 
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Researchers in the psycholo 
of variables which have con, 


where interviewers evaluate 
a fact-finding interview, 
college students evaluate 
etc., based on trait adjec 
some of the variables o 
interview setting are presented. 


a target S 


The effects of different info 
erties on assessment evaluations have been 
both extensively and intensively studied. At 
this point, for example, is it safe to conclude 
that negative information has a greater im- 
pact on assessment evaluations than does 
positive information (Bloom & Brundage, 
1947; Bolster & Carlson & 
Mayfield, 1967; 1951; 
Luchins, 1957a, 19 1967; 
Newman, Bobbit 


rmation prop- 


Asch, Bolster & 
> and Luchins, 


order of presentation (Bolster 


1 This article has | 
outgoing Editor, 
? Requests for 


gy of impression formation have isolated 
sistently demonstrated an effect o 
tions. Some authors have extended these finding 
job applicants base 
The feasibility of ge 


as 


tives is questioned, The results of 
Í impression fo 
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a number 
n judges’ evalua- 
gs to the employment setting 
d on impressions gained from 
neralizing from studies where 
a potential friend, roommate, lover, 


a study extending 


rmation to a simulated employment 


carries more evaluative w. 
tive information and that 
to predominate, 
The effect of th 
stimulus Ss on 
been extensively st 
concluded that a 
extreme evaluati 
lus § of opp 
Holmes & Berk 


eight than does posi- 
primacy effects seem 


e range and number of 


e 1 
individual, Dt 


places himself /herself, 


d in s Ae 
! omores as Interviewers and adj es 
tives as Interview Information, There ee 
appear to be some risk į 2 


n generalizing pee 
from studies ood 

i : es evaluate a stimulus 5 be 
Potential friend 
four adjectives wi 


z ane sae j 
as quarrelsome versus capable. Implicit 
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Such generalizations is that the experimental 
Situation closely approximates the employ- 
a gr If the experimental setting con- 
tine z 4 he elements of the employment set- 
Olean be dignified by terming it a model. 
rently T such a goal is laudable, but cur- 
minia ifficult to achieve. However, at the 
tempt i the experimental setting should at- 
dhe ex o approximate the applied setting on 
xperimental dimensions in question. 

FF ca ri reported here is an attempt to 
varia o the employment setting some of the 
Pir ged which have consistently demon- 
ape an influence on assessment evaluations 
ae th ip eo circumstances. These variables 
order e avorability of the information, the 
within 2 which the information is presented 
tant a stimulus S, the order in which the 
the į ka Ss are evaluated, and the validity of 
the information being evaluated. In addition, 
a ee were first trained on use of valid 
ier mation, the judges all had conducted in- 
Bite and the administration of the ex- 
sibl nental material followed as closely as pos- 

e the sequence of a selection system. The 
Purpose of the study was to determine to what 
extent an initial valid impression is altered by 
ord interview information differing in 
it Orability, order of presentation, and qual- 

Y of the preceding applicant. 


Suns METHOD 
ub jects 


AE responding Ss who acted as interviewers were 
2avk pores agency managers who were attending 
the Tin chools in Agency Management conducted by 
is i e Insurance Agency Management Association. 
recruitin a they have complete responsibility = 
Men t ing, interviewing, selecting, and training sates- 
© work under their supervision. , 

job 5 School hours cover all aspects of the mane 
Conte Pecifically, selection hours deal with the va ue, 
tests, nt, and conduct of the selection mee 
indoct and early training. The managers me w 
Index a, on the validity and use of the pee’ 
Peters attery (AIB; Paterson & Thompson, wd 
Various & Wallace, 1966). The success base rates or 
and in Scores are presented and managerial costs 
further plications are discussed. The managers = 
bivariate acted that as a result of the triangular 
criteria e distribution between test score al e 
an su the test is more accurate in predicting failure 
high] 5 uccess. Thus, the managers are subject to a 
Y standardized and practical interpretation of 


est results on the ATB. 


Construction of Hypothetical Job Applicants 


Previous research has clearly indicated that mean- 
ingful results can be obtained by constructing hypo- 
thetical job applicants from prescaled descriptive 
statements (Webster, 1964). The type of state- 
ment used obviously depends on the situation to 
which Æ wishes to generalize his results. In the 
research reported here, the goal was to simulate the 
fact-finding interview. The type of statements used 
were descriptive personal history items. The develop- 
ment of the personal history information used to 
construct the hypothetical applicants has been dis- 
cussed elsewhere (Carlson, 1967; Mayfield & Carl- 
son, 1966). 

The life insurance agency managers attending the 
2-wk. schools each evaluated eight hypothetical job 
applicants during the second week of the school. The 
first item of information about an applicant was 
always the aptitude test results. For one-third of the 
sample the test results indicated that the experi- 
mental applicant had passed the AITB, for one-third 
of the sample the hypothetical applicant had not 
taken the AIB, and for one-third of the sample 
the applicant had failed the Aptitude Index test. The 
test results were initially presented alone on the top 
of the first page of the test booklet describing an 
applicant and repeated as the first item on all suc- 
ceeding pages describing an applicant. The remaining 
eight items of personal history information were 
presented one at a time in a serial-cumulative fash- 
ion. Each test booklet thus consisted of nine pages 
with each page adding an additional item of infor- 
mation, At the bottom of each page the manager 
was asked to rate the applicant on a 7-point scale as 
to his potential success as a life insurance sales repre- 
sentative. Thus, each manager was asked to rate 
each hypothetical applicant nine different times based 
on nine different sets of information. 

Four different types of hypothetical applicants 
e stimulus material. The first type of 
hypothetical applicant was constructed from the test 
results and eight items of favorable information 
(FF). The items were approximately equal in scale 
value, standard deviation, and test-retest reliability. 
A second type of applicant was constructed from the 
test results and eight items of unfavorable informa- 
tion (UU), again with approximately the same 
mean, standard deviation, and test-retest reliability. 
A third hypothetical applicant was constructed from 
the test results and four items of favorable informa- 
tion and four items of unfavorable information 
(FU). The four items of favorable information were 
the same as the first four items of favorable informa- 
tion describing the first applicant, while the four 
items of unfavorable information were the same as 
the first four items describing the second applicant. 
The fourth applicant was the same as the third, ex- 
cept the four items of unfavorable information were 
presented first and the four items of favorable infor- 
mation presented last (UF). 

For all hypothetical applicants, only the scale 
value varied, the standard deviation and test-retest 


served as th 
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reliability of the items were approximately equal. 
One-fourth of the sample responded to each of the 
r types of hypothetical applicants. : 
S ERT af the test booklets the hypothetical 
applicant preceding the experimental applicant was 
of similar stimulus value, but constructed of dif- 
ferent items of personal history information. In the 
remaining half of the test booklets the preceding job 
applicant was of opposite value. The experimental 
pair was randomly allocated among the remaining 
six applicants. 


Analysis 


The original design of the experimental material 
was a 2X3 X4 factorial with nine repeated ratings 
made by each interviewer. Because the items of per- 
sonal history information were similar in stimulus 
value in blocks of four, it was decided prior to 
analysis to analyze three levels or amounts of infor- 
mation rather than nine. The first level of informa- 
tion was the test results alone (T), a second amount 
of information was the test results plus the first 
four items of personal history information (T4, 
while the third level of information was the test 
results plus all eight items of personal history infor- 
mation (T +8). In addition, rather than encounter 
unequal variance-covariance matrices as a result of 
the repeated measures and also to allow additional 
treatment effects to be explored, a random sample of 
25 Ss was taken within each cell. The resultant de- 
sign was a 2X3X4X3 factorial with 25 observa- 
tions per cell. The independent variables wi 


: ere two 
types of applicant samples (homogeneous, 


where the 


TABL 


MEANS AND STANDARD DEVIATIONS 
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experimental and preceding job applicants were of 
similar stimulus value; heterogeneous, where the ex- 
perimental and preceding job applicants were of 
opposite value), three levels of test results (pass, not 
taken, failed), four types of personal history informa- 
tion (FF, FU, UF, UU), and three amounts of infor- 
mation (T, T+4, T+8). A factorial analysis of 
variance was performed on the 2X3X4X3 treat- 
ment combinations. 


RESULTS 


Table 1 presents the mean and standard 
deviation of the ratings for the treatment 
combinations. 

Table 2 presents the results of the analysis 
of variance applied to the interviewer's evalu- 
ations. Looking first at the F values and their 
statistical significance, it could superficially 
be concluded that almost all the independent 
variables and their various combinations pro- 
duced some effect on the interviewer’s evalu- 
ations. This would be especially true if con- 
centration were focused on the number of sig- 
nificant F values and their magnitude. How- 
ever, Table 2 also presents omega-squared 
(Hays, 1963) which indicates the relative 
effect of the treatments on the dependent 
variable. By first determining whether or not 
a particular treatment produces a statistically 


E 1 


E OF ĪNTERVIEWER’S EVALUATIONS 
Homogeneous sample Heterogeneous sample 
Test results Ts T T 3 
T + 4b T + 8 T T+4 r+8 
x = ç = 
oe) = |e | 2 | wl = | spl x sp | x | SP 
Passed test g 
FF 
FF SI 2188) gazli wl gs | gi #2 | a | ee | 2 
FU ael G18 8) | ao | a 6 | 24 | 12 
yr a al | gte oli] zl Z 1i | 3a | 12 
7 — “| Se] @) at | tiles | Bl ae 11 | 8 | 13 
FF 3.1 
E 3 6 | 35 9 
PU E 3.6 1.0 3.0 38 4 
EU 332 lee | AEAEE E io o|a| 3 
op alala S al g 18 | 8 | 30 | 12 
er ies 3 : a fos | Wate Ges 18 S| a7 | i 
FF 1.2 l l l 
FU ia 3 y Zi | a2 | 28 | 1? 
UE 12 HETE | IE 
UU | 12 MEARE 7 
— — — — — a 3 : f 
ang NOt®: 25 in each cell, 4 ; i ame 
anc unfavorabl nformation, and Ub Wvorahle | mation; RU =f 
ST = test results alone: ‘vorable information, FU = favo" 
aA Ta i pet Temlis Dlus four items of Dersonal inf, i l 
= esults plus eight items of personal neeaae 
ation, 
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Significant effect on the interviewer’s evalua- 
lions, and second, the magnitude of this cf- 
fect, the importance of the treatment effects 
are more realistically evaluated. 
ey cakes of both the statistical and prac- 
i criteria point out three treatment condi- 
tons which appear to have an effect on an 
interviewer’s evaluation of a hypothetical ap- 
plicant. First, the most important, 14% of 
the explainable variance in the interviewer's 
evaluations is attributable to the objective in- 
formation, valid test results. Second, another 
12% of the variance in interviewer’s evalua- 
tions is due to differences in the favorability 
of Subjective personal history information. 
Third, an additional 11% of the variance in 
the interviewer’s evaluations is attributable to 
a combination of differing amounts and types 
information, The remaining treatment ef- 
€cts, while statistically significant, do not 
Practically reduce the variability of interview- 
er’s evaluations. 
_ Figure 1 presents a diagram of the inter- 
Vlewer’s mean evaluations of the hypothetical 
+bplicants. The interaction of the amount of 
nformation with the type of information is 
ee for the three different test-result 
tl nditions, By visual examination, it 1s clear 
te interaction is built into the experimental 
Setting by the type of stimulus applicant con- 
aaa However, the interaction per se is 
y the major interest. The primary interest iS 
y the amount of change in initial valid im- 
tts created by different kinds and order 
Information, Based upon these results, 


TABLE 2 
ANALYSIS OF VARIANCE OF INTERVIEWER’S RATING 
Source df MS F Ww 
Amount of information 
(D 2| 49.65 | 42.08* | .01 
Type of information (J) 3 | 289.54 | 245.37* | .12 
Sample (K) 1| 27.89) 23.64* | .00 
Test results (L) 2 | 497.59 | 421.69 | 14 
IJ 6 | 127.41 | 107.97* | .11 
IK 2| 32.54 | 27.58* |.01 
IL 4| 19.75 16.74* | .01 
JK 3| 19.20 | 16.27* |.01 
JL ó| 9.05} 7.67* | 01 
KL 2 719 67 | .00 
IJK 6| 6.09} 5.16*| .00 
IJL 12 2.96 2.51* | .00 
IKL | al .89.| © 75° q}c00 
JKL 6| 143] 1.21 |.00 
IJKL } 12 75 64 | .00 
Error | 1728 1.18 
I | 


*p =.01, 


three post hoc hypotheses were developed and 
tested for statistical significance. The post hoc 
hypotheses were as follows: 


1. For hypothetical applicants constructed 
of consistent information (all favorable or all 
unfavorable): (a) The first four items of favor- 
able information have the same effect on inter- 
viewers’ evaluations as the second set of four 
favorable items (Šrr — rru = Xrru 
Ten). There is neither a primacy or re- 
ect for favorable information. (b) The 
vorable information have 
ffect on interviewers’ 


— Xr T+8/ 
cency eff 
first four items of unfa 
a significantly greater el 


Failed Test 


Test Not Taken 


Passed Test 


a Mean 
a Mean 


ij T+4 T+8 T 


Amount of Information 


16. 1. Interviewer’s mean rating by tY 


T+4 
Amount of Information 


pe of information, 


a Mean 


T+8 


f T+4 
Amount of Information 


T+8 


amount of information, and test results. 
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evaluations than the second set of four un- 
favorable items (Xv, — Xury > Aura 
= Kimo) There is a primacy effect for un- 
favorable information. 

2. For hypothetical applicants constructed 
of inconsistent information (half favorable, 
half unfavorable) : (a) Unfavorable information 
produces greater change in an inverviewer’s 
evaluation when presented last, that is, a 
recency effect (Neu; Xruz,, > Xur, 
E AUFs): 


These three hypotheses were tested by the 
method of nonorthogonal contrasts. The three 
hypotheses were accepted at the .01 probabil- 
ity level. Thus, the effect of subjective inter- 
view information in altering valid impressions 
is dependent on the amount, kind, and order 
in which it is presented. However, it should 
be recalled that while valid impressions may 
be altered, they still account for the largest 


single portion of the explainable variance in 
the interviewer’s evaluations. 


DISCUSSION 


The substantive findings from the original 
analysis indicated that three variables ac- 
counted for significant portions of the vari 
in the interviewer’s evaluations, The first i 
pendent variable, valid test information, pro- 
duced the greatest effect on interviewer’s eval- 
uations. This confirms a previous study in 
which the author found that without subjec- 


tive personal information, the test r 
counted for 55% 


x /0 
in the 

1970). In 
the test re 


ance 
nde- 


l for only 14% of the 


types of subjectiv: 
8; and FF, FU, 
forming a 3 x 2 
counted for 30% pant Wits 


of the explainable variance. 
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Thus, the difference between the 55% of the 
variance accounted for in the previous study 
and the 30% accounted for in this study is 
due to sample and design differences, while 
the difference between the 30% of the vari- 
ance accounted for in the 3 X 2 analysis and 
the 14%% of the variance accounted for in the 
3 X24 & 3 original analysis is due to the 
interaction of the test results with the addi- 
tional independent variables. Thus, initial 
valid impressions can be readily altered by 
subjective personal information. 

The second independent variable, favorabil- 
ity of the information used to construct 4 
candidate, accounted for 12% of the explain- 
able variance in the interviewer’s evaluations. 
As would be expected, the applicants con- 
structed of eight items of favorable informa- 
tion were rated most positively while the ap- 
plicants constructed of eight items of un- 
favorable information were rated most nega- 
tively. Of the applicants constructed of half 
favorable-half unfavorable information, the 
UF order produced more positive ratings tha” 
the FU order. This persisted irrespective ° 
the type of test results used to establish the 
original valid impression. R 

The interaction between type of information 
and amount of information, which accounte 
for another 11% of the explainable varianc® 
indicates to some extent the mechanism bY 
which the initial valid impressions are altere® 
The interviewers seem to be demonstrating th? 
rule that “when the selection test is faile¢ 
forget the man”; and “when the selection tes 
1s passed, forget the test and evaluate tH 
applicant using other tools.” As would be ex” 
pected, information of Opposite value to the 
initial valid impressions produced a much 
greater change in rating than did information 
of similar value. The initial evaluation of f 
hypothetical job applicant who has athe 
passed or not taken a valid selection test 3 
altered to a greater extent by unfavora™ | 
information and much less affected by fav! a 
able information, The initial evaluation ° a 
hypothetical job applicant who has failed 
valid selection test is only slightly more A 
fected by favorable than he Inian orable m 
formation, In general, information which g 
consistent with an interviewer’s initia 


1 
A cs na! 
Pression adds less to his total impressio” i 
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ce wee value detracts from 
difficult ri oe ee = un E have some 
lorimaticn of coal a te an i. ee in- 
e ice te ier va ne while information 
ten r erent values can be easily 

: nd possibly overreacted to. The 

ak an interviewer feels uncom- 
nconsistent information may 
serve to explain why Webster (1964) found 
that interviewers actively sought information 
to support their early impressions. 
The results of the tests on the post hoc 
“dip a again demonstrate the strong ef- 
i unfavorable information and suggest 
Gand that both primacy and recency effects 
cox, e created by unfavorable subjective in- 
rmation. Unfavorable information produced 
Primacy effects when the hypothetical job ap- 
Plicants were constructed of uniform or con- 
sistent information. In this situation the first 
Set of unfavorable information accounted for 
the majority of the change in interviewer's 
evaluations, Additional unfavorable informa- 
tion changed the initial negative evaluation 
Very little. This was observed following all 
three test-result situations. This indicates that 
the contrast effect of negative information 
with any preestablished set may generate a 
Primacy effect. Thus, only a few items of un- 
favorable information are necessary to create 
à negative impression; adding more unfavor- 
able information does not make the evaluation 
ink, more negative. By comparison, favorable 
tece mation showed neither a primacy nor ba 
recency effect. Favorable impressions are cre- 
ated much less rapidly and increases in posi- 
tive evaluations do occur with additional in- 
formation, 

The results for applicants of contrasting 
Subjective personal history information sus- 
Rest a recency effect for unfavorable informa- 
tion. There is a greater change in the initial 
impression created by valid information when 
Subjective information is presented in a favor- 
able-unfavorable order than when the same in- 
°rmation is presented in an unfavorable- 
favorable order. Although the information is 
exactly the same, the effect of unfavorable in- 
rmation is greater after a positive impres- 
son has been established than is the effect of 
Positive information after a negative impres- 
Sion is created. Thus, while both recency and 


primacy effects can be created by unfavorable 
information (after both positive and negative 
initial valid sets have been established), it is 
difficult to conclude that this demonstrates 
anything beyond the strength of unfavorable 
subjective information in creating a contrast- 
ing situation. 

While the assimilation of later information 
into early evaluations makes conceptual sense, 
the data presented here demonstrated no such 
effect for favorable information and suggest 
that unfavorable information produces a con- 
trast effect, given either a positive or negative 
initial impression. It is recognized, however, 
that the stimulus material employed in the 
research reported here may have been too 
obvious (e.g., a college graduate vs. did not 
graduate from high school) and that more 
subtle trait descriptions (e.g., honest vs. dis- 
honest) do in fact produce primacy effects 
which may be due to assimilation of later in- 
formation into early impressions. 

The findings of the research reported here 
also serve to reinforce the validity of two 
other current issues in the conduct of labora- 
tory experiments. First, they provide ample 
evidence in support of Cohen’s (1968) state- 
ment that ANOVA tables should include a 
measure of the proportion of variance ac- 
counted for by a treatment effect. In this 
two variables shown to have an 
of impression formation gen- 
lly significant results, but did 
not practically reduce the variance in the in- 
terviewer’s judgments. Second, these results 
indicate that a great deal of caution is neces- 
xtending the conclusions of laboratory 
experiments of theoretical interest to applied 
settings where the goal may be similar, but 
the task and the tools used to perform the task 
are different. The laboratory setting or simu- 
lation experiment may appear to be very real 
or natural in its conception, but it does not 
eliminate the need for testing these findings 
and hypotheses in natural settings (Barker, 
1968; Garnsey & Hibbs, 1967; McGuire, 
1967; and Williams & Rausch, 1969). 
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RELEVANCY IN THE TEST OF MOTIVATOR-HYGIENE 
THEORY 


BENEDICT S. GRIGALIUNAS 1 AND FREDERICK HERZBERG 


Case Western Reserve University 


Motivator-hygiene (M-H) theory 
the sequence of events methodology, 
utilized, inconsistent resul 
cedures have been 
attitudes. Therefore, 
measure of relevanci 
ducing the empirical inconsisten 
ings of items measuring jo 
In addition to irrelevancy, 
cate M-H theory predictions seems 
social desirability, value systems, an 
items by the respondents. 


The fundamental prediction of the moti- 
vator-hygiene (M-H) theory of Frederick 
erzberg, initially reported in the Motivation 
to Work (Herzberg, Mausner, & Snyderman, 
el, has undergone well over 50 replica- 
ns and is, perhaps, the most replicated 
study in contemporary industrial psychology. 
The sequence of events methodology and its 
Variations has been largely utilized (though 
not exclusively) throughout these replications, 
and consequently, this has led to the criticism 
that the M-H theory is “shackled to the story 
telling methodology [Dunnette, Cambell, & 
i 1967, p. 148].” Also, there exists evi- 
neath e of inconsistency in results where alter- 
ve methodologies are utilized. 

of Herzberg et al. (1959) chose the sequence 
f events methodology in their original study 
Or three reasons: (a) Attitudinal research is 
‘ue yet sure of what an attitude is and how it 
ione. differ from similar concepts such as opin- 
S, interests, feelings and tendencies; (b) 
ee can one know that someone has an atti- 
ar © or feeling about something? Employees 
tes given predetermined choices of jol 
ors to rate or check, and as 4 result, atti- 


tu k ult, 
Gan are given when none may exist; (c) 
tural noise responses are too often the 
articularly 


ei of rating scale items, Pat" 

(He, they are irrelevant to the job situation 
erz 

sala et al., 1959). 

g Requests for reprints should be sent to Benedict 


v eS rigaliunas, Department of Psychology: Case 
Stern Reserve University, Cleveland, Ohio 44106. 


predictions have been replicated through 
but when alternative methodologies are 
ts are often obtained. Item rating and rating pro- 
accused of producing irrelevant data for the study of job 
a study comparing the two methodologies was made. A 
e was obtained in order to determine its effect in pro- 
cies. The results showed that ratings and rank- 
b attitudes are heavily contaminated by irrelevancy. 
the failure of rating and ranking of items to repli- 
to be further due to the biases produced by 
d misinterpretations or reinterpretation of 


Because of the above problems in attitudinal 
research, the criteria for a job attitude se- 
quence were defined as follows: it had to be 
an objective event, something had to have 
happened; it had to be bounded by time (be- 
ginning, middle, and end); the feelings had to 
be exceptionally good or bad; and the re- 
spondents’ feelings about his job had to be 
directly affected. As a result, the focus of the 
data, because the respondent was personally 
involved, was on the existence and the change 
of a feeling. The criteria ensured that a feel- 
ing was being tapped (Herzberg, 1966). In 
this manner, factors emerge out of the data 
rather than being determined a priori. 

Herzberg’s methodological rationale has led 
Dunnette and Kirchner (1965) to state: 


1 contribution of the study is that 
fied were allowed to emerge from 
descriptions of actual job situations rather than 
being based exclusively on responses to checklists or 
sets of statements developed ahead of time by the 
investigator. The job factors derived by Herzberg’s 
classification are more likely, therefore, to reflect 
things in the job environment leading to employees 
approach and avoidance behaviors [p. 152]. 


A more fundamenta 
job factors so identi 


Yet, many if not most of the research stud- 
ies whose results are inconsistent with the 
M-H studies have utilized rating and ranking 
procedures of factors or items. A few, for ex- 
ample, are: Burke’s (1966) ranking of 10 job 
characteristics in order of importance for the 
self; Dunnette et al’s (1967) Q-sort decks 
for satisfying and dissatisfying job situations; 
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Ewen, Smith, and Hulin’s (1966) job descrip- 
tive index with GM faces scale; Friedlander’s 
(1965) questionnaire based on 14 work en- 
vironment characteristics and their impor- 
tance, and Friedlander’s (1964) ratings of 
importance of 18 factors; Graen’s (1966) tat 
ings or rankings of importance of Herzberg’s 
factors; Rosen’s (1963) rating of importance 
of absence of items; and Wernimont’s (1966) 
forced-choice and free-choice rating of Herz- 
berg’s factors. 

These methodologies have been accused of 
producing irrelevant data by Whitsett and 
Winslow (1967). Especially cited is the Wer- 
nimont (1966) and the Dunnette (1965) stud- 
ies in which the problem of irrelevant rating 
of items is most apparent. Wernimont ob- 
tained responses to both forced-choice and 
free-choice items from 50 accountants and 82 
engineers in self-descriptions of past satisfy- 
ing and dissatisfying job situations. He con- 
structed two forced-choice checklists, each 


of items with each pair 
consisting of one motiva 


ith a given 
» the individual 
n two statements 
t to the situation 


Nevertheless, Wer 
that since both 
had been checke: 


of this bias. In the light of his own words, the 
opposite is a more logical argument since the . 
irrelevancy is the bias as he points out so well 
in his discussion of his use of the forced- 
choice methodology. 

In order to examine the irrelevance phenom- 
enon, Stern (1968) used 20 statements taken 
from a study by Dunnette (1965) with high 
factor loadings on 10 factors. He had each S 
write both a positive and a nega 


gative incident 
and then rate each statement on a scale of 


l-5 of increasing importance of that state- 
ment in producing the feeling of satisfaction 
(dissatisfaction), However, he reserved a rat- 
ing of O to those items which § felt to be to- 
tally irrelevant, 

As a result, Stern (1968) found that Ss 
themselves rated approximately 50% of well 
over 9,000 possible ratings as being totally 
irrelevant to the high or low incidents de- 
scribed. The remaining 50% of the ratings 
had 7 or more fa 
10% of the tot 
rated 


unhappy sequences, 


The purpose of 
whether 


reason fo 
events methodolog 
Sponses are too o 


e sequence 3 
Y, that cultural noise k 


ften the Product of ratine 
Scale items when they are irrelevant to the j° 
situation, can be tested, 
METHO 
Hypotheses ğ 
Hypothesis 1. Th is coding iy 
yield the Predicted ee coe 


h 
theory result—the mor 
Significantly more motiv’ 
V incidents will consist 


incidents will consist of 
Sequences while the 


al 
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poar and hygiene statements, but especially 
ki ator statements will be rated highly for both 
high and low incidents. 

oe 3. When only the relevant ratings are 
eae the predicted M-H theory outcome will result 
a incidents will consist of significantly more 
ere Or sequences while the low incidents will 

sist of significantly more hygiene sequences. 


Procedure 


coo Ss, junior and senior students at Case 
dine Reserve University, were requested to par- 
oa mt the study. Their mean age was 21 yr. 
The T Hee were male and 38 were female students. 
oped ae Experience Questionnaire (CEQ) devel- 
data A ope (1968) was utilized to gather the 
follows ae the CEQ consists of two parts: Part 1 
intervie he outline of the Herzberg et al. (1959) 
t w and elicits two incidents, one high and one 
in tig ih only difference is that the CEQ incidents are 
richer briefer written protocol form rather than the 
2 iiri more lengthy incidents of the interview. Part 
ions oe examines the incidents in terms of the rea- 
The Aa by the respondents for their feelings 
ead S are asked to rate 20 statements per experi- 
duci as to the importance of that statement in pro- 

cing the high or low feeling for that experience. 
it ad statements include five motivator and five 
iar factors, each factor represented by two 
ei vaia The items used had high factor loadings 
(1967 -H factors as developed by Dunnette et al. 
colle ), but were modified in order to better fit a 

ge population.? The factors were: 


Motivator Hygiene 
Achievement Co-workers (Peers) 
Responsibility Supervision 
Work Itself Status 
Recognition for Salary 


Achievement 


Personal Growth Company Policy and 


Administration 
awe sets of statements were included, one set to 
P FaN for happy (high) experience and one set for 
are Ppy (low) experiences. Generally, the two sets 
of Written as opposites of each other and the order 
m Statements differs for each set. Each of the state- 
jones was rated on a scale from 1 to 5 of increasing 
see of that factor in producing the feeling 0 
Sfaction (dissatisfaction). or 
aha out to obtain a measure of relevancy, Ss igi 
with ow incidents and the two sets of statemen 
juq Out Ss’ ratings were given to SIX judges. ae 
iota Was instructed to read each incident and oe 
Faroe statements and then only check those 
ments that are relevant to the incident described 
Ee Of the six judges, three were advanced ae 
t en ann who were well acquainted with a 
g a e expert judges) and three Were gi 


? Status and personal growth were not included in 


tk 
ne Dunnette et al. (1967) study. 


TABLE 1 


MOTIVATOR, HYGIENE, AND MIXED SEQUENCES 
IN HIGH AND Low FEELINGS FROM 
CONTENT ANALYSIS 


Sequence | High Low | Total 
| feelings feelings 
Motivator | 45 | i | 59 
Hygiene | 18 | 53 | 71 
Mixed l e | “40 s 
Total | 71 m | a8 
= 33.51, p < .0U1. 


graduate students who were totally naïve of M-H 
theory (the naïve judges). This combination of 
raters was used in order to ensure that there would 
be no source of bias resulting from a rater’s familiar- 
ity with the M-H theory. 

Two different expert judges were then given only 
Ss’ high and low incidents and were instructed to 
content analyze the incidents according to the Herz- 
berg methodology. The coding scheme was as follows: 
motivator sequence, hygiene sequence, mixed (both 
motivator and hygiene content) sequence, and un- 


codable. 
RESULTS 
Hypothesis 1 

Of the 81 Ss requested to relate both a high 
and a low incident, 76 high and 81 low inci- 
dents were obtained. The two content analy- 
sis coders were in perfect agreement on 83% 
of their coding. The differences between the 
two coders were then resolved and 100% 
agreement was reached. Five high and four 
low incidents were categorized as uncodable 
because the incidents did not meet the afore- 
mentioned criteria for an incident. 

Table 1 presents the coding of the high and 
low incidents and Figure 1 presents the same 
data in percentages according to the manner 
of presentation in Herzberg (1966). Since the 
mixed category is almost equally divided be- 
tween the high and low sequences and con- 
tributes relatively little to the chi-square, it 
was excluded from Figure 1A. 

It is evident from the data that the basic 
M-H theory result has been replicated. Of 
the high incidents, 71% consisted of motivator 
sequences and 29% consisted of hygiene se- 
quences while the low incidents consisted of 


3 Five Ss related only low incidents and no high 


incidents. 
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Low Incidents 


High Incidents 


10; 
21% Motivator 7 % A 
Hygiene 29% 
19% 
4 
a 
| Motivator i 71 io B 
61% L | 
39% Hygiene 29%, 
é 
28% | Motivator g o% F 
i or 
7 Hygiene 33% 
72 
4 To 


Fic. 1. Percentages of motivator and hygiene se 


79% hygiene sequences and 21% Motivator 
sequences. The chi-square is 33.51, significant 
well beyond the .001 level. 


Hypothesis 2 


The Ss rated the Corresponding sets of 
statements for 72 high and 75 low incidents.* 
Each statement was rated on a scale from 1 
to 5 of increasing importance of that state- 
ment in producing the high or low feeling. 
Although the ratings of 2 and 3 can be con- 
sidered as indicating some importance, only 
the ratings of 4 (very important) and 5 (ex- 
tremely important) were used in the forth- 
coming analysis. Tf irrelevant rating does 
occur and has a significant effect on the data, 
it would manifest itself in these extreme rat- 
ings. 

The items rated 4 and 5 by Ss were used to 


categorize the incidents into the following 
types of sequences: moti 


Similarly, if the items 
both motivator and h 


4Four high and 
their corresponding statements rated 
5 Three high and five low inciden 
uncodable category because they ha 
rated 4 or 5, 


six low incidents did not have 


by Ss, 
ts made up the 
d no Statements 


quences in high and low 
(B) Ss’ 4 and 5 ratings, and (C) judges relev 


feelings from (A) content analysis, 
‘ancy ratings, 


sequence was categorized as 
The results are shown in 


cate that an M-H th 
There is no si 


a mixed sequence. 
Table 2 and indi- 
eory result does not occur. 

gnificant difference in the num- 
ber of motiv; 


ator and hygiene sequences be- 
tween high and low incidents, 
Since Ss’ ra 


Presentation in Herzberg (1966). Again, a” 
M-H theory result is not obtained, 


TABLE 2 
Motivator, HYGIENE, 


AND MIXED SEQUENCES 
IN HIGH AnD Low FEELINGS FRogt Ss) 
4 ann 5 RATINGS 
SSS 
Sequence High Low Total 
feelings feelings 
Motivator 23 21 44 
ygiene 5 8 13 
Mixed 41 41 82 
Total 69 70 139 
an 


= _ a eee 


i 


: 
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Table 3 presents the frequencies of the 4 
and 5 ratings in a chi-square table which 
yields a chi-square of 12.77, significant be- 
oon the .001 level. This data, as well as Ta- 
a 2 and Figure 1B, indicate that motivator 
actors obtained through ratings are more im- 
portant in producing both high and low feel- 
e Also, both motivator and hygiene factors 
hae! to be important in contributing to 
a eatin and dissatisfaction as claimed by 

e “traditional” theory of job satisfaction. 


Hypothesis 3 


Eck high and 81 low sequences were read 
ieee judges and their corresponding sets of 
a ee were rated on relevancy. A cri- 
ase of 4/6 or better agreement of judges’ 

gs was set in order to determine whether 


TABLE 3 


FREQUEN 
SQUENCY OF MOTIVATOR AND HYGIENE 
IN Hien AND Low FEELINGS FROM Ss’ 
4 AND 5 RATINGS 


Factor High Low Total 
feelings feelings 
Motivator 415 246 661 
Tygiene 167 160 327 
Total 582 406 988 


N, - 
Note—y? = 12.77, p < .001. 


oa statement was relevant to the 
then ent. Both high and low incidents were 
ment: coded according to the relevant state- 
se s in the following categories: motivator 
os hygiene sequence, mixed sequence, 
rel uncodable.® That is, if all the items rated 
th evant by the judges were motivator items, 
Sat the sequence was categorized as a moti- 
telev sequence. Similarly, if the items rated 
RA ant included both motivator and hygiene 
isd then the sequence was categorized asa 
dicat ag iam The preliminary analysis ifi- 
ence; ed that there were no significant —, 
rel s in the number or kind of factors rated 
s me by the expert and the naive judges. 


d of three high 
r no statements 
there was no 
levant. 


6 
Sad ge ee category consiste 
were anh incidents where eithe 
agree ed as relevant by the judges °F 
ment on any statement as being re 


TABLE 4 


MOTIVATOR, HYGIENE, AND Mixep SEQUENCES IN 


Total 


cy RATINGS 
Sequence High Low 
ie ğ | feelings feelings Tota 
Motivator | 41 | 16 57 
Hygiene | 20 | 42 62 
Mixed | 12 15 27 
| B | 73 146 


Note.—x? = 19.10, p < .001. 


The results are presented in Table 4 and 
Figure 1C. High incidents consist of 67% 
motivator sequences and 339 hygiene se- 
quences, while low incidents consist of 72% 
hygiene sequences and 28% motivator se- 
quences. The obtained chi-square of 19.10 is 
significant beyond the 001 level. Thus, when 
only the relevant statements are analyzed, 
the results are very similar to those obtained 
by Herzberg (1966) and the replications. This 
is clearly illustrated by comparing the results 
of Hypothesis 1 with results of Hypothesis Sh 

The mean number of statements rated 4 
and 5 by Ss and the mean number of state- 
d relevant by the judges are pre- 
e 5. The Ss chose a ratio of 
approximately three statements to one chosen 
by the judges. This discrepancy is due to Ss 
giving more of the high ratings to motivator 
factors as indicated in Table 3. 

le of the irrelevancy of the 


A good examp 
ratings given by Ss can be illustrated by the 


following case: A coed rel 


ments rate 
sented in Tabl 


TABLE 5 


COMPARISON OF THE MEAN NUMBER OF STATEMENTS 
RATED 4 AND 5 py Ss TO THE MEAN NUMBER OF 
JupcEep RELEVANT BY THE JUDGES 


STATEMENTS 
= = = - 
Incident Group | x 
ee | 
High Ss | 8.08 
Judges 2.74 
Low Ss 5.41 
Judges 1.79 
Total Ss 6.77 
Judges 2.3 


ating a low incident i 
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describes a time when she started dating other 
male students and came to the realization that 
she perhaps didn’t love her steady boyfriend. 
Their relationship became tense and distaste- 
ful. Soon she couldn’t stand to be with him 
anymore and the relationship was broken. She 
then rates the following statements 5 (ex- 
tremely important) : 


I did not gain a feeling of worthwhile accomp- 
lishment from my work. 

The various facets of my work were not chal- 
lenging. : 

Lid that I had not learned anything. 

I did not enjoy the kind of work I did. 


The following statements are rated 4 (very 
important): 


I was told that I had done a poor job. 
I was dissatisfied because my work gave me 
very little responsibility. 


The judges, on the other hand, rated the fol- 
lowing two statements as relevant: 


I got along very poorly with a fellow 
A close bond between myself and someon 
age group was broken. 


student. 
ein my 


The S rated both of the abov 
3 (rather important). 
It could be argued that the items rated 


are, indeed, relevant to the respondent. For 
example, within the cont 


coed certainly did not en 
she did or perhaps she felt that she had done 
a poor job of breakin 
The respondent reint 
make them seeming] 
while the researcher cate 
to work-related behav} 
stick to the manifes 
this instance, an i 
lead to a work itsel 


A concise summa 


€ two statements 


ry of the findings follows: 


1. The traditional M-H theory analysis has 
resulted in a replication with this student 
population. 


at ings by Ss with- 
out considering relevancy did not follow the 


M-H theory predictions, 
3. The analysis of the stat 


ed in a replicati 
theory predictions, 


ements judged 
on of the M-H 


Discussion AND CONCLUSIONS 


It is assumed that rating scale procedures 
will lead Ss to utilize the lower end of rating 
scales to avoid the unreasonable position of 
having to rate the importance of items which 
are in fact irrelevant to the experiences they 
are describing. However, the evident fact AS 
that the presence of the low rating scale points 
does not prevent Ss from rating as important, 
items that are actually irrelevant to their 
experiences. It is difficult to even conceive of 
any experience to which 5-10 factors could be 
relevant. The Ss have to misinterpret or re- 
interpret the statements in order to produce 
the exaggerated rating of factors. 

When faced with an item, Ss seem to have 
a great deal of difficulty in focusing back to 
the incident and determining what is impor- 
tant in the incident as Opposed to what is im- 
portant to them generally. The force of social 
desirability or the pull of their value systems 
is stronger than the directions. Hence, quite 
automatically, Ss rate the general importance 
of the item to them rather than the impor- 
tance of the item to the incident they have 


described. 

Tt is not surprising, therefore, that Mali- 
nowsky and Barry (1965), Friedlandet 
(1965), a 


and others find the intervening varia 
ble of job level affecting the importance 9 
extrinsic and intrinsic factors. What they are 


that both motivato" 
equally important fo" 
» More important, thie 
eelings that result ™ 


fest 
bscured by manifes 
d social desirability of ite” 


or job attitudes in gen . E 
eral, the pr 
lems of methodol ie oo 


» Overrating þ S: 1 misty 
i o s, and 
terpretation or reinterpre T 


erpretation of items bY 


ie 
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can be corrected, the use of these techniques 
in job attitude research will remain a stum- 
bling block to understanding job attitudes. 
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NEW APPROACH TO THE PREDICTION OF 
MEDICAL SPECIALIZATION: 


STUDENT-BASED STRONG VOCATIONAL INTEREST BLANK SCALES 


GARY T. ATHELSTAN t anp GERALD J. PAUL 


E ion i th Occupations, American Rehabilitation Foundation, 
search and Education in the Heal 2 A 
Center for Rese Minneapol 


In an attempt to devise means of predicting eventual specialization for medical 


students, longitudinal data from 1,583 Ss were studied. Specialty members 
was determined in a follow-up 11 yr. after the Strong 
Blank (SVIB) was administered. SVIB scales for surgeons, 

pediatricians, and psychiatrists were developed, using the items wh 
tiated the specialist groups as freshmen. Senior-based sca 
but predicted no better than those using freshman res; 
student-based scales were reliable on retest, but on mea: 
overlap, they did not perform as well as standard SVI 
However, their predictive validity was superior to that o 


SVIB scales for medical specialists. 


Some 20 yr. ago, under a contract with the 
United States Army and the Surgeon Gen- 
eral’s office, Strong and Tucker (1952) devel- 
oped the Strong Vocational Interest Blank 
(SVIB) scales for surgeons, internists, pa- 
thologists, and psychiatrists. Their criterion 
groups consisted of 1,500 established medical 
specialists who were diplomates in their re- 
spective specialties, while 500 physicians-in- 
general comprised the reference group. The 
ensuing scales correctly identified, on the 
average, 81% of the specialists; that is, an 
individual generally earned his highest score 
on the scale for his specialty. The average 
scale intercorrelation was —.17, and the 
scales held up well on cross-validation. In 
general, the scales appeared very promising 
for their intended uses in allocating drafted 
physicians to different specialty residencies in 


the armed forces, and in Career-choice counsel- 
ing of medical students, 


As part of the study, the SVIBs of 783 
medical students were gathered for the pur- 
pose of later follow-up to determine how well 
the scales predict eventual specialty choice. 
When these students were followed-up in 1960 
(Tucker & Strong, 1962), it was found that 
their medical specialist Scores did not cor- 


1 Requests for reprints should b 
Athelstan, Director of Research Center for R 
and Education in the Health Occupations, bute 
Rehabilitation Foundation, 1800 Chicago Ay, A 
Minneapolis, Minnesota 55404, è enue, 


e sent to Gary T, 
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ship 
Vocational Interest 
obstetricians, 
hich differen- 
les were also tested, 
ponses. The resulting 
sures of distributional 
B occupational scales, 
f previously developed 


respond with their type of practice. The scales 
did not discriminate the specialist groups, and 
the majority of the 1950 students earned their 
highest score on the psychiatrist scale. Re- 
cently, Campbell (1966) completed a reanaly- 


sis of all the original data. No significant er- 
rors were found, 


In attempting to 
the scales to predict 
Strong (1962) sug 


account for the failure of 
specialization, Tucker and 

gested that changes in the 
criterion Stoups themselves could be a majo! 
factor. In 1950, when the original criterion 
groups were studied, only about 15% of phy- 
siclans were certified specialists. By 1960; 
When they followed-up their 1950 students: 
the rate of claimed Specialization had risen tO 
about 75%, and 47% of all physicians were 
actually certified in a specialty. Strong an¢ 
Tucker hypothesized that medical students 
began Specializing for different reasons tha” 
those prevaili 


rns of interest assoc” 
may have 
tantially, : 

Another explanation hinted at by Tuck! 
“ee Strong (1962 » and reinforced by Camp” 


bell ( 1966), sug jë” 
s : gests t le SI 
cialty interests 1 hat measurab the 


physician has be 
a time; that is, 
membership in 
stantially true, 


may not appear until # 
en in specialized practice by 
Interests may be modified 4 
à Specialty, Tf this were a 
of Course, it would never ” 
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possible to predict specialization with suffi- 
cient accuracy to make the effort practically 
worthwhile. y 
Although it may be that the fully developed 
os of a mature specialist do not take 

eir final form until after several years of 
me the characteristics that eventually 
ie physician to specialization may already 

Present and measurable in a student, but in 
F different, or possibly immature form. In- 
na the manifest differences between such 

verse medical specialists as, for example, 
fae and psychiatrists, seem great, not 
as well a but in personality variables 
rE uch differences should certainly be 
iee nt in some measurable degree in stu- 

who are future psychiatrists and sur- 
on Furthermore, the differences among 
ape if measurable, may be of a more 
asic nature, and therefore less subject to the 
rapid changes which seem to occur in the 
adult specialist groups. 

The problems to which Strong and Tucker’s 
ieee work was addressed are even more 
Rina on today. Well over 90% of new physi- 
may toes plan to specialize, and the student 
aa ge to embark on a somewhat differen- 
year j training program as early as the third 
nd in medical school. W hile the pressures to 
Ee early specialty choices are increasing, 
ae is no corresponding increase in oppor- 
the pe students to gain early exposure to 
dua ee This has created a 
«l TON, which is very stressful for students. 
Rc regarding professional — future” 
Sekar as a factor interfering with 
sam a school progress by 42% of a large 
Hutchi of medical students _ (Johnson 2 
ne hins, 1966), the most important o 

arly two dozen such factors which were 

studied, 
o A Tthiotigh there has been little sp 1i, 
EA iy satisfaction among medical peee 
that ES is anecdotal and indirect evi ence 
also lee sich with specialty apies man 
numb e an important problem. Significan 
Ath sae of specialists in some fi 
Fi elstan, 1968; Miller, Coker, Greenberg, & 
ten Onnell, 1966) either report deep dissatis- 
action or demonstrate it by changing spe- 
Clalties, y 


The present investigation W 


specific study 


as undertaken 


in the belief that the problems of predictin; 
and/or assisting the specialty choices of phy 
sicians are important enough to warrant ye 
another attempt to apply the SVIB to tha 
end. Since the earlier work in this area seemec 
to have ruled out the usefulness of the tradi 
tional SVIB approach to the measurement 0 
medical specialists’ interests, this study em 
ployed a different method: The specialis 
scales were based on the SVIB items whicl 
differentiated students who later became spe 
cialists; that is, the responses of the student 
were used, rather than those of mature spe 
cialists. 
METHOD 

Subjects 

The Ss were 2,401 freshmen comprising nearly th 
entire entering classes of a representative, stratifie 
sample of 24 medical schools (Hutchins, 1964). The 
were tested with the SVIB and other instruments i 
1956 and again in 1960 as part of the longitudin: 
study of medical students conducted by the Associ 
ation of American Medical Colleges (AAMC) 
Subsequent specialization was determined by tł 
AAMC as of 1967, in a review of American Medic: 
Association records. 

In order to compare directly the effectiveness 
freshman-based and senior-based scales, the analyst 
were restricted to 1,583 Ss who had both 195 
(freshman) and 1960 (senior) SVIBs, and who lat 
confined their specialization, if any, to one feli 
The original sample of freshmen was reduced b 
617, a figure which included those Ss who left scho 
before graduating; who were not seniors in 1960; ¢ 
who, for various other reasons, were not availab 
for the retest. An additional 205 Ss were not ir 
cluded in the analyses because they reported two € 
more areas of specialization upon follow-up. Th 
rate of specialization for the final sample was abou 


80%. 


Procedure 

The 1,583 Ss were grouped according to specialt 
reported in 1967. The SVIB item response compar 
sons were made for each specialty group, with a 
other Ss serving in turn as the reference group. As 
standard procedure in the construction of SVI 
scales, items were selected for inclusion in the scal 
on the basis of how well they differentiated tl 
responses of the specialist or criterion groups fro 


2 The authors gratefully acknowledge the assistan 
of Edwin B. Hutchins, now professor in the Resear 
Office of the Vice President for Student Affairs 
Iowa State University, and David P. Campbell, I 
rector of the Center for Interest Measurement R 
search at the University of Minnesota, in obtaini 


and working with these data. 
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TABLE 1 


: 
NuMBER OF [rems SHOWING RESPONSE DIFFERENCES OF TEN I 


ERCENT OR MORE IN Irem 


RESPONSE COMPARISONS BETWEEN THE SPECIALTY CRITERION Groups 
zai AND ALL OTHER Ss 


| 
> 4] Obstetrics- | ee. 
General | Internal General | Gyne- (Psychiatry | Pediatrics 
Group practice | medicine | surgery | loa i 
| 1j 
: 7 + | 5 | a | 4 g 
Freshman criterion vs. reference group 5 | i 7 | a P | vd | 2 
Senior criterion vs. reference group | | | | 3 


Note——Maximum possible = 288, 


es of all others. For example, if 70% of 
aes ol responded “Tike” to the school- 
subject item physiology, while only 50% of other 
future physicians responded similarly, that item 
would be said to yield an item response difference of 
20% between surgeons and physicians-in-general ; it 
would therefore be an appropriate item to include in 
the surgeons’ scale. To be included in a scale, an 
item must always differentiate at a statistically sig- 
nificant level, at least. Usually, a minimum of 10% 
item response difference is required, even if sta- 
tistical significance is reached below 10%. It is con- 
sidered desirable to have all of the items in a scale 
produce response differences of 15% or higher, but 
items of lower power are acceptable, if they are 


needed to provide a sufficient number of items for 
an adequate scale, 


In order to dev 
with the 1966 revi 
Were confined to t 
and present forms, 


TABLE 2 
Eases CHARACTERISTICS OF THE STUDENT- 
ASED SCALES FOR Four MenicaL SPECTALTIES 
i SN 
Specialty 
Student-based scales = 
Ob- 
General Stetrics | Psych; 
general, S| Psychi-| Pegi. 
Surgery | Gyne. 3 S 
cology | #*Y | atrics 
eee 
shman B 
No. of items per scale 24 45 
Minimum item response i is a4 
„difference (%) 10 11 11 
enior 10 
No. of items per scale 33 5 
Minimum item response i iii ae 
difference (%) 10 12 14 10 


scored on the freshma 


n anior 
n specialty scales, the sen 
specialty scales, 


and a graduation scale, which wee 
based on items differentiating between all freshmen 
and all seniors. Group means, scale intercorrelations: 
test-retest correlations, distributional overlap ae 
Predictive hit rates Were calculated for each pect 
Since we were especially concerned with scale vali : 
ity, particular emphasis was placed on the hit ie 
that is, the Proportion of Ss whose highest score W3 


on the scale corresponding to their specialty. 


ReEsutts 


There were six specialty areas with sai 
cient representation to serve as possible E 
terion groups, These were: general pa 
(n = 277), Seneral surgery (n = 166), oie 
nal medicine (n= 257), obstetrics-gynecolo® 
(n= 86), Pediatrics (7 = 87), and psychia yi- 
("= 113). As shown in Table 1, the compat.” 
sons of the item responses of general pen 
tioners and of internists with those of 6 
others did not yield a sufficient number, 


: ee n 
differentiating items to consider develop” 
scales for thos 


d 

an! 

tricians (OB), the Psychiatrists (PY), nile 

the pediatricians (PD) from all others, veri 
not great seemed 


oth levels, 50 
_ The scales were designed to have Oa 
Items, with the items having the maxi! at 
level of item response difference possi let) 
that scale length. Table 2 shows that all 5% 
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but GS met the desired criteria of length and 
level of item response difference. Though it 
contained only 24 items with response dif- 
ferences of 10% or greater, and thus did not 
meet the length criterion, the freshman GS 
scale performed satisfactorily in all subse- 
quent analyses. 

pE hese initial scales were evaluated by de- 
termining what percentage of each criterion 
group scored higher on its own scale than on 
the other three scales. Both the freshman and 
the senior scales performed adequately on this 
measure, with 56% of the specialists scoring 
highest on their own scale. Since the overall 
performance of the two sets of scales was very 
similar in this and subsequent analyses, with 
the freshman scales usually slightly better 
despite their shorter length, further study of 
the senior-based scales is not included in this 
report. 

_Although these proportions of Ss scoring 
highest on their own scales are much lower 
than those obtained by Strong and Tucker 
(1952), it must be pointed out that their 
rate of 81% was based on an analysis of only 
two scales at a time. A similar evaluation of 
the student-based scales resulted in these 
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scales correctly identifying 75% of the cri- 
terion groups. 

The standard adult PY scale was scored for 
comparison along with the student-based PY 
scale; the placement rate of the student-based 
scale was somewhat superior. While 60% of 
the freshman PY group earned their highest 
score on their own scale, when the adult PY 
scale was substituted, the figure declined to 
46%. The two PY scales had only 11 items in 
common, comprising slightly over 20% of the 
total items in each scale, but they correlated 
58 with each other in the entire sample of 
freshmen. 

The graduation scale was devised in the 
hope that its relationship to other scales or to 
sample characteristics would reveal something 
of the nature of interest development or of 
interest stability among the students. This it 
did not do. However, the content of the scale, 
that is, the items to which the seniors re- 
sponded differently from the freshmen, is 
worthy of mention. The scale contained 31 
items with response differences above 10%; 
on 28 of these items, the difference was in the 
direction of decreased liking on the part of the 


seniors. Nearly all of the items which lost 


TABLE 3 


MEAN SCORES OF 


Four SPECIALTY GROUPS 
on FRESHMAN-BASED SCALES 


AS FRESHMEN AND AS SENIORS 


Freshman-based specialty scales 
Average 
Specialty S g other 
ASE General Obstetrics- Pediatrics Psychiatry specialty 
surgery Gynecology v 3 
Freshmen 
> 5 34 41 38 
Oaer] surgeons 50 a 3 fe Bs 
4 stetricians 43 3 a 3 ra 
Pediatricians 40 38 a 
P: SA 8 35 3 
sychiatrists 3 38 i H Hi 
erreia medicine T S a “A 
eneral practitioners O 55 A z 
gee physicians 41 38 
Sen 
Gene 48 4 30 40 37 
eneral surgeons 8 "6 sf a 1 
Obstetricians 42 E A D st 
Pediatricians 39 EH = 2 39 
Psychiatrists 37 7 = 3 35 
Internal medicine 39 et Aa n > 
General practitioners 44 37 z 3 = 
Other physicians 41 
4-yr. test-retest correlations .67 72 65 Ti = 
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TABLE 4 


RELATIONSHIP A ¥ v PE ZATION 
i ONSHIP OF PREDICTED SPECIALIZATION TO ACTUAL SE ECIALIZATI 
zi K: 


Predicted specialization 


i >| iatrics Psychiatry 
surgi Obstetrics-Gynecology Pediatrics 
Actual General surgery m 
specialization! __ = 5 s T w 
Çi Not 7 Not | i ent Tote 
Enter | Nt | Total | Enter eel Total | Enter enter | Total | Enter eter í 
| enter | | i 7 
| 7 | 2 5| 87 | 68 45 | 113 
166 49 37 86 42 8 5 
aeea i ed ah a ei 3] Si ei a 3 8 
o 
3 5 92 | 1342 
289 289 1080 | 1369 229 1139 1368 350 9 s 
me | e eee %4) 16 | 7| 94| “24 | “og | 92 
& + | ee 
55 455 | 27 55 037 | 1455 
7 64 1455 338 1117 1455 271 1184 1455 418 1 3 
e 2 | o 7) 100) 19 | si! 100] 99 | 41 | 100 
2 a ee - 
Note.—Total N = 1,455; specialty was not determined for 128 Ss, 


favor among the seniors had medical or related 
content, for example, the occupations of chem- 
ist; dentist; pharmacist; surgeon; and the 
school subject, zoology. The greatest decline 
in like responses was for lab technician, which 
lost 33% of the Ss who liked it as freshmen, 
The only activity which gained proponents 
among the seniors was bridge-playing: the 
other two items which changed reflected the 
increased tendency of the seniors to describe 


themselves as having mechanical ingenuity 
and many novel ideas. 


© same mea- 
sure for all of the regular SVIB vocational 
scales in this group was .67, 
Nearly all of th 
tively; the only 


and OB at .12 a 


e scales intercorr 
exceptions ar 
nd between 


elate nega- 
e between ES 
PD and PY at 


-ll. The mean intercorrelation of all nE 
scales is —.15 ; this may be compared to th 
—.17 reported by Strong and Tucker (1952). 

€ mean overlap between the distribu- 
tions of the criterion groups and their repe 
tive reference groups was 60%, as determi é 
by Tilton’s (1937) formula. The poorest or, 
Was GS at 67%, while the PD scale was oa 
best with a percentage overlap of 51%- fo 
the retest, the average overlap increased es 
71%. In comparison, the average distribu 
tional overlap of the regular SVIB scales K 
33%; the corresponding figure for Strong a” 


s 
Tucker’s (1952) adult specialist scales W% 
49% 


S was predicted p- 
which he earned his h 


a 
Predicted specialty» t 


Table 4 il 


edict 
> 4 illustrates how the scales pre 
for the indivi 


s| 
dual Specialties, One hundre 


A 
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(7%) were predicted to enter GS and actually 
did so. These Ss constituted 20% of the 491 
Ss who were predicted to enter that field and 
59% of the 166 who actually entered. The 
true positive hit rate for each scale can thus 
be obtained by relating the number of cor- 
rectly predicted entrants into each specialty 
to the total number of actual entrants. This 
rate for the OB scale, for example, is 49 out 
of 86, or 57%; the corresponding figure for 
PD is 48% (42/87), and for PY it is 60% 
(68/113). 

_All scales showed a high false positive rate. 
This is not especially serious for two reasons. 
First, all the cell entries are affected in some 
degree by the number of scales available; in 
this analysis the scale on which an S scores 
highest is the basis for predicting that he will 
enter the specialty. Second, for counseling 
Purposes it does no harm to consider several 
Possible specialties; it would be less desirable 
to Incorrectly rule out a specialty which could 

e an appropriate choice. The false negative 
tate is also more serious from a technical 
Point of view; while increasing the number of 
Scales in this analysis would probably lower 
the false positive rate, it could only increase 
the false negative rate. 
oe this analysis does not take into ac- 
ae the absolute level of any of the scores, 
s e figures may overestimate the amount of 

rror which could be anticipated in the coun- 
need use of the scales. Nevertheless, even 
ae in this oversimplified prediction 
oe igm, the performance of the scales ap- 
on to provide a significant improvement 

e accuracy of chance predictions. 

Summary AND DIscussION 

_ In an attempt to determine whether the 
Interests associated with specialization in 
Medicine can be identified early enough to 
assist the career choices of medical students, 
a new method of constructing SVIB medical 
Specialist scales was tested. Scales for GS, OB, 
et and PY were developed, using VIB 
ems which differentiated medical students 
Srouped on the basis of membership in those 
Specialties 11 yr. after completing the SVIB. 
nternists and general practitioners We? also 
studied, but their SVIB item responses did not 
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differ sufficiently from those of other physi- 
cians to permit the development of scales. 

Although the performance of the resulting 
scales was considerably poorer than that of 
the standard SVIB occupational scales, they 
demonstrated the possibility of predicting 
specialization from scales based on the re- 
sponses of medical students. These student- 
based scales may have greater predictive 
validity than the medical specialist scales de- 
veloped by standard SVIB scale construction 
techniques. On such measures as scale inter- 
correlations, distributional overlap, and pre- 
dictive hit rates, the student-based scales per- 
formed about as well as the adult specialist 
scales previously developed by Strong and 
Tucker (1952). The 4-yr. test-retest reliabil- 
ity of the student-based scales was nearly 
identical to that of standard SVIB scales 
scored for the same sample. 

The use of the student-based scales in 
counseling medical students is not clearly in- 
dicated at this time, although they certainly 
permit greater accuracy of prediction than 
that obtainable by chance, which seems to be 
the primary alternative means of prediction. 
Further research on these scales is needed for 
the following reasons: In the first place, in 
order to be of much practical value the cov- 
erage of specialties should be increased. It 
should prove possible, for example, to develop 

h major fields as anes- 


similar scales for suc 
thesiology, ophthalmology, pathology, and 


radiology- Secondly, the validity of these 
scales should be further tested. We hope to do 
this in a very direct way in the near future, 
by evaluating the predictive validity of the 
scales for other samples of physicians on which 
longitudinal data can be obtained. 

This investigation has demonstrated the 
feasibility of predicting specialization from 
medical school freshman SVIB_ responses. 
However, it is not likely that any amount of 
refining of techniques would result in signifi- 
cant improvement on the level of accuracy 
obtained here. The primary reasons for this 
are the wide range of job functions found in 
and the fact that interests 
probably covary more with the functional 
dimensions in medicine than with the formal 
specialty divisions. The interests of physicians 
gaged in such activities as research and 


each specialty, 


en; 
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teaching, administration, or straight na 
practice probably resemble the interests ; 
their colleagues in those functional areas, ir- 
respective of specialty membership, e2 
closely than they resemble the ae E 
their respective cospecialists. Severa ba 
(e.g., Schumacher, 1964) support this 
pothesis, and the future research program o 
the Center for Research and Education in the 
Health Occupations will deal specifically with 
that question in a follow-up of the Ss studied 
here. 
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, EFFECT OF PRICE INFORMATION ON PRODUCT 
QUALITY RATINGS 


E. R. VALENZI anp I. R. ANDREWS + 


Bowling Green State University 


discriminable in a blind-taste test condition and varying 
quality by 46 Ss. Each of the three products 
product, twice as a medium-price product, 
Confirming two hypotheses, taste quality 
product 


Three food products 
widely in price were rated on taste 
was identified twice as a low-price 
j and twice as a high-price product. 
ratings were positively related to price information given and to actual 
differences. Contrary to the third hypothesis, actual product differences ac- 
counted for approximately three times as much variance as price information 
between the laboratory setting and actual consumer 


sa. 


given. Several differences 
behavior may have led 
consumer quality judgments. 


Several studies have evaluated the influence 
judg- 


of information on consumers’ quality 
ments. Allison and Uhl (1964) showed that 
different brands of beer undistinguishable in 
the blind taste-test condition were rated sig- 
nificantly different when brand names were 
Provided. Makens (1965) presented Ss with 
samples of turkey cut from the same bird but 
identified as a known brand name and an 
unknown brand name. The sample labeled 
with the known brand name was rated su- 
perior on a hedonic scale. Petit (1958) showed 
that information could affect consumer pref- 
erence ratings if it was meaningful to the 
rater, Raffensperger and Pilgrim (1956) found 
that information affected consumers’ discrimi- 
nations even when it contradicted perceptual 
experience. These studies indicate that, be- 
Sides perceptual experience with the product 
Itself, product information has a significant 
influence on consumer quality judgments. 
Price is frequently suggested as an impor- 
tant variable affecting perception of product 
quality. Leavitt (1954) and Tull, Boring, and 
Gonsior (1964) used price as an independent 
variable with relatively homogeneous products 
ang found a positive price-quality relation- 
ship. Their experimental task was a paper- 
and-pencil one and did not allow Ss. to use 
the products evaluated. McConnell (1968) 
Presented the same brand of beer in bottles 


i Retest t 
Requests for reprints should be sent to 2 RD 
i ndrews who is now at the Department of A 
cs and Commerce, Simon Fraser University, Burnaby 


2 sas 
2, British Columbia, Canada. 
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to an underestimation of the effects of price cues on 


labeled with one of three relatively meaning- 
less symbols (L, M, P) to represent brand 
names, Each brand was labeled with a dif- 
ferent price and consumers chose a brand, 
drank the beer, and then rated it on several 
characteristics. Ratings were positively re- 
lated to the price information provided, dem- 
onstrating that price was a cue influencing 


the perception of quality. 
All the studies cited above used products 


that were homogeneous in at least two ways: 
(a) the products were identical or highly 
similar so that they could not be discriminated 
in a blind condition, and (b) the products did 
not vary widely in purchase price. The re- 
sults of these studies are therefore limited in 
generality to nondiscriminable products that 
are used for the same purpose and are ap- 
proximately equivalent in price. 

White (1966) reported an experiment that 
used products discriminable in a blind condi- 
tion. Two brands of adding machines were 
evaluated by users under three conditions: 
blind, correct brand labeling, and with brand 
labels reversed. In the blind condition the 
unknown brand was rated somewhat superior, 
in the correct labeling condition the unknown 
brand was rated as somewhat inferior, and 
when the labels were reversed the machine 
with the known brand label was rated far 
superior. These results indicate that even when 
there is a difference between products, quality 
ratings can be reversed by product informa- 


tion. 
The present experiment extended previous 
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research by testing three products used for 
the same purpose that varied widely in price 
and were discriminable in the blind-test con- 
dition. In addition, an experimental design 
Was employed that compared the effects of 
price information against the effects of actual 
Product differences (i.e., differences in qual- 
ity ratings measured in a blind-test condition). 

It was hypothesized that quality ratings 
would be positively related both to price in- 
formation given and to actual product differ- 
ences, but that price would have the larger 
effect. It was hypothesized that price would 
have a larger effect because actual product 
differences identified in a previous experiment 
(Andrews & Valenzi, 1970) were relatively 
small while the price information given in- 
cluded substantially different prices, 


METHOD 
Subjects 


Seventy-seven female, introductory psychology 
volunteers were divided into two groups, an Infor- 
mation group (n=46) that received correct and 
incorrect price information, and a Blind group that 
received no price information (n=31 
the range in Purchase pri 
The No Information gri 
verify that products weri 
test condition, 


For the Information group, one of the two inde- 


i product differences, 


utley margarine at 20¢ 
EverSweet butter and 


brands, while Imperia 


a pound. 
are local 
al brand. 
these products 


firmed with a single sti 
For the 95¢, 45¢, and 20¢ a pou 
tively, the blind-rated quality m; 
scale) were 5.82, 5.39, and 4.66 
nificant, F = 10.7 


Newman-Keuls 
showed that 
and the 45¢ 


procedure (Winer, 1962, 


s A g paired 
comparison method of stimulus presentation, this 


at the .05 level, 


The second independent variable was price infor- 
i ‘en. The Ss tasted each of the three prod- 
x times, twice identified as a high-priced 
spread, twice identified as a medium-priced spread, 
and twice identified as a low-priced spread. For both 
groups, order of stimulus Presentation was deter- 
mined randomly for each S. 


Procedure 


The products Were presented to Ss on 1-in. sun 
of plain white bread, spread somewhat more thickly 


than In addition, each 
Square of bread was placed spread-side down on a 


without looking at the spread. This procedure mini- 
mized visual cues, especially color (the Nutley mar- 
garine was noticeably lighter), and facilitated tast- 
ing. 

Each S received an aluminum tray with 18 squares 
of bread in three rows. For the Information group: 
the price per pound of the spread was shown on a 
strip of tape placed immediately above the bread. 
The Ss were told that they would be tasting 18 
different spreads Purchased from six different stores 
in the area. For this reason 18 
used, 6 high (95¢, 96¢, 98¢, 
medium (36¢, 38¢ 
(13¢, 14¢, 


As part of the introduction and instructions to the 
S Were told that this was the 
-Part experiment, that the first 


e deception, the instructions 


Paa a To explain the pres- 
the price information immediately above the 
of the housewives i? 
or brand 
that this 


tinued cooperation and high Motivation. To keeP 
comparable for thi 
information had to e provi 


- ntal questionnaire dat? 
bearing on the success of the i: 4 ARAA 


| 


4 
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The i 7 
ae interv al between tastes was set at 2 min. and 
feat Pace a stopwatch. The relatively long inter- 
ay ae was chosen to allow the natural sali- 
esses to remove the residual f i 
ior oe al food particles 
After tasti come 
ae tasting each sample, Ss rated it on a 9-point 
ae ity scale identical to Bruvold’s (1968) 
ater » equal interval scale for rating the taste of 
the ie ge that the word, water, was changed to 
ee , RE The scale ranged from 1, “this 
4 as a horrible taste” t “this 
Et aa e” to 9, “this spread has an 


RESULTS 


p eai he nine different combinations of 
ferent | Se products paired with three dif- 
twiceb evels of price information was rated 
group graen of the 46 Ss in the Information 
each of “i average of these two ratings for 
became th Aiea product/price combinations 
deta, <p e basic data for each S. These basic 
ot — analyzed in a 3x3 xX 46 analysis 
her ance design with one S per cell (Lind- 
he 1956, pp. 237-238). 
both, in Table 1, the main effects for 
Inspecti ependent variables were significant. 
showed E f of the cell means in Table 2 
version ies with only one exception (the in- 
hebreen means 4,59 and 4.67), differences 
direction adjacent cells were in the expected 
Positively As predicted, quality ratings were 
and al y related to price information given 
differene Positively related to actu 1 
inform ces despite the presence of the price 
N ation. 
ot conforming fully to the expected pat- 


1al product 


TABLE 1 
A 7 
NALYSIS OF VARIANCE SUMMARY T. 
FOR Propucr Quarry RATINGS 


ABLE 


= 
O — 
Factor ss | af |MS p |o 
Act J aaa 
ieee Product: (A) |131| _ 2 | 65.2 zz | 135 
Ss (Cy mation (B) | 45 |72 225 | 10.1% |.041 
AXB 134| 45| 2.97 
AXC 7| 4| 425 | 12.9% 020 
PG 247 | 90| 2.74 
AXBXC 193 | 90 | 2.14 
T 7 56] 180] -31 
i 923 | 
ee x 
=o enn) COE, O SS ae 
f the proportion ol 
i and is 


Vari: Mega-squa 
ariance quared ‘y a 
nce Accounted toe ie a an estima 


Rivi 
Men by ete a ade 


nt variable 


oy 


* 
P <.05. 
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TABLE 2 


Mean Propuct Quatity RATINGS FOR 
INFORMATION GROUP 


| 
Price information given 


Actual 

purchase x 
pee 05¢-$1.04 | sois | 13¢-22¢ 
95¢ 6.66 | 6.07 5.66 6.13 
45é 6.28 5.92 5.51 5.90 
20¢ 5.26 4.59 4.67 4.84 
X 6.07 5.53 5.28 


tern of results was the low-priced margarine 
vhen identified 


which received a higher rating y 
as a low-priced margarine than when it was 
identified as a medium-priced margarine. As 
explained below, this slight and nonsignificant 
reversal might have been an artifact of the 
rating scale. Also a possible artifact of the 
rating scale was the significant interaction 


between the two independent variables (see 
Table 1). 
The most surprising result was the noncon- 
firmation of the prediction concerning the 
relative effects of actual product differences 
and price information given. Contrary to 
expectation, the omega-squared values (Hays, 
1963, p. 325) jn Table 1 showed that actual 
accounted for 13% of 


product difference: 
quality rating variance while price informa- 


tion given accounted for only 4%. 

Since actual differences in product taste 

were relatively small and differences in price 

information given were relatively large, one 
to conclude that the effects 


might be tempted 
of price cues on consumer quality judgments 
are relatively minor. As discussed below, how- 


there were several characteristics of this 
periment which might have led 
timation of the influence of 


ever, 
laboratory ex 
to an underes' 
price. 
DISCUSSION 


In evaluating the results of this kind of 
study, it is important to know how Ss in- 
terpreted the purpose of the experiment. Data 
from a postexperimental questionnaire shed 
some light on that question. When asked, 
“What do you think is the purpose of this 
experiment?”, 21 of the 46 Information group 


in 
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Ss responded to the effect that they onp 
the experiment had something to do wi 

price and quality. Eight Ss guessed an met- 
rect hypothesis and 17 Ss accepted t 3 T 
pothesis implied by the instructions. Oft 3 

Ss who guessed correctly, only one said that 
she tended to rate according to price rather 
than on the basis of taste. In addition, the 
mean product quality ratings for Ss who 
guesse correctly were essentially the same as 
for those who did not. It seems unlikely, 
therefore, that deducing the correct hypothe- 
sis had a profound effect on the quality rat- 

g 

‘ evius research (Leavitt, 1954; McCon- 
nell, 1968; Tull, Boring, & Gonsior, 1964) 
had shown that product purchase price 


influ- 
ences judgments of product quality for non- 
discriminable products of approximately 


equivalent price. The present r 
tend that finding to relative 
food products which are dis 
which vary somewhat in price. 

It is clear, on the other hand, that the 
price cues given to Ss did 
scure the rather subtle di 
tastes. In fact, when 
high-, medium-, and low- 


esults may ex- 
ly inexpensive 
criminable and 


eans shown in Table 


taste differences among 
small, while the differ- 


be obtained 
products, for example, three diamonds whose 
purchase prices were identifie 
and $800. 

Second, a sample of coll 
responded without consid 
impact of the price differ 


ege girls might have 
ering the long Tange 
ences as a housewife 


would be more likely to do. In other words, i 
housewife in deciding between products priced 
at a 75¢ differential might calculate what thi 
price difference adds up to over the course n 
a year on a frequently purchased product: r 
college girl, on the other hand, might a 
the decision as a one time purchase ane 
therefore view the price difference as trivial. 

Also of possible importance is the fact oie 
the present Ss did not have to make a pur- 
chase decision and pay out their own money 
to back up that decision, Therefore, there was 
no need to distort product quality evalua- 
tions to justify a purchase decision. When i 
lady spends $40 for an ounce of l “roya 
queen bee jelly,” her perception of its cos- 
metic effect is apt to be distorted. 

Also relevant is the fact that these product 
tastes were compared to one another with 
only a short time interval between tastes. The 
typical consumer is more apt to base his prod- 
uct quality comparison on taste samples 
widely separated in time, thus reducing the 
accuracy of such comparisons, an 

Last, it is possible that the effect of price 
information given was reduced by what might 
be called a basement effect. As shown in Table 
2, there was little difference and even a non- 
significant reversal between the two lowest 
cell means—4.59 when the low-priced product 
was identified as a medium-priced product 
Fedo! When the same Product was identi- 
fied as a low-priced Product. Apparently: 


many Ss were unwilling to say that a neutral 
tasting 


; y 
ere not overestimated P3 
udy, 


Errect OF PRICE INFORMATION ON Propuct RATINGS 91 


To extend the above results, the experiment 
pula be replicated with products other than 
m ae beverages and with products which 
ae ed absolute price differences. Labora- 
Stee tees be cross-validated by field 
cient s ritical to a true understanding of the 
Sead ke price on product quality evaluations 
a e an experimental design which al- 
h | consumers to spend their own money in 
choosing among various priced products. Post- 
purchase product evaluations would have to be 
ol ea shortly after purchase and again 
Ea he S time interval. Also of interest 
ena e prades which compared price in- 
whieh ‘a against brand information to see 
wach has the stronger effect on product qual- 
ity judgments. 
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RELATIONSHIP BETWEEN JOB ATTITUDES AND 
TWO FORMS OF WITHDRAWAL FROM 
THE WORK SITUATION 


L. K. WATERS 


Ohio University 


Attitudes concerning the job in gener 


El 


Overall satisfaction and satisfaction 
situation were significantly related t 
extrinsic variables were related. For 
tudes toward specific aspects of th 
relationships when nonattitudinal v: 
satisfaction was significantly related. 


Vroom (1964) has proposed that 


workers who are highly attracted to their jobs 
should be subject to stronger forces to remain in 
them than those who are less attracted to their jobs, 
These stronger forces to remain should be reflected 
in a lower probability of behaviors which take the 
person out of his job, both permanently and tem- 
porarily [p. 187]. 


Reviews of the literature by 
Crockett (1955); Herzberg, 
son, and Capwell (1957); and Vroom (1964) 
have generally indicated some relationship 
between job-related attitudes and both termi- 
nation and absenteeism criteria. More recent 
studies by Hulin (1966) and Mikes and 
Hulin (1968) have supported the hypothe- 
sized relationship between attitudes and ter- 


Brayfield and 
Mausner, Peter- 


ported relationships 
as essentially replica- 


METHOD 


The respondents in this study wi 
visory female employees in one r 
national insurance company who 
job satisfaction questionnaire as 
study (Waters & Waters, 
ranged in age from the late 
all were at least high school 


ere 160 nonsuper- 
egional office of a 
had responded to aq 
Part of an earlier 
1969). The employees 
teens to early 605 ‘and 
graduates, Immediate 


AND 


ariables were partialed 
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DARRELL ROACH 


Nationwide Insurance 


al and specific aspects of the work situa- 
tion were correlated with termination decisions and recorded 


absences during the year following adm 


out, but overall 


supervisors of all employ 
were also females, 

A job attitude questionnaire ww: 
small groups of employees by 
during a single working d 
sured that their individua 
made known to the comp: 


ees utilized in the study 


as administered to 
One of the authors 
lay. Respondents were as- 
l responses would not be 
any. Names were requested 
but employees were given the option of not respond- 
ing to that item if they “felt uncomfortable” doing 
so. Approximately 17% did not give their names. 
The job attitude scales were presented in bocklet 
forms and consisted of separate overall satisfaction 
and dissatisfaction scales (always the first tw0 
scales, » an overall satisfaction/ 
he five scales of the Job De- 
Smith, 1967) and a list of 11 
in alphabetical order) to be 
rated on a satisfaction/dissatisfaction scale. Ratings 
n (both overall and for 
made on a 12-point gn- 
rate satisfaction and dis- 
-point scales which consisted 
ints of the 12-point satisfac- 


scale plus a seventh alternative 
issatisfied) , 


job attitude booklets w 


n s 
on ere classified ^ 
terminations, No attempt was made to distinguis! 
among these cases with respect to differences in re- 
Ported reasons i 


ting their employment 
Tequency of absences mor 
tude booklet administra 


‘t-year frequency. Abe 
ci were ı lable or could not be matchet 
to job satisfaction booklets in 17 cases. The case 


Were excluded from the analysis, The final samp? 
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Roneisted of 131 cases 
ti isted of 131 cases, 109 current employees, and 22 
erminations, 


RESULTS AND DISCUSSION 


Denes 1 gives the correlations between non- 
attitudinal variables, overall- and specific 
eae attitudes, and the two criterion 
bles nes; None of the nonattitudinal varia- 
Of ae related to the termination criterion. 
te he overall job attitude measures, those 
nudne Senna along a bipolar satis- 
action/dissatisfaction or satisfaction only di- 
aoon were more related to actual job be- 
All oi oe reported degree of dissatisfaction. 
ake the ratings of satisfaction with specific 
et ealiie of the work situation that correlated 
intrin cantly (p < .05) with termination were 
with sic variables. These data are consistent 
tio earlier studies of correlates of termina- 
(19 by female clerical workers by Hulin 
a and Mikes and Hulin (1968) for over- 
i peta measures and by Ross and 
fine (1957) for the greater potency of in- 
c variables, 
ek fm frequency of absence criterion, 
lenne ias with both job grade and company 
Pattee significant (p < 05). When these 
frequ es were partialed out of the attitude- 
overall cy of absence relationships, two of the 
dissati satisfaction measures (satisfaction/ 
the samt and satisfaction) and two of 
eager job factors (JDI work and JDI 
ated nage were significantly ( < 05) re- 
ture r frequency of absences. In the litera- 
Mann De only one study (Metzner & 
invest} 953) seemed a reasonably adequate 
tater oe of the relationships between at- 
Collar and absenteeism among female white- 
study workers, The data from the present 
and aaa with the results of the Metzner 
tondhi, a study in that they found no rela- 
item P between a number of attitudinal 
h and absenteeism. 
Poa alan between termination © 
at thos Sf absences, wag 142 Gadicamg 
Bs scorn who terminated had more absences 
Withdraw. terminators). These tw forms 0 
nent oe from the work situation (pee 
other and Temporary) were related $ on ‘ 
Sures of ; “aps were related to overal ma 
taineq job satisfaction. These results, 09- 
1 under conditions in which situational 


nation and 


TABLE 1 


CoRRELATIONS BETWEEN PREDICTOR VARIABLES 
AND TERMINATION AND FREQUENCY 
or ABSENCE CRITERIA 


Criteria 
Variables Fre- 
Termi- | quency 
nation of 
absences 
Nonattitudinal 
Marital status = 09 
Education 108) E07 
Job grade 15 —.23** 
Company tenure 10 —.21* 
Job description index 
Work .24** | —.28** 
Pay ‘07 | —.09 
Promotion 09 05 
Supervision 14 05 
Co-workers 11 —.18* 
Job factors 
Growth and advancement .24** | —.07 
Recognition for work done abl —.10 
Responsibility on the job 38%* | —.16 
Sense of achievement .24** | —.18* 
Work 22* —.20* 
Competent supervision 09 —.09 
Considerate supervision 01 04 
Company policies & practices 04 —.04 
Co-workers —.06 —.03 
Physical working conditions 01 —.03 
Salary ae —.21* 
Overall measures 
Satisfaction /dissatisfaction .24** | —.23** 
Satisfaction 23%% | — 2855 
Dissatisfaction =i -16 
ee ee 
ep < 05. 
++p = 01. 


factors appear to have minimal saliency, SUP- 
port Vroom’s (1964) proposal | that with- 
drawal from the work situation 1s related to 


job attitudes. 
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THE CULTURE ASSIMILATOR: 
AN APPROACH TO CROSS-CULTURAL TRAINING * 5 


FRED E. FIEDLER, TERENCE MITCHELL 
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The construction of self-administered program! 
described. These programs provide an apparently 


culture assimilators, is here 


effective method for assisting 
fully with members of a 


for the Arab countries, Iran, Thailand, 


describes the steps involved in the devel 
Iture assimilator programs un 


studies validating the cu 
conditions. 


Interaci: 
Heei among persons belonging to dif- 
common as H AOLA is becoming increasingly 
nomic integr ay toward political and eco- 
and technical; tOn international cooperation, 
e eaae become more frequent. 
interpersonal ; concerned with stereotyping, 
turally het attitudes, and behavior in cul- 
Interaction erogenous groups suggests that 
Substantial po cultural barriers produces 
"ledler, Mey ifficulties (e.g., Fiedler, 1966; 
or si & Oonk, 1961; Rombauts, 
recent han abe see Triandis, 1967). A 
ight, 1969) a of cross-cultural training 
*PPproaches ) surveys a large number of 
Some the: which have been proposed to over 
ithe Since most of these 
aluation, one not been sub jected to rigorous 
ness, One | ittle is known about their effective- 
and Which method which has been evaluated 
'S the cult also seems to produce useful results, 
—. assimilator. The present report is 
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nother culture. Culture assimi 
Central America, 
lopment of these programs, 
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med culture training manuals, called 


just success- 
lators have been constructed 


and Greece. This article 
and reviews 


der laboratory and field 


a description and evaluation of this training 


procedure. 
The culture assimilator® is a programmed 
learning experience designed to expose members 
of one culture to some of the basic concepts, 
attitudes, role perceptions, customs, and values 
of another culture. There is no single culture 
assimilator ; different assimilators are needed 
for each pair of cultures. Given Cultures A, B, 
and C, we could have assimilators telling A 


about B, B about A, Aabout C, C about A, etc. 
n be general or 


Furthermore, assimilators Ca 
specific; that is, they may train an individual 
to behave effectively in a very narrow set of 
social situations, such as in a hospital or as & 
community development worker, or for a broad 
of socia The content of 


] situations. 
assimilators cap also vary 2 good deal. Some 
assimilators emphasize the interpersonal atti- 


tudes that contrast the Jearner’s culture and 
the target culture; other assimilators emphasize 
the customs of the target culture; and still 
others concentrate oD the value contrasts of 
the two cultures. Research to date has not 
ufficiently to indicate which kind 
aximally useful in which type 
lts this article 


group 


progressed S 

of assimilator ism 

of social situation. The resu 

reviews suggest that a balanced assimilator 

containing an equal number of general and 
ilator training was 


ssim 
now at the State Uni- 


3 The concept of the culture 4 


originated by L. M. Stolurow, 
versity of New York at Stony Brook, who was one of 


the co-investigators of the Advanced Research Proj- 
ects Agency project from 1962 to 1967. 
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specific items, and concerned with each of the 
above areas of content does improve inter- 
personal interaction in heterocultural settings. 
More efficient designs for developing these 
programs are likely to be found in future 
research. . 
Preparation for overseas work assignments 
varies from giving a man a two-page pamphlet 
to several years of preparation. This training 
will vary depending on the organization, the 
type of work, and the available time to provide 
training. Such preparation is a key component 
in the in-country training of Peace Corps 
volunteers, while it is very limited in the case of 
most business organizations. The culture 
assimilator is designed primarily for personnel 
with limited time and who, therefore, would 
normally receive little or no training. It is 
designed to provide the trainee with extensive 
information about the culture in a 2-5-hr. 
time span. 
In order to provide such information it is 
necessary to select those areas of culture con- 
trast which have the largest impact on social 
behavior across cultural barriers, This requires 
an exceptionally good understanding of the 
two cultures. Such understanding can be 
obtained by examining existing ethnographies 
and historical records, as well ag by analyses of 


the subjective cultures of the two cultural 
groups. 


By subjective culture we mean the ch 


aracter- 
istic way in which a cultural group perceives 
and responds to its social environment. 


Triandis (Triandis & Vassiliou, 1967 


type requiring sentence com 
comparisons, rank-orderi 
the best response fro 
taken to ask a representati 
concerning a repres 
objects, The inform 
of multivariate m 
analysis and analy 
the major dimensi 
Cognition used in 


to which these dimensions influence a wide 
variety of responses. An examination of the 
relationships among such data normally reveals 
some themes or typical culture-determined 
viewpoints which can be communicated as 
representative generalizations about that cul- 
ture. The assimilator can then focus on those 
important themes and omit details about the 
other culture that cannot be easily learned in a 
short time. However, only in the most recently 
developed program, the Greek assimilator, was 
there an extensive use of these methods in the 
construction of the program, although other 
programs made indirect use of information 
gathered by these methods, 

The main problem in the development of the 
culture assimilator, as well as any culture 
training program, is: How can we meaningfully 
simulate intercultural interactions? There ane 
innumerable ways of learning about another 
culture. Most methods involving paper and 
pencil or books make 


the trainee a passive 
observer. Yet the heterocultural encounter is bY 
definition an 


active process. The participants 
must continually evaluate, interpret, and react, 
and again evaluate the effect of their reactions. 
We were, therefore, committed to an instruc- 
tional Program which made the trainee an 
active participant in the learning process. F 
Granted that no book or reading situation 
will be able to provide ideal simulation experi- 
ences, programmed instruction does have the 
advantage of forcing the individual to interpre! 
and evaluate various Situations and to assimi 


late immediate feedback on the adequacy and 
accuracy of his interpretation, 


Basically, the met 
the trainee to read 
cultural encounter, 
tion of the encount 


hod here described require? 
a short episode of an inte!” 
and to give an interpreta 
er. The program gives ap 
his answer was correct 0 
er, the trainee is asked t° 


is thus quit? 
helming majority ° 
stimulating and enjoyable 
velopment of an effecti¥ 


of these instruments, 


STEPS IN THE DEVELOPMENT OF 
CULTURE ASSIMILATORS 
Selection of Content 
is n ae of all culture assimilators 
B = _ problems in heterocultural 
the grentest aie in areas in which there are 
and i Ba ergencies in norms, customs, 
diferente betwen two cultures. Where no 
cine a. there will be little cause for 
the Selena problem is, thus, one of identifying 
Ad ia et ET concepts and behaviors, 
Ea mi the trainee with a series of 
with them Hp hich he must symbolically cope 
used to ache ce of methods have been 
behaviors T} js thes critical concepts and 
briefly ern analysis of subjective culture, 
approaches rene earlier, is one of these 
critical ae nother concerns the analysis ot 
Critical ae 
nique oe The critical incident tech- 
(1954) kd T N developed by Flanagan 
defines it as consisting 
of a set of = 
of hum 


Sai oe for collecting direct observations 
Potential Tehi vior in such a way as to facilitate their 
developin, as ulness in solving practical problems and 
Meident k aaay psychological principles. - By an 
Is Suflicienitly cont any observable human activity that 
Predictions Hind in itself to permit inferences and 
act. To he ane made about the person performing the 
Where the ical, an incident must occur in a situation 
Clear to tpe hose or intent of the act seems fairly 
Sufficient] yo dg and where its consequences are 
effects [ Y definite to leave little doubt concerning its 
p. 3277. 


Plans 
anaga z zi E aa 
technique f (1954) utilized the critical incident 
inefiectiy, og the investigation of effective and 

e job behavior. For the purposes ° 
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given sufficient knowledge about the culture. 
Finally, the incident must be relevant to the 
American’s task or mission requirements. Thus, 
a love affair may be interesting or upsetting for 
some Americans overseas, and it may present 
innumerable occasions which the American 
finds difficult to understand. However, under 
ordinary circumstances, information concern- 
ing love affairs would not be considered 
relevant for the job. 

Critical incidents are obtained by 
Americans and host nationals with whom they 
come in contact to describe some specific 
intercultural occurrences or events that made a 
major difference in their attitudes or behavior 
toward the members of the other culture. These 
may be pleasant, unpleasant, or simply non- 
understandable occurrences. One such incident 
from the most recently developed assimilator is 
presented below. 

Incident reported by an A merican teacher in 
Greece. The teacher mentioned that when she 
sreece she met a number of Greeks 
whom she knew only, what she considered to 
be, casually, She was surprised by the fact that 
these people often asked her questions which 
were of a rather personal nature. She felt that 


the Greeks were somewhat rude to ask ques- 
standard of living, and 


tions about her clothes, 

other personal things. She felt that this infor- 

mation should be told only to close friends. 
The incident is then refined into the episode 

which, along with 75-100 others, is eventually 

used in the program. Names are changed an 

irrelevant material dropped. An example of 


the final episode is below :* 


asking 


was new in ( 


Sh; 


W onean eld; a school teacher in Athens, 

ask her shai ange to be only casual acquain 

“How mu k re she was going or where she had 

reeks TE do you make a month?” or 
very rude. 


was amazed at 
tances. When she entered or left her apa 
been. If she stoppe ti 


«here did you £ 


Page X-1 


ns that were asked her by Greeks 
rtment, people would 


d to talk she was asked questions like, 
aring?” She thought the 


the questio: 
ens: 


et that dress you are we 


the ideal 
on occurt- 
t national 
American 
is likely 


deve à 
i loping 


inci e a 
ncident y culture assimilators, 


eNCa + Mus papit 

ence in whi T describe (a) a comm 
p teract; (6) han American and a hos 
nds at ae situation which the 
i ul, puzzling, or which he 


© mis; 
Sinte 
ae . erpret; ç 5 e ssi: a 
i e titles, t; and (c) a situation which can 
edina fairly unequivocal manner, 


Alternatives and feedback. Having written the 
it is then necessary to write four 
hich will be presented to 
e four alternatives 


In fact, the pre- 


episodes, 
possible responses W. 
the trainee. Each of thes 
obviously must be plausible. 


atements here sh 
a separate page. 


own in a box is 


4 Bach set of st 
normally presented on 
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ferred method is to ask the Americans inter- 
viewed to write out various explanations, and 
the most plausible are selected from this pool 
of alternatives. Only one is deemed correct, 
with the others containing various ethnocentric 
errors. ' 

Finally, four feedback explanations must be 
written. These tell the trainee not only whether 
his answer is correct or incorrect, but they also 
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provide him with further information about 
the culture. They should, in effect, give the 
trainee a rationale for interpreting the correct- 
ness or incorrectness of his reply and assist him 
in building up a frame of reference for handling 
similar situations. The feedback information 18 
taken from our interviews and subjective 
culture data. Examples of the alternatives and 
feedback explanations are below: 


Why did the Greeks ask Sharon such “personal” 
1. The casual acquaintances were acting like frie 


2. The Greeks asked Sharon the 
Church. 


3. The Greeks were unha 
her habits. 


| 
questions? ae 
nds do in Greece, although Sharon did not realize it. 


questions in order to determine whether she belonged to the Greek Orthodox 


py about the way in which she liv 


4. In Greece such questions are perfectly Proper when asked of women, but improper when asked of men. 


Page X-2 


Go to page X-3 


Go to page X-4 
ed and they were trying to get Sharon to change 


Go to page X-5 


Go to page X-6 


You selected 1: The casual acquaintances 


were acting like friends do in G 


Page X-3 


reece, although Sharon did not 


e more intimate in Greece 
would seem too personal in 


Go to page X-1 


You selected 2: The Greeks asked Shari 

to the Greek Orthodox Church. 

No. This is not why the Greeks asked § 

tion is “personal” depends upon the cult 
“personal,” Why? Try again. 


haron such questions, Remember, w 


ure. In this case, the Greeks did n 


Go to page X-1 


You selected 3: The Greeks were unhappy abou 


Page X-5 
ere trying to get 


You selected 4: In Greece such 
asked of men, 


When are these questions proper? T 
friends in Greece, Was Sharon regarded as 


hat you have learne 
a friend by these 


uations, However, sex hi 


as nothing to do with it- 
d about I 


ss pen 
Gracies oper hehayior betwee 
irecks? 


Go to page X-1 


am, 


J 


an 
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2 ne in important functions of feed- 
major ee = te learner to some of the 
withiwhich es S a a the two cultures 
leamed pa eai orking. In short, information 
saltiras of | ee? analyses of the subjective 
corporated i p two communities can be in- 
thüs be t a he feedback. New concepts can 
incidents aught in the context of concrete 


Validation Sleps 


1E e..s 
Tee der ee have been constructed and 
number ai and feedback are prepared, a 
the stories bist ation steps are undertaken. All 
tives (i.e è kay just the second page of alterna- 
to an apti asst the feedback) are submitted 
target cultu pee sample of people from the 
story and rig subject is asked to read the 
eels ig Bi ig choose the alternative which he 
occurred fro ga asa description of what 
information m a cultur s point of view. This 
Stories for whic then tallied „and only those 
agreement ; hich there is a high percentage of 
duct, The on incorporated into the final pro- 
Model She Soi. is, therefore, based on a 
Meanings wiii ee that given behaviors have 
Most of the rr are more probable than others. 
behavior ocë ae when a certain situation or 
Culture eens the person from the other 
M ehers a a certain thing by his action: 
also asked yp -the same subject population are 
they feel th A Oe each story on how important 
Occurrence p sota is, and how probable is its 
US further į he results from these scales help 
represent T deciding which incidents would 
Problems that most frequent and important 
hen the fi i nea ae 
Chosen, they inal group of incidents has been 
» they are examined by a number of 
and sorted into categories according to 
y deal—for 
>, hospitality, authority relations, $&V- 
are then 
nciples : 
dealing 
> episodes 
s to more 


GD yep: 

within % 
begin 
dfg 
see Foa & Chemers, 1967); (6) 


Categories of concepts 
irst a, with rather broad 
Ms. Intrody = moves to more Spe 
One or two cory and summary st 
Paragraphs) are presentec 


social issues 
cific prob- 
atements 


] before 
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and after large groups of incidents dealing with 
similar topics, and all of these incidents and 
statements are then bound together in book 
form. 

Depending in part on the availability of 
critical incidents, we have generally obtained 
150-200 incidents, of which 75-100 are eventu- 
ally included in the culture assimilator. The 
time for preparing a 75-100-item program is 
estimated as approximately 800 man-hours 
spread over 9 mo. This includes collection of 

pisodes and alternatives, 


items, writing the € 
and checking and revising the alternatives and 


feedback information. 


RESULTS FROM LABORATORY EXPERIMENTS 


Four studies have been completed which 
provide validation evidence for the culture 
assimilator programs. These studies are very 
briefly summarized below. 


Arab Assimilator 
emers, Lekhyan- 


anda, Fiedler, & Stolurow, 1966) tested the 
hich was the first 


program and, 
The Ss consi 
Officer Training 
served as leaders. 
of the Universit, 
Association particip 


bers. These Arab students worked in two-man 


teams with various American leaders to 
were assigned at random. 

p of leaders (ROTC cadets) were 
hotomized into a control group 
rammed instruction on 
hy. The other 14 leaders 
assimilator which dealt 


with the role of women, importance of religion 
in the Mid East, and interpersonal skills in 
on. The testing and training 
on four consecutive nights. 


After the training period, each American was 
appointed the leader of a three-man group with 
two Arabs as members. Each group performed 
three tasks: (a) a letter encouraging Arab 
villagers to allow their women to work in a 
factory to be built nearby; (b) traversing a road 
map with the shortest possible mileage, given 
that every point on the map had to be touched 
at least once; and (c) a negotiation task to 
decide on the number of Moslems to be em- 


whom they 

The grou 
randomly dic 
which received prog 
Arab country geograp 
received the culture 


group interacti 
sessions took place 
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ployed in an Arab-American mining venture. 
The Americans were told to seek a nondis- 
criminatory hiring policy. The tasks were 
administered in counterbalanced order, and the 
pairs were Systematically rotated to balance 
the design of the study. After each task all 
participants completed various scales des- 
cribing the group climate, perceived effective- 
hess, stressfulness of the situation, etc. 

The differences in task performance between 
culture assimilator-trained leaders and leaders 
with geography training were small, but in the 
hypothesized direction, although none reached 


second task; and seven scores composed of 


expected direction with one at the p < .05 
the p < .10 level, These 11 
indexes consisted of three measures generated 
by leader ratings; five measures composed of 
member ratings, and three combined ratings of 
the overall group atmosphere. The results of 
this pilot study, therefore, yielded Promising 
relations which encouraged further work, 


Thai Culture 4 Ssimilator 


A second study used a considerably more 
sophisticated culture assimilator developed for 
Thailand (Foa, Mitchell, Santhai, Wichi- 
arajote, & Wichiarajote, 1967). This Thai 
culture assimilator was experimentally tested 
by Mitchel] Foa (1969). 7 
involved 32 i 


en randomly assigned 
to an experimental and a control group, The 
16 men of the experimenta] Sroup received the 
culture assimilator program, and the 16 men 


ed the geography 
program. 


Sixteen pairs of foreign students from the 
Far East Participated in the study, 
member of each pair was 
other from Korea or another Far Eastern 
country. Each pair of foreign students worked 
with one culture-trained and one geography 
trained American, The American’s Job was {o 
Supervise the construction of a toy building 
made by his team of two foreign students and 
lo get the building Constructed in the shortest 


One 
from Thailand, the 
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possible time and as accurately as possible. jí n 
American leader was not, however, permitted 
to work on the task. The experimental situation 
was conceptualized as simulating the peal lees 
of a Western expert working in a pane 2 
Country: the expert is usually better qualifiec 
technically than his local Co-workers and he is 
expected to train them in doing the job rather 
than doing it himself, ’ 
At the end of each session, a Thai observer, 
the American leader, and the two foreign 
students completed an evaluation form, After 
the second session, all but the leader answered 
an additional rating form comparing the two 
American leaders, Since culture- and go 
graphy-trained Americans alternated in the 
sessions, this Jast rating amounted to a com- 
Parison of the two types of training, although 
neither the SToup participants nor the T hai 
observer knew that they were making this type 
of comparison. Since there were two observers 
and two task orders Possible, the design, was 
‘ed to control for these possible 
Three Composite ratings of the interpersonal 
relations were generated for each of the four 
participants, These ratings dealt with the 
leader’s handling of the group as a whole, his 
treatment of the members, and an overall 
assessment of the Sroup climate. For these three 
ratings, two Were significant for the leader 
ratings, all three for the Thai observer, one fo! 


id Carear 
the Thai member, and none for the Korear 
member, All 12 


: 4 ratings were in the predicted 
direction, 

In the area of group effective 
ratings were used i 
of the numbe 


ness, two scale 
as well as measure? 
made in the con 

lildings and the time t? 
- None of these ratings or mea” 
Sures were significant, The results were, there” 
fore, similar to those of the first study showing 


that the interpersona] area seems to be more 
Sensitive to the culture training than group 
effectiveness, Whether ese interpersonal dif 
ferences would facilitate Productivity over i 
more prolong, time in a field situ 
tion was, of ? & question for subseque? 
field studies, 


finish the task 


ed period of 
Course 


Piety STUDIES 
Honduras Culture 4 SSimilator 
The third study 


. of 
involved a program f 
Central America (Sy. 


ay 
j jdm” 
monds, O’Brien, Vidn 


Cross-CULTURAL TRAINING 


& Hornik, 1967) which was experimentally 
validated in a study by O’Brien, Fiedler, 
and Hewett (1970), This study was conducted 
i the context of a people-to-people program 
conducted by Los Amigos de las Americas of 
Houston, Texas. Amigos yearly sends 250-300 
teenagers to Central America to establish and 
Operate public health clinics which provide 
polio vaccinations, diptheria shots, and similar 
preventive measures, as well as performing 
community development work. Three waves of 
Caed miteipants went to Honduras and 
E at a for 3-wk. terms. In their host 
we ant the members of each wave were 
ey oh ba teams of 2-6 persons, as well as to a 
al and three regional headquarters teams. 
io ae participants in the summer of 
training f received the „culture assimilator 
illness i hile the remaining 146 received the 
RE i area training given normally. To 
tke ei the possibility of a Hawthorne effect 
partic ure assimilator training was given no 
A cular emphasis. The training given to 
bert varied somewhat; that is, some Amigos 
Others a lecturers or saw some films and not 
the ee upon the available time and 
lator a. ies of scheduling. The culture assim- 
any thi anng “Was, therefore, not seen as 
Ching out of the ordinary. 
_ The culture assimilator for Honduras and 
oe was- based on critical incidents 
the Bise E Amigos who had participated in 
used a hil the previous year. The progam 
theory TE framework of structural jole 
O’Brien Toe & Harary, 1962, 1964; aer k 
ing with ; lations emphasized incidents deal- 
Court, relations between the Amigos and host 
É Yy nationals collected from the previous 
Year's group. 
fear emir of performance and ae 
Were me t two forms: (a) performance rating 
. btained from the director of the Amigos 
a ape and from his various headquarters 
PE (b) in addition, several adjust- 
t E pager were obtained from the Amigos 
TTP n These included (a) aa 
after oe scores obtained before ate 
ratings es overseas experience, and (o) daily 
with oak adjustment, mood and saen 
Overseas thee came to be navn as happy 
Sheets), The > = e to be know L j A 
betwen. ese latter comparisons were Made 
Cen first- and third-week ratings. 
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The results showed a general improvement in 
work performance of culture assimilator- 
trained as against untrained Amigos. Of 
particular interest was the finding that the 
mean adjustment of all trained Amigos also 
generally improved, but that the improvement 
as well as the work performance was higher for 
those who had been Amigos the previous year 
than for those who had gone to Central America 
for the first time. This completely unexpected 
finding suggests that the culture assimilator 
effectively assists in the integration of pre- 
viously obtained cultural experiences as well as 
providing information for the novice abroad.° 
The data thus show that the assimilator train- 
ing contributed significantly to adjustment as 
well as performance in field conditions and to 
better interpersonal relations in laboratory 
studies. 

Thai Assimilator Field Study 

Finally, Worchel and Mitchell (1970) re- 
cently completed a field test with the Thai 
Assimilator, utilizing as Ss officers of the 
United States Military Advising and Assist- 
ance Group and civilians from the United 
States. The available officers and civilians 
were randomly divided into those who re- 
ceived the culture assimilator training and 
those who received comparable culture train- 
ing in essay form. 

The method of evaluating the training con- 
sisted of self-report questionnaires mailed out 
months after the training. Other 
evaluations, while highly desirable, could not 
be obtained on advice of the American authori- 


several 


ties in Thailand. 
The questionnaire contained 15 items about 


the officer’s experience in Thailand. These 


items dealt with his contacts with Thais, his 
work relationship, his health, his desire to go 
back to Thailand, etc. On 12 of these items, the 
differences between respondents with culture 
assimilator training were more favorable, and 
of those, 4 were statistically significant. None 
of the items in the counter-expectational 
direction approached significance. Thus, the 
his finding is supported by informal statements of 
personnel of the United States m sion in Iran where an 
experimental study, using an Iranian culture assimilator, 
was conducted by Chemers (1968). Unfortunately, the 
Tran assimilator has not been tested under field condi- 


tions as yet. 
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culture assimilator training could be regarded 
as successful. 


Greek Assimilator Field Study 


A recently completed Greek culture assimi- 
lator (Mitchell, Gagerman, & Schwartz, 1969) 
was administered to 76 American military 
and civilian advisors in Greece. A comparison 
of culture assimilator-trained and control Ss 
showed that the assimilator-trained Ss re- 
ported themselves to be significantly better in 
getting along with Greeks, in obtaining co- 
operation from Greeks, in being accepted by 
Greeks, in understanding Greeks, as well as 
enjoying their tour of duty and feeling generally 
better during their tour of duty than did a 
large group of military personnel who had not 
received culture assimilator training. The 
assimilator-trained Ss also saw themselves as 
significantly better adjusted and as having 
significantly better interpersonal relations and 
higher productivity than Ss who had received 
no training. A more detailed report of this 
study appears in Worchel and Mitchell (1970). 

In summary, the major steps in the con- 
struction of culture assimilators were described. 
They provide an apparently useful method for 
the instruction of members of one culture to 
behave effectively in social and task situations 
involving members of another culture. Three 
particular strengths of this approach are sug- 
gested: (a) the training depends on a detailed 
analysis of empirically derived critical cultural 
information relevant for effective interaction; 

(b) it provides an active, ego-involving training 
procedure; and (c) it has been shown to 
improve the effectiveness and satisfaction of 


those trained with this method wh 
compared 


methods. 


: en they are 
with those trained with other 
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A magnitude estimation techniqu 


between merit increases and satisfaction for pro 


electronics organization. The linear, 
fitted to the data in combination wi 
function and the percentage of pe 
of reference gave the best fit to the 
concept of diminishing utilit; 


analyses found that each dollar of merit increase 
he sample, satisfaction was not a monotonic increasing 


approximately 10% of t 


function of dollars. This is contrary 


consistent with equity theory. 


i isa long history in theorizing about 
AS fa jective value of money to individuals. 
Cram r back as 1728, the mathematician 
fhe oe ee: 10 postulated that 
tion re of money might be a power func- 
late. the number of dollars. A short time 
pothe = 1738, the philosopher, Bernoulli hy- 
hea y a logarithmic function. The impor- 
e this theory has been recognized by 
ROSIN ication in modern day literature (Ber- 
Postul. Pil More specifically, Bernoulli 
enced ea that an individual’s wealth influ- 
an addi e utility of additional money 50 that 
Poor Pa dollar has more utility” for the 
econon an the rich man. Since that time, the 
utilit nic principle of decreasing marginal 
ae that each added dollar has less utility 
Widel ) than the preceding one, has been 
except accepted (Stevens, 1966). The one 
Wisin Mis principle would be the 
incre: nan and Savage (1948) proposition that 
when a marginal utility may come about 
that a consumer is shifted into a new class 
Would give him new social and economic 
Status, 


Whi . 
well ms Bernoulli’s hypothesis has become 
ion aee (Adams, 1963), Cramer’s proposi- 
cent] has passed into obscurity and only re- 
Work, has been revitalized by the extensive 
—< of Stevens. Using a technique of magni- 

PTni asti 
the aie article has been accepted 

2 ee Editor, Dr. Kenneth 
pitrett M for reprints should be sent to 
ties Ae nent Research Center, 

dministration, University of 


ewe 
Y, Rochester, New York 14627. 


for publication by 


E. Clark. 
Gerald V. 
College of 
Rochester, 
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s Administration, University of Rochester 


e was used to determine the relationship 


fessional employees of an 
logarithmic, and power functions were 
th four frames of reference. The power 
rceived equitable merit increase frame 
data. The generally accepted Bernoullian 


y for each added dollar was not supported. The 


had increased utility. For 


y to accepted economic theory but is 


tude estimation, Stevens (1966) has consis- 
tently shown that a power function best de- 
scribes the relations between stimulus and re- 
sponse for a number of sensory modalities. 
More recently, the technique has been applied 
to attitudes and other social phenomena. 
Galanter (1962) reported an investigation in 
which Ss were asked how much money would 
make them twice as happy as $10, $100, 
and $1,000. The results supported the gen- 
eral power law with an exponent of approxi- 
mately .43 and, at the same time, the proposi- 
tion of decreasing marginal utility. In a more 
recent study of crime, Sellin and Wolfgang 
(1964) found that the seriousness of an of- 
fense which involved money grew as a power 
function of the amount of money stolen. The 
obtained exponent was .17 which supports 
even more strongly the proposition of declin- 
ing marginal utility of money. 

In critiquing Galanter’s (1962) study as 

reliminarily reported by Stevens (1959), Ed- 
wards (1961) indicates that much more evi- 
dence would be required to convince research- 
ers that the utility of money was not linear. 
ds contends that twice as 


In essence, Edwar e h 
much money should be twice as important to 


the individual. 

There is one common element among the 
theoretical propositions of Bernoulli (1964), 
Cramer (Bernoulli, 1964), Stevens (1959), 
and Edwards (1961). All would postulate that 
the utility of money is a monotonic increasing 
function of the number of dollars. They differ 
in that Bernoulli would postulate a logarithmic 
function, Cramer and Stevens a power func- 
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o— Linear 
& Lagarithmic 
e— Power- 
Increasing 
O~ Power- 
Decrasing 


SATISFACTION UTILES 


MERIT INCREASE 


Fic. 1. Represi 
power functions. 
utility.) 


entation of linear, logarithmic, and 
(A utile is a hypothetical unit of 


tion, and Edwards a linear function, The vari- 
ous theoretical curves are illustrated in F igure 
1. If the exponent of the power Curve is less 
than 1, this represents decreasing marginal 
utility, and if more than 1, increasing mar- 
ginal utility, 

While there 


work concerning the actual 
No reports were found which 
y of merit increases. The 


a recent study by Zedeck 
and Smith (1968) which 


method of 
thresholds i 


y (satisfaction) 
: bove and below 
that point. 


Specifically, the 
vestigation was to 


ncreases 
and to 
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METHOD 
Subjects 


Questionnaires were distributed to 157 professional 
level personnel employed at a medium-sized Eastern 
electronics company. Eighty Ss responded to, pi 
anonymous questionnaire. The 64 replies (41% a 
the population) containing complete data, were used 
in the present investigation. : 

The sample included 31 professionals classified as 
engineers and 33 classified as nonengineers who were 
in the fields of accounting, purchasing, production, 
marketing, and personnel. Thirty-nine Ss had a 
bachelors degree or beyond while 17 had completed 
2 yr. at college. The remaining eight Ss had a high 
school education. Fifty-three Ss were between the 
ages of 25 and 39. Only five Ss were younger than 
24, and six were over 45. A total of 29 Ss reported 
their yearly salary as being between $8,000 and 
$12,000 a year. Nine Ss each reported income in the 
brackets of $12,000-$14,000, $14,000-$16.000, and 


$16,000-$18,000, Eight Ss reported an annual salary 
of over $18,000 a year. 


Research Instrument 

The questionnaire used for the present study was 
similar in its basic construction to one that has been 
previously used in investigations of juvenile de- 
linquency (Sellin & Wolfgang, 1964) and information 
systems (Barrett, Thornton, & Cabe, 1968). The 
ñrst page of the questionnaire was as follows: 


Most companies award merit increases to saluted 
employees as part of an annual performance appraisa 2 
No doubt you look forward to your next performance 


appraisal with expectations of receiving a certal! 
increase, 


Merit in- 
creases in 
dollars per 


year Instructions 


$1,700 1. Fourteen Possible merit increases 
are listed to the left. C 

“L 

1,000 C] 


900 E] 
2,100 L] 
600 w 
1,100 C 


which you feel w 
able at your next ye: 


our own perceive 
d contributions t0 


2. Your 
satisfacti 


satisfactic 


equitable €ase chosen above: 


fair and equitable merit i 
been assigned a value of A 
crit increase as a standard aie 
Y other merit increase in E as 
Sstandard, For example, if an 


Score ever 
tion to thi 


UTILITY or MERIT INCREASES 


other merit increase would seem to bring 
you five times as much satisfaction, 
write in a score of 50. If a merit increase 
would bring you only half as much satis- 
faction, write in a score of 5.0. If a merit 
increase would seem to bring only 1/20 
of the satisfaction as the standard, write 
in a score of 0.5. 

You may use any whole or decimal 
numbers that are greater than zero, no 
matter how small or how large they are, 
just so long as they represent how much 
satisfaction the merit increase would 
bring you as compared to the standard 
merit increase. 


700 C] 
2,500 E 
1,300 w 

800 E 

300 O 
1,500 C] 
3,000 C 


ene page of the questionnaire elicited in- 

Service wih thy gare S’s annual salary, length of 

of depe h the company, occupation, age, number 
ndents, and formal education. 


Procedure 


Each 
sent an 
of th 
enc] 
nai 


professional employee in the company was 
hs PP letter explaining the purpose 
osed estigation. They were asked to fill out the 
re Ine and to return the question- 
Ne comr Isned in an envelope to an address outside 
follow-up lett Two weeks after the initial letter, a 
Ces that letter was sent to all professional employ- 
i ing those who had returned the question- 


Naire 
ani i aS 
return a requesting those who had not to please 


T RESULTS 
eee Separate regression and correlation 
ato docoune performed on the data taking 
ree h int four frames of reference and 
bl YPothesized functions as shown 10 Ta- 
- In each case, there were 64 Ss making 


l4; 
u 
i Judgments resulting in 896 data points. The 
rs were 


1 basis. 


an 


Kies oo of reference or predicto 

It was on either a logical or theoretica 
e Sten ed that the combination frame of 
hi est e and function which resulted in the 
decision correlation would best represent Ss 
rence process: The simplest frame of ref- 
the F Osen was the correlation of each of 
values Piro increases with the sat 
tlerence od to them, The second frame o 
obtained Was the percentage merit increas: 
1 €ases by dividing each of the 14 merit in- 
Hated OY each §’s salary which was corre- 
Mer} With the i i for each 

It incr satisfaction values 

ease. The third frame of reference 


isfaction 
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was the percentage of which each of the 14 
merit increases represented perceived equita- 
ble merit increase for each respondent. This 
was considered to be the frame of reference 
most analogous to the questionnaire instruc- 
tions which requested that each employee 
compare his chosen equitable merit increase 
with every other increase. The last predictor 
or frame of reference was most analogous to 
Bernoullis (1964) concept of wealth. The 
employee’s salary plus merit increase was Con- 
sidered to be the best estimate of his total 
wealth. In Table 1, U is the utility or per- 
ceived satisfaction, M is the merit raise, S is 
current salary, and E is the perceived equita- 
ble merit increase. 


The simplest frame of reference or inde- 


pendent variable was the dollar value given to 
each of the 14 merit increases. As shown in 
Table 1 the merit increase frame of reference 
gave the highest correlation with the power 
function, but this coefficient of .76 was not 
significantly different from the .73 obtained 
with the linear function. Both the linear and 
lations were significantly higher 
formation) than the .66 cor- 
vith the logarithmic func- 


power corre 
(p < .05; z trans 
relation obtained y 
tion. 
The se 
dent variable was 
crease. As shown i 


cond frame of reference or indepen- 
the percentage merit in- 
n Table 1 the correlations 
were almost identical to the merit increase 
me of reference. The third frame of ref- 
entage of perceived equita- 
ble merit increase. The .79 correlation ob- 
tained with the power function was signifi- 
cantly higher (p < .05) than either the linear 
or logarithmic function. The last frame of 
reference was wealth as a predictor of satis- 
faction. As shown in Table 1 all the obtained 
correlations were low. l 

The results indicate that the most consistent 
relationship was found with the power func- 
ver all frames of reference. The highest 
as obtained with the percentage 
rit increase frame of 


frar 
erence was the perc 


tion 0 
correlation W: 
of perceived equitable me ) 
reference expressed as a power function. This 
obtained regression line is shown in Figure 2. 

To determine if the factors of age, profes- 
sion, and service generated significantly dif- 
ferent relationships between merit increases 
and satisfaction, separate correlation and re- 


ee 
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gression analyses were computed using the 
percentage of perceived equitable merit in- 
crease frame of reference and the power func- 
tion which had given the highest correlation. 

As shown in Table 2, the only significant 
difference in these divisions was between en- 
gineers and nonengineers ($ < .05). 

A surprising finding was that seven of the 
employees did not report satisfaction as a 
monotonically increasing function of merit 
increase. For those seven employees, more 
money did not bring more satisfaction, and 
for some, more money would actually result 
in a decrease in utility, Despite this, the cor- 
relations reported (see Table 1) were not sig- 
nificantly different from those obtained when 
the group was removed from the total sample. 


Discussion 


increase) did not seem to 
mportance considering the 
ons obtained when it was 


used with all three functions. 


TABL 


CORRELATION (Propuct-Momenr) BETWE 


EN 
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The failure of support for Bernoulli’s 
(1964) decreasing marginal utility of money is 
consistent with other studies. Allais (1953) 
presented rational arguments against Ber- 
noulli’s concept while an experimental study 
by Coombs and Pruitt (1960) failed to sup- 
port the concept. The complexity and con- 
founding influence of the experimental popu- 
lation was illustrated by Mosteller and Nogee 
(1951) who found that, with the college stu- 
dent Ss, support was given to the diminishing 
utility of money while with national guards- 
men, the data indicated increasing utility of 
money. A recent financial portfolio simula- 
tion by Gordon, Paradis, and Rorke (1969) 
found the power function to best represent 
the utility of money, but the coefficient was 
below one supporting decreasing marginal 
utility, 

One of the most intri 
both the power func 
marginal utility of 
Starr (1969) 


guing supports for 
tion and the increasing 


everity rate of mining 
of mining. A power 
with an exponent of 
result can be inter- 
e increasing marginal 


E1 


0 SATISFACTION AND MERIT I EASE 
Four FRAMES OF REFERENCE AND THREE Dirrerent Funeria Sidi 
Function 
Frame of reference 
Linear Logarithmic Power 
Merit increase U=A4B (M, lu 
= M) U=4 4 
Gi) ie 1+ omo (M) U=4 ye M)" 
d 768 
Percentage merit increase U=A4B (M/S J= 
LIS) f ) |U=4+4 2log (M/s) |u=4 + B (M/S) 
r .76> 
Percentage of Perceived equitable 
merit increase U=A4 +B (M/E T= 
(M/E) | T (M/E) U=4+ =e M/E) Ü= iin ar/e" 
i «79° 
Wealth U=4A4B 7 
Gai + (S + M) U AFE lag (S +M) U=448 (S +)" 
i wie 
Note.—Abbreviations: U = utility or perceived satisfac Wii wishin ea A 
merit ease. tsfaction, M = merit raise, S = current Salary, Bo Perceived equit able 
n = 2.06. 
en = 2,21, 


da= 16, 


UTILITY or Merit INCREASES 


ed be money since miners appear volun- 
kT o work in situations involving more 
Of fio each additional dollar they are paid. 
ihe ebes an alternate interpretation is that 
rete a not perceived according to objective 
tidra ‘a the fact remains society pays 
ee als in the occupation as if money had 
ng marginal utility. 

oe Widely cited Friedman and Savage 
the curve of utility does embrace both 
sal ae of increasing and decreasing mar- 
defects f ity, but suffers from a number of 
a ae rom the viewpoint of an experimental- 
(1953) bee ney to test the theory. As Alchian 
an indi Ra pointed out, it is not clear whether 
salary vid ual moves up or down the curve as 
P i aereas or whether current income 1s 
chan ay ag en somewhere on the curve for 
ire S in income. While it is postulated that 
individucs marginal utility occurs when an 
a en ual moves into a new economic class, 
Constitutes a new class is not defined. 
bit ’s generality makes definitive test 
man a vy nen In our judgment the Fried- 
explai nd Savage model does not adequately 
a a the results of this and other empirical 
wie c The failure to be able to test empiri- 
Y specific features of the model may ac 
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TABLE 2 


CORRELATIONS (PRODUCT-MOMENT) BETWEEN PER- 
CENTAGE EQUITABLE MERIT INcREASE AS A POWER 
FUNCTION OF SATISFACTION WITH THE SAMPLE 
DIVIDED BY AGE, PROFESSION, AND YEARS 
SERVICE IN COMPANY 


Š 7 Correla- 
S; l N 
amp. e tion n 
Age 
Under 30 24 19 2.28 
30 and over 40 80 2.17 
Profession 
Engineer 31 at 2.16 
Nonengineer 33 82* 2.27 
Service 
Under 3 yr. 35 78 2.15 
3 yr. and over 29 81 2.29 
*p =.05 


count for its survival for over 20 yr. While it 
is impossible to make a final statement at this 
time, based on this and other studies, strong 
support can be given for utility being a power 
function of money with the increasing or de- 
creasing marginal utility being a function of 
both the situation and the characteristics of 
Ss. 
It would appear that our results from the 
equity group (seven employees for whom util- 
ity was not a monotonically increasing func- 
tion of money) are another example of utility 
function confounding (Conrath & Deci, 1969). 


It has been found, especially in game situa- 


tions, that occasionally the utility for out- 
comes is not monotonically related to the re- 
wards ascribed to the outcomes. This phe- 
nomena often occurs in an interpersonal inter- 
action where not only the individual’s absolute 
reward, but reward relative to others, is in- 
volved in the choice process. The utility func- 
tion confounding obtained in the present in- 
vestigation appears not to be a result of com- 
parison with someone else, but an estimation 
of equity and relative risk. From the com- 
ments received by slightly over 10% of the 


sample whose satisfaction was not a mono- 


tonically increasing function of salary, there 


was a point reached where a larger merit in- 
y. The worry ex- 


crease gave cause for Worry 
pressed was related both to the feeling that a 
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very large increase in salary would be un- 
earned and that there would be performance 
expectations which could not be met. From 
the comments received, it is evident that a 
small minority of employees would not be 
happy with extremely large increases in salary. 

There appears to be a mixture of both 
equity and risk involved in the perception of 
those whose utility functions are not mono- 
tonically increasing. The equity component 
relates to their desire not to be overpaid for 
the responsibility of their job, while some 
perceive a risk in a large salary increase since 
this might imply an expectation of higher job 
performance than the individual could meet. 
The perceived inequity of too large a raise 
seems to fit nicely Adams’ (1963) theory of 
equity. 

The company investigated was a relatively 
young, fast growing, technically based organi- 
zation. We could speculate that a company 
with different characteristics might have em- 
ployees with a different power function and 
percentage in the equity group. For example, 
one would predict that an older, more con- 
servative firm would have a larger percentage 
in the equity group and a smaller exponent 
in the power function. It could be hypothe- 
sized that in a more conservative organiza- 
tion which gave smaller 
large increase would evo 


The differences in individual responses to 
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appears to be unable to embrace these find- 
ings. If these results are replicated, it would 
appear that conventional economic concepts 
must take into account other complex psycho- 
logical variables which elicit differential re- 
sponses from employees, 

It is recognized that the magnitude estima- 
tion technique does have certain shortcom- 
ings. Most critical perhaps for our present 
investigation was the number 10 used as a 
standard. It is known that size of the number 
influences the exponent in the power function 
(Poulton, 1968). In addition, both the range 
and distribution of merit increases used in the 
questionnaire may influence the response. 

Some Ss in responding to a questionnaire of 
this type might attempt to order the merit 
increases in ascending values and adjust their 
responses accordingly, For this reason, the 
increments between listed merit increases 
were not uniform. The bunching of merit in- 
creases between $500 and $1,100 also served 
the purpose of collecting close incremental 
responses above and below the values most 
likely to be chosen as an equitable merit in- 
crease. This nonuniform spacing or bunching 


however, may in itself, influence the consis- 
tency of Ss’ response, 


One solution to this 


Ss to compare and s 
would 


Possible tendency of 
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EFFECTS OF IRRELEVANT STIMULI ON THE PROCESSING 
OF INFORMATION IN COMPLEX DISPLAYS! 


IRWIN L. GOLDSTEIN? 


University of Maryland 


The use of response categories refers to the 
response labels by which observers are asked 
to group and organize stimuli which appear in 
complex displays. Several experiments (Long, 
Henneman, & Garvey, 1960; Long, Reid, & 
Henneman, 1960; Reid, Henneman, & Long, 
1960) produced results that showed increased 
frequency of correct identifications with in- 
creases in category restrictions. An increase in 
category restrictions js generally accepted to 
mean an increase in the number of response 
categories so that they are more descriptive 
of the specific characteristics of the stimuli. 
For example, total geometri 
as restricted as squares 


, triangles, circles, and 
crescents, Generally, 


improvement in perfor- 
ating that the cate- 
response alternatives 
npeting responses and 
nable aspects of the 
7 (Goldstein, 1966) found 


large numbers of objec 
displays used in that study 
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University of Tennessee 


and gas tanks which S was instructed to ignore 
while he grouped relevant stimuli into the 
Proper response categories. The effects of the 
irrelevant stimuli were not assessed in that 
study. Yet, it is now apparent (Goldstein, 
Johnston, & Howell, 1969; Kibler, 1965) that 
the observer in the man-machine system iS 
going to have to Monitor more and more com- 
Plex displays which in Many cases means that 
he will have to Process relevant information 
intermixed with irrelevę 


ant stimuli. A 
Irrelevant stimuli are often defined as tha 
class of stimuli to w 


hich no overt response 15 
required. Unfortunately, the effects of irrele- 
vant stimuli on performance has not been 
thoroughly investigated and some of the data 
obtained has been contradictory, Using ae 
Sponse time as the dependent variable ané 
investigator (Archer, 1954) did not find any 
effect as irrelevant stimuli increased while 
other investigators (Crawford, 1962; Eriksen, 
1955; Rabbitt, 1964) have noted significantly 


: + rill 
rhen irrelevant stimuli 
ay. Studies using a 
8) etc.) as a dependen 


igation of effects of 1 
are 


The present stud 
of Irrelevant info 
response time 


z p sects 
Y Investigated the sare 
™mation as measured by be A 
and counting errors, More eo 
att r i r 
“Honship between irrelevant a 
ee i 
i Ponse Organization was 
vestigated with the OPE of pinpointing s0™ 
. an 
reasons why irrelevs 
: í 
a detrime? 


PROCESSING OF INFORMATION IN COMPLEX DISPLAYS 


METHOD 


di Subjects. The Ss were 40 undergraduate volunteers 
evida equally into the relevant or the relevant- 
th ‘levant condition. The Ss received course credit for 
cir participation. 
Fi at The displays used in this experiment 
bps of 35-mm. slides taken of four relevant 
tirele io (squares, triangles, crescents, and 
the a s he displays were made by randomly placing 
i objects in a 64-cell matrix. The lines defining the 
atrix did not appear in the final slides. 
Or ge dimension for the relevant group was 
aleis y changing the number of each of the four 
ANAE Ls stimulus objects appearing 1n any scene. The 
Henan eyel of the stimulus complexity dimension were 
Sati sg as follows: A low level (Cy) included 1, 2, or 3 
feito: ie each relevant stimulus class; an inter- 
clase= 7 evel (Cz) included 4, 5, or 6 stimuli from each 
irom e Spipples level (Ca) included 7, 8, or 9 stimuli 
vants er class. The stimulus dimension for the rele- 
relevant: evant group was exactly the same for the 
Tilas t stimuli. In addition, there were two types of 
thin objects on the display; pentagons and 
sihigh ns, (A nothingon consisted of two circles 
irrele Slightly overlap each other.) The number of 
level vant stimuli on the display corresponded to the 
s A ei that is, for the low level (Cı) 1, 2, 
Pr an the pentagon and the nothingon. , 
an ure. The Ss in the relevant group were given 
Pap ora, which corresponded to three levels of 
cate icity of response classification. For the reponse 
to tear designated as Ry (one subclass) S was required 
isles E the total number of geometric objects on the 
risca y. These include squares, triangles, circles, and 
ee nts.” For the response subclass designated as Rə 
Pr Near prank’ S was required to “tell E the total 
clude er of rounded objects on the displays (these in- 
sided ae and crescents) and the total number of 
Por ene (these include triangles and squares). 
Classes) ponse subclass designated as R4 (four sub- 
, S was required to “tell Æ the total number of 


TABLE 1 
Mean Response Time (IN SEC.) FOR THE 
RELEVANT GROUP AND THE RELEVANT- 
IRRELEVANT GROUP 


Response categories 
late eee 
Stimulus dimension | 4 gup- | 2 Sub- | 4 Sub- 
class | classes classes 
(Ri) (R) (Ra) 
Relevant group 
fed (En 396 | 6.28 | 685 
Corat (O) 730 | 13.90 | 16.13 
Relewnblex (Ca) 13.72 | 21.66 | 2462 
Si Vant-Irrelevant group 
Mable (Cy) 6.14 | 9.63 | 955 
Comm (Ca) 1322 | 20.62 | 21.40 
wou (G) 2091 | 2837 | 32:55 
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squares, the total number of triangles, the total number 
of circles, and the total number of crescents.” The Ss 
in the relevant-irrelevant group were given the exact 
same instructions except that they were told there 
would be two other objects, nothingons, and pentagons, 
which they were to ignore. The Ss in both groups were 
given several practice trials and each S was tested to be 
certain that he could easily discriminate the different 
stimuli from each other. The Ss in each group were 
read the instructions for one of the three response 
classes and were then shown nine slides, three for each 
level of stimulus complexity. The same procedure was 
followed for the second and third response subclasses. 
The order of the three response subclasses was random- 
ized for each S as were the nine slides shown for each 
of the subclasses. Thus, the relevant versus relevant- 
irrelevant factor was a between-group variable while 
the response categories and stimulus complexity were 
within-group variables. 


RESULTS AND DISCUSSION 


Mean response time. Table 1 summarizes the 
mean response times (in sec.) for the 20 Ss 
in the relevant group and the 20 Ss in the 
relevant-irrelevant group. An analysis of vari- 
ance was used to test the significance of the 
various effects (Winer, 1962, p. 319). The F 
ratios for the stimulus dimension (F = 499.75, 
df = 2, 76) and for the response dimension 
(F = 161.39, df = 2 76) were significant at 
p < 001. As can be seen from Table 1, there 
was an increase in the time scores as both the 
stimulus complexity and the number of re- 
sponse categories increased. A significant stim- 
ulus by response category interaction (F = 
41.27, df = 4, 152, p < .001) indicated that 
the more complex stimuli as compared to the 
simple stimuli took proportionally longer to 
classify using the more restrictive response 
categories (i.e. the four subclass category). 
These data confirmed earlier studies (Gold- 
stein, 1966) indicating that it obviously re- 
quired more time to count more objects and 
more important that the increased discrimi- 
nation required by the more restrictive re- 
sponse subclasses led to a longer response time. 
That is, Ss in the more restrictive response 
subclass (R4) counted and named separately 
all four geometric objects instead of simply 
counting the total number of geometric objects 
(Ri). It certainly appears possible to conclude 
on the basis of the studies completed that re- 
gardless of the beneficial effects of response 
categories in decreasing the number of classifi- 
cation errors, it will take longer processing 
time for these more complex judgments. 
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TABLE 2 


VAN’ JP 
MEAN ERROR SCORES FOR THE RELEVANT Grou 
; AND THE RELEVANT-IRRELEVANT GROUP 


Response categories 
a 
Stimulus dimension 1 Sub- | 2 Sub- | 4 Sub- 
class | classes classes 
(Ra) (Rə) (Ra) 
Relevant group - i 
Simple (Cı) o y D 
Medium (C:) J i. ia 
Complex (C3) 1.00 . i 
Relevant-Irrelevant group ” e 
Simple (C,) -10 23 
Medium (C4) -57 1.47 Ka 
Complex (C4) 1.53 2.25 f 


y, a signifi- 
y by relevancy inter- 

176, p< -001) indi- 
mplexity increased, 


Sroup performed pro- 
Portionally worse than the relevant group. 


Berlyne (1958) has Suggested that the human 
acts as a commu i 


will pass 


locked without a 
In Processi 


cessing time, T 


a detrimenta 
elevant, Th ee-way Rele- 
vancy X Response X Stimulus interaction was 
not significant (p> .05) 
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Mean error scores. Table 2 mies ich 
mean error scores for both the relevant ~~ 
irrelevant group. An error was defined ae 
underestimation or overestimation of the: a 
number of relevant stimulus objects. O 
purpose of the following analyses, thsstne ne 
were treated on an absolute basis, disregarding 

gn. tT 
we can be seen from an examination of ae 
2, the number of errors increased as a 
complexity increased (F= 182.06, df = ae. 
$ < .001). In other words, there were mM K 
errors with an increased number of ogee 
result which should be expected. Also, th 
relevant-irrelevant effect was Significant (F a 
15.50, df = 1, 38, p< 001). This main one 
generally indicated that S made more ee 7 
(see Table 2) when irrelevant objects nits 
Present. An even more important aspect of t 4 
part of these data are the interesting e 
actions that showed the most detrimental el 


“evant group. This effect PN 
yY examining the significant in 
Sponse (F = 12.99, df = 2, 76, p< wee 
Stimulus x Response interaction (fF = be 
I= 4,132, p< -01), and Response X ne 
vancy interaction (F = 404, df= 2, a 
P < 05). The three-way Relevancy X R€ 
Sponse X Stimulus inter, 2: 
cant (p > -05). Table 3 shows th 

ecame progressiy, y os bE 
aS Compared to R, response as the stimuli as 
came more and more complex, However, 4, 
can be seen by an examination of Table in 
Proportionally more of the errors occurred 5 
the relevant-irrelevant Soup with use of ea 
sponse Category 2 and 1, Thus, performan“ 
was Senerally w 


re 
b ver 
orse for those Ss which W 
TABLE 3 a 
pU” 
MEAN Error Scores as 4 FUNCTION op Srimv! 
Comprexiry Anp Response, Ç, A 
= 
| Response categories = 
Stimulus ia Tae — b- 
dimension | 1 Sub- 2 Sub- 4 P 
| class classes cles 


(Ra) 


ea ee | 


Simple (Ci) | 
Medium (C,) | 
‘omplex (C3) | 


Ra | a 


08 
38 
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presented with irrelevant information and that 
decrement in performance was proportionally 
worse for Ss in the relevant-irrelevant group 
when they used Response Category 2 and 1. 
lhis generally higher error rate for groups 
which must process information in the pres- 
ence of irrelevant stimuli shows that for this 
task Ss longer response times for the irrelevant 
Conditions did not compensate Ss and lead 
fo performance similar to relevant Ss. Thus, 
irrelevant stimuli add to processing time and 
also contribute to a higher error rate. An 
ee of the Rs and R: category for the 
that R group suggests one way 
iita Si evant stimuli may add to the error 
- The irrelevant objects were sided objects 
(pentagon) and round objects (nothingon) and 
oh aS quite likely that these two irrelevant 
fe eS interfered with the two-response cate- 
eid called sided versus round. It is also 
A coe to speculate that these two irrele- 
On objects caused some difficulty with the 
ae ee category called total geometric 
cake, since these objects were indeed geo- 
is objects. However, in the four-response 
Category group, each of the four relevant ob- 
a identified by their specilic name (e.g. 
ave id, S; crescent). The two irrelevant objects 
fiora ips with different names and shapes 
$ e relevant objects and might not be ex- 
ie to cause as much interference. Indeed, 
ror amination of Table 4 indicates that the 
TOE for the relevant group and the 
the ea -irrelevant group were quite similar for 
tow: a ir-response category. Certainly, there is 
respon Possibility that our original belief see 
ar nae categories can be used to help 
Sanize information must now be reconsidered 


M TABLE 4 
Man fe 7 F 
N Error Scores as A FUNCTION 
= CATEGORIES AND RELEVANCY x 


or RESPONSE 


Response categories 
— 


Releys eo = 
ancy conditions | 1 Sub- | 2 Sub- 4 Sub- 
class classes classes 
m R) | R) | 
en A m | s 
Nelevant-Irrelevant 33 i32 | 72 
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to the extent to which there is irrelevant 
information present which may interfere with 


the use of the categories. 

In any case, the irrelevant stimuli generally 
retarded performance. As displays tend to be- 
come more complex with the operator expected 
to pick out the pertinent (relevant) informa- 
tion from the background (irrelevant) stimuli, 
the data presented in this study indicate it will 
be necessary to consider ways in which to over- 
come the presence of “irrelevant stimulation.” 


REFERENCES 

ARCHER, J. E. Identification of visual patterns as a 
function of information load. Journal of Experi- 
mental Psychology, 1954, 48, 313-317. 

Bertyne, D. E. The influence of complexity and 
novelty in visual figures on orienting responses. 
Journal of Experimental Psychology, 1958, 55, 
289-296. 

Bourne, L. E., & RESTLE, F. Mathematical theory of 
concept identification. Psychological Review, 1959, 
66, 278-296. 

Crawrorp, A. The perception of light signals: The 
effect of the number of irrelevant lights. Ergonomics, 
1962, 5, 417-428. 

> C. W. Partitioning and saturation of visual 

cy of visual search. Journal of 

1955, 39, 73-77. 

fects of stimulus complexity 

belasses on observer re- 

1966, 


ERIKSEN, 
display and efficient 
Applied Psychology, 

GoxpsTEw, I. L. The el 
and restrictive response su 
sponses. Journal of Experimental Psychology, 
71, 104-108. 

GoLpsTEIN, I. L., JOHNSTON, W., & HowELL, W. Com- 
plex vigilance: Relevant and irrelevant signals. 
Journal of A pplied Psychology, 1969, 53, 45-48. 

Kioter, A. W. The relevance of vigilance research to 
aerospace monitoring tasks. Human Factors, 1965, 
7, 93-99. 

Lone, E. R., HENNEMAN, R. 
experimental analysis of set: 
modality. American Journal of Psychology, 
73, 563-567. 

Lone, E. R., Rep, L. S., & 1 i 4 
experimental analysis of set: Variables influencing 
the identification of ambiguous visual stimulus- 
objects. American Journal of Psychology, 1960, 73, 
553-562. 

Rappitr, P. M. A. Ignoring irrelevant information. 
British Journal of Psychology, 1964, 55, 403-414. 

L. S., HENNEMAN, R. S., & Lone, E. R. An ex- 


H., & Garvey, W. D. An 
The role of sense 
1960, 


HenNEMAN, R. H. An 


REID, 1 
perimental analysis of set: The effects of categorical 
restriction. American Journal of Psychology, 1960, 


73, 568-572. 
Wrner, B. J. Statistical principles in experimental design. 
New York: McGraw-Hill, 1962. 


(Received November 14, 1969) 


Applied Psychology 
Joena al 55, No. 2, 114-118 


INTERACTION OF ENVIRONMENT AND PERSONALITY: 
A TEST OF HOLLAND’S THEORY! 


CHARLES F. 


. ELTON? 


University of Kentucky 


Holland’s assumption that personality v: 
tested by the following hypotheses: (a) F 
engineering (n = 25). (b) For males w 
personality development is different from 
Two dimensions represented the differenc 
Intellectual Independence and Practical 
from engineering were described as 
intellectual, liberal, and skeptical of or 
showed greater variability in their pe 
liking for reflective and abstract thought. 


Holland (1966) proposed the possibility of 
characterizing persons according to one of six 
personality types: realistic, intellectual 


, Social, 
conventional, enterprising, and artistic, The 


more closely a person resembles a particular 
personality type, the more likely it is that he 
will exhibit personal traits and behaviors 


as- 
sociated with that type. Furthermore, the 


environment in which a person lives may be 
described according to six model environments 
which are analogous to the personality types, 
An essential aspect of Holland’s theory is that 
Outcomes can be predicted on the basis of 
knowledge concerning the congruence between 
personality type and environment. For ex- 
ample, a person characterized as an enterpris- 
ing personality type would be expected to be 
most productive while working in an enterpris- 
ing environment. Another Prediction which 
emerges from the interaction of personality 
and environment is related to vocational 
stability. Students whose Personality char- 
acteristics are incongruent with t 
tional choice would be 
majors. Data reported 
(1966) support this prediction. 

A more crucial test of the theory that person- 
ality type interacts with environment is found 
in the hypothesis that a student who changes 
his major should change in his personality 


1This article was accepted for 
outgoing Editor, Dr. Kenne 

* Requests for reprints sh 
College of Education, Univ. 
ton, Kentucky 40506. 


heir voca- 
predicted to change 
by Elton and Rose 


Publication þy 
th E, Clark. ws 
ould be sent tot 


l he author 
ersity of Kentuck s 


Y, Lexing- 
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ariables interact with environment was 


or males who leave engineering (7 = 27) 
there is a personality change different from that fo 


und among those who remain in 


ho leave or remain in engineering, the 
that of males in a 2-yr. college (n = 22). 
es between groups and these were labeled 
Inclination. Transfers to arts and sciences 
becoming more realistic, 


thodox religious beliefs. Tw 
sonality 


nonjudgmental, 


o-year students 
change scores and a loss in their 


; se in the 
development; while one who remains in th 
same major will not change in the same way- t 
corollary hypothesis is that an environmen 


allowing congruence between 


personality an 
vocational 


choice will produce personality 
changes which are different from those pro 
duced by an environment prohibiting co™ 
gtuence between personality and vocation4 
choice. l, 
The purpose of this study is to investigat? 
e following two hypotheses: A 
1. For males who leave engineering there X 
personality change different from that foun 
among those who remain in engineering. F 

2. For university males who remain 4 
engineering, as well as those who chang” 
majors, the personality development is dv 
ferent from that of males in a 2-yr, colles™ 


th 


METHOD 
The Omnibus P 
was administered to the 1965 f pae 
S adn reshman classes ty 
University of Kentucky and at the Ashland Communis 
College in Ashland, entucky. From lists of stl “oe 
who were continuously en, | sters, yes 
cluding the summer re cee ba 
were randomly 


nn 


C 
ersonality Inventory (OPI), For ye 


» three samples of 50 en! 

selected. Two samples repress the 
ents, and one was fversit 

College. One of the Unive™s. 


consisted of students who 
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Ey the last month of their fourth semester in col- 
btidenr as meen hive to participate in this testing, each 
finally ale offered $2 for his cooperation. Data were 
ae A ected for 27 arts and sciences transfers, 25 
Deive OPT ee and 22 Ashland males. 
Kentucky f, T scale scores of the total University of 
and 1966 freshman population who enrolled in 1965 
ponents ¢ ere subjected, by sex, to a principal com- 
Social ee Your scales, Developmental Status, 
authorits maturity Repression-Suppression, and Non- 
of thea ainnis were omitted from analysis because 
compl ah arge item overlap with the other scales. A 
deval ete description of the OPI scales and their 
fore ee may be found in the oPl research manual 
es Hor the Study of Higher Education, 1962). 
stability ¢ y the relationship between size of N and 
from this rr actor scores, the factor weights derived 
analysis eos and 1966 University of Kentucky factor 
students were used to estimate factor scores for the 
factor st in each of the three samples in this study. This 
in Rose ructuve and the OPI scale weights are reported 
with a and Elton (1968). To estimate factor scores 
Proced mean of 50 and a standard deviation of 10, the 
Wi ure suggested by Lawley and Maxwell (1963) 
as followed. 
ase mation in the factor structure between the first 
meai n administration of the OPI could shift the 
stone the factor scores. To show stability t factor 
Scores, T between freshman and senior OPI scale 
canoni arar (1964) described an innovative use ol 
Scores ie correlation. Because, in this study, factor 
pie Sr than scale scores were employed, it was 
ing in fe to eliminate the possibility of an altered mean- 
to the ieee score by applying identical factor weights 
The 1¢ 5 and 1967 OPI scale scores for each studeni 
significa. 965 OPI factor scores were tested for the 
test men of mean differences between gro 
05 le d none of these values were significant 
“evel of confidence. 
sedis, sate student in this study, five estimated factor 
the OPL = available from the 1965 administration © 
Available ane an equal number of factor scores v 
Seemed rom the 1967 administration of the DEF 
Which w, desirable to develop a measure of change 
change ld reflect both the amount and direction of 
Wo tin n each student’s factor scores between t Le 
(19 ne periods. Tucker, Damarin, and Messic! 
Measure oP one method for obtaining 2 ae 
Measure of change and this method = = ser 
€ attril change insures that observed changi . i: 
ive 1968 ea to regression effects. Consequent D Pr 
factor s OPI factor scores and the five 1907 i 
‘Ores aoe were reduced to five OPI factor C im 
Uba student. These scores Were obtained b) 
Score, g the freshman score from the sophomore 


ups by a 
at the 


Se 


es- 


answer the qu 


R Discriminant 
these three 


lon; s 
Sampl; 


A analysis was used to 
t re the factor change scores between 
of males different?” 


RESULTS 


Table 
Scor ngs 1 presents the mean factor change 
i į males. 


S sP 
between the three samples 0! 
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TABLE 1 
Mean OPI Factor SCORES AND 
DISCRIMINANT WEIGHTS 
Si Discriminant 
f Si weight 
OPI factor an 
sci- 
ences? M M2 
Nonconformity [izis | 17.96 | -05 
Scholarly Orientation —4.43 | 6.29 | .10 
Nonauthoritarianism 20.55 | 9.30 | —.08 
Social Discomfort 10.80 | 11.16 | —.02 
Masculine Role 13.06 | 21.23 | -02 


Note. -OPI = Omnibus Personality Inventory. 
an 

bn e 

en = 25. 


‘The discriminating power of the predictors 
was tested by chi-square with p(k — 1) 
degrees of freedom, where p is the number of 
variables and & is the number of groups. This 
chi-square value was 61.57 which, with 10 
degrees of freedom, was significant at the .001 
level. Each discriminant dimension was tested 
by chi-square to evaluate its separate contribu- 
tion to discrimination. On the first dimension, 
which accounted for 88% of the variation be- 
tween groups, the chi-square value was 56.76, 
significant at the .001 level with 6 degrees of 
freedom (p + k — 2). The second discrimin- 
accounting for 22% of the 
yariation between the groups, produced a 
chi-square value of 10.39, which, with 4 degrees 
of freedom ($ + k — 4), was significant at the 
05 level of confidence. Additional dimensions 
failed to achieve significance at the .05 level 
by the chi-square test. 

“The F statist ic was used to test the equality 
between group means of the OPI factor 
change scores. Students transferring to the 
College of Arts and Sciences and those re- 
maining in engineering obtained significantly 
higher change scores on the variable of Schol- 
arly Orientation (p< .01) than did students 
at the Ashland Community College. The arts 
and sciences transfers, compared to those re- 
maining in engineering, obtained a significantly 
higher score on the factor of Nonauthori- 
tarianism (p < .05). Other group mean change 
scores did not differ significantly at the .05 
level of confidence. 

The effectiveness of f 
classifying the students into three gre 
2, The best discrimination was 


shown in Table 2 
obtained for the Ashland students, the poorest 


ant dimension, 


actor change scores 1n 
three groups 1S 
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2 Holland’s assumption that pesani A p 
, i i ras 
ie [EMBERSHIP interacts with environment. In cont a lie 
EU Deion Tow Men in engineering, the transfers to ar a 
| | sciences may be described as becoming mor 
3 Arts | Ash- | Engi- | % realistic, nonjudgmental, intellectually n a 
Group gal | land | neering) hits and skeptical of orthodox religious - Be 
i i; : . $ te c4 vs 
| | 52 Noteworthy is the fact that signific ok of 
Arts and sciences i a | | 90 ferences were not found in the oe 
Ashland 6 2 | 17 68 change in Nonconformity, Scholar A “rr 
al tion, Social Discomfort, and Masculin 
scores, 


for the arts and sciences transfers, with the 
engineers falling between these two extremes, 
inane 1 is a plot of the mean discrimi- 
nant function values on the two sig 
dimensions, i 
The horizontal dimension 


The environmental differences between = 
dents remaining in engineering and those oe 
ferring to arts and sciences are Presumed: r, 
nificant those associated with faculty, oe 

peer influences, Not collected were data hire of 
in Figure 1 repre- would allow inferences about those aspects ini 
j the environment related to observed change 
Personality factor scores, — 
Two-year Students differed from oe 
differentiation on the first dimen eir personality change sco 
labeled Intellectual Independen 


ine, ane 

as both quantitativ nna 

arly Orientation Scores, positively weighted qualitative aspects. The quantitative ¢ 
a S ° 

negatively ence may be 


. iahility be- 
observed jn the variability me 
weighted. High scores on Positively w eighted tween groups in mean change scor i Re ad 
Predictors and low scores on negatively range in change Scores for the Ashland Ss 
weighted Predictors produce a high dimension 


24.98; for engineers jt was 14.94; for the = 
Score; a low score on a dimension is the result and sciences transfers it was 8.71. The eas 
i Predictors g5 had almost three times the variabillt) 
eighted Pre- of the transfers to arts and sciences bs f 
more than one and a half limes the variability 

in Table 1 indicate that of engineers, 
The patter 


n of score ch 
qualitative 


r ag the 
ange illustrates 
differences, 


On one variable 

have a high Scholarly Orientation—2.y. Ss earned m 

mension score, The mean dimension Scores lowest mean change Score, in fact, a negativ, 
Plotted in Figure 1 Support this Prediction 


Score. On 
TS to the second di- anism—2-yr, students ¢ 
mension labeled Practical inati 


mean change score N 


itari 
one origi (8 
armed the high 


“YT. students because of oa 
r by Scholarly oneni 

» though, that the mia 
mMm engineer- Ashland change score on Nonauthoritarian! 
Compared to those re- 


aw 
as a meaningful effect, but Opposite to fa 
Produced by its Scholarly Orientation eae 
Taan The differences found among 2-yr. CO tw 
students Compared to those in the other ol 
The finding of differen tia] change in the Stoups are in the direction predicted by e 
Nonauthoritarianism Score for Ss tr, land’s theory, It is Presumed that the © 
to arts and sciences Compared to vironmental difference 
maining In 


engineering lends su campus and the 


transferring to arts and sci 
ing change differently 
maining in engineering, 
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PRACTICAL INCLINATION 


a —+ Ashland 


Fig. 1. Mean va 


n py 
apean ig the observed differences. 
is ly, Ashland males did not have al 
to contrast and compare their 
personality characteristics with 
equirements, peers, or faculty who 
tie ay as vocational role models. In the 
at Ashla Such models, it is not surprising 
Pesona Ss showed the largest variability 
Other ig ity development. Or, stated an- 
“nrolleg > Personality change among students 
9 be inin a 2-vr. college would be expected 
“eulay 'uenced less by faculty, peen and cur- 
“mong stmnuences than are changes found 
Progra udents enrolled in a traditional college 


e an 


lues of dimensions in discri 


+ Engineering 


-50 


+ Arts & Sciences 


JONATA NAdIGNI IVNLOATISLNI 


a2 


minant space. 


One aspect of the difference found between 
Ashland and University Ss invites a specula- 
tive comment. This difference is portrayed by 

of Intellectual Independence in 


the dimension 
Figure 1. Could it be that the transfer program 


at a 2-yr. college produces a change in per- 


sonality characteristics which is antithetical 


to curricular objectives which stress intel- 
lectual independence? If so, it would be pre- 
dicted that Ashland students with negative 
Scholarly Orientation change scores who trans- 
ferred to a 4-yr. science program would fail in 
larger numbers than would those with positive 
change scores. The significance of this specula- 
tion is enhanced by data reported by Furneaux 
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(1962) which indicate that personality char- 
acteristics operate as moderating variables in 
academic achievement. 

Plant and Minium (1967) showed that 
change in personality scores was related to 
ability. The mean American College Test 
(ACT) score for the Ashland Ss was 20; it was 
24 for the transfers to arts and sciences and 
those remaining in engineering. The correla- 
tion between Scholarly Orientation and ACT 
was, however, only .08. Therefore, it does not 
seem probable that ability differences between 
2- and 4-yr. students account for the observed 
differences in Scholarly Orientation scores. 
Additional evidence which Supports the in- 
dependence of ability and personality change 
scores was found in the significance difference 
in Nonauthoritarianism scores between arts 
and sciences transfers and engineers. The 
mean ACT scores were identical for these two 
groups of students, 

Consistent with the nature of many ex- 
Ploratory studies the present investigation 
Suggests a number of tentative conclusions, 
The findings cannot be presumed to be univer- 
sal; the small number of Ss and the absence of 
Cross-validated data mitigate against over- 
generalization. The data do emphasize the 
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heuristic value of relating observed change 4 
personality test scores to Holland’s theory 0 
vocational development. 
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JOB OFFER ACCEPTANCE BEHAVIOR 
AND REINFORCEMENT? 


JOHN M. IVANCEVICH anv JAMES H. DONNELLY * 


University of Kentucky 


The effects of a positive reinforcement pr 
ding a job offer acceptance after previously 


d in two separate companies. A total of 196 Ss (experi- 
rcement from those firms they decided to join, 
received no reinforcement. Those Ss receiving 
b offers at a significantly lower rate than those Ss not 


dissonance (college seniors rescin 
accepting) was investigate 
mental group) were provided reinfo 
while another 196 Ss (control group) 
reinforcement rescinded jo! 
receiving reinforcement. 


During the last decade the interest, debate, 
and controversy over such concepts as balance, 
congruity, and dissonance has not subsided. 
x anything, the interest in these concepts first 
displayed by psychologists has generated more 
interest recently among researchers interested 
communications, consumer behavior, and 
organizational analysis. 

o The focus of the attitude change theor 
PA Suci, and Tannenbaum (1957); Abel- 
and Rosenberg (1958); Katz and Stotland 
(1959) ; and Rosenberg (1960) involved the 
Principle of cognitive consistency. The ap? 
Ea to interpersonal behavior suggested by 
— (1946), as well as the approaches of 
pena (1953), and Festinger (1957, 1964) 
oe emphasize that individuals tend towar 

‘untaining cognitive balance, symmetry, OF 
consonance, 
fone umerous research efforts (Brehm & Conen 
Gutt; Cohen, Brehm, & Latane, 19595 Ehrlich, 

ittman, Schénback, & Mills, 1957; Engel, 


ies of 


O35 Freedman, 1965; Fromkin 1968; and 
Shikawa, 1968) have attempted to relate 


Ost- a fg . oa 
Post-decision behavior to cognitive dissonance. 


main theme of most of these empirical 
su orts is that after a person makes @ decision, 
: . 
te as purchasing a home or an automobile, 
ae is often a lack of consistency OF harmony 
amo ns 
on the person’s cognitions. 
a) ost-decision doubts are likely to Occu" when 
Psy ae decision is important to the individual 
yc] S u he we 
a oe and/or financially (Brehm, 
outgoin article was accepted for publication by the 
2 ng editor, Dr. Kenneth E. Clark- 5 
nelly for reprints should be sent to James H- 
Nive an or Rent teste Administra 
Tsity of Kentucky, Lexington, Kentucky 40500. 


Don 


ogram on a hypothesized indicator of 


1960; Culbertson, 1957), (b) a number of 
desirable alternatives are available (Anderson, 
Taylor, & Holloway, 1966), and (c) available 
alternatives are qualitatively dissimilar 
(Brehm & Cohen, 1959). 

The present research study attempts to re- 
late the effects of positive reinforcement on 2 
hypothesized indicator of post-decision dis- 
sonance. Specifically, it reports the findings of 
a recent investigation concerning the tendency 
of some college seniors to back out (hypothe- 
sized indicator of dissonance) of a job offer 
after formally notifying a company 


acceptance l 
that they will begin their careers with the firm 
immediately after graduation. The first job 


decision of the college graduate fully meets the 
ost-decision dissonance criteria discussed by 
psychologists. First, it is an important psycho- 
logical and financial decision for the graduating 
senior. Second, most college seniors receive a 
number of job offers from competing organiza- 
tions. Finally, the job offers are generally 
from organizations in different industries, from 
different locations, and involve different 
ibilities. 
a nl concern of the researchers in the 
present study is not to resolve the heated de- 
bate surrounding cognitive dissonance (Chapa- 
nis & Chapanis, 1964; Janis & Gilmore, 1965), 
but rather to determine if a program of positive 
reinforcement reduces þack-out frequencies 
(assumed indicator of dissonance) in organiza- 
tions. Consequently, the operational proposi- 
tion investigated is: 

The number of college graduates backing out after 
formally accepting a job offer (€.g-, through telephone 
conversation and/or letter) will be reduced if positive 
post-decision reinforcement is provided by the hiring 


company. 
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TABLE 1 
AVERAGE Back-Out PERCENTAGES 


No. of firms* Average % of back outs 


| 
| 
| <1 
| 1-2.99 
| 344.99 
3-7,99 
8-11.99 
12-15.99 
>16 


menenun 


9 companies hired more than 125 college 

earna yer, The range of college graduates hired was 

from 128 to 1,650. These figures include undergraduate and 
graduate degrees. 


METHOD 
Procedure 


Nineteen large industrial corporations were contacted 
personally and by questionnaire in 1966 and 1967 to 
determine the number of college graduates who, after 
formally accepting a position, later rejected the position 
or never showed up on the job. This inquiry indicated 
that, of the 19 participating firms contacted, each 
had reported some backing out among college gradu- 
ates. Table 1 indicates the approximate percentage? of 
back outs in the 19 large industrial companies, 

This initial inquiry indicated that some companies 
have a critical back-out problem. For example, one of 
the companies contacted had a back-out rate which was 
in excess of 16%. Thus, the researchers assumed that 
back-out rates could be reduced significantly in those 
companies with a back-out rate which is greater than 
5% if positive reinforcement on the part of the firm was 

initiated during the postdecision period. 


Participating Companies 


Two companies from the 19 originally studied that 
had back-out rates of approximately 10%, agreed to 
participate in this study. Company A is located in New 
York and during 1967 had hired through the main 
ge seniors for nontechnical 
administration and liberal arts 
ny B which is located in the 
Chicago area had, during 1967, hired 140 technical 
college graduates (e &-, engineering and Physical science 

| ial inquiry (i.e., when the 19 com- 
panies were contacted), it was determined that Com- 
pany A’s back 


~out rate was 10.407 and Company B’ 
back-out rate was 10.9%. i sat 


than 1% annually and they wi 
the backing-out problem. 
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Experimental and Conirol Groups 


During 1968 and through June of 1969, 5 ORED $ 
received 186 formal job offer acceptances from un m 
graduates and, during this same period, Company p 
received 206 formal job offer acceptances. One of ever) 
two acceptances at Company A was classified into a can- 
trol group and the other into an experimental rop 
For example, suppose that on April 2, College eo 
1, 2, and 3 (CS), CSs, and CS,) formally accepted. T a 
CS, was placed in the control group, CS» into bg tage 
mental group, and CS; into the control group. Then = 
April 6, CS; and CSs formally accepted. In this a 
CS; was placed into the experimental group and Bs 
into the control group. ‘This same classification proce 
dure was followed for all of the 186 formal job accept- 
ances at Company A and the 206 acceptances at Com- 
pany B. m 

The researchers were interested in whether ian 
accepting offers had other possible job alternatives. Ka 
20% random sample of those accepting and beginning 
their careers with Company A or Company B and ie 
those accepting and later backing out indicated Har 
each of the Ss had at least one other job offer. The 
average number of offers for th f 
porting to work was 3.6, while the average number 0 
offers of those accepting and backing out was 3.3. n 

Members of the control group at each company 
received some company literature and confirmation ete 
their notice of acceptance had been received by SS 
personnel department. The experimental groups, he 
ever, received three randomly spaced personal died 
Phone calls from the personnel department port 
stressed positive factors about the company and iy 
astuteness of the college senior’s company choice. a 
first of these randomly spaced calls was made immed! 


ately after the formal acceptance was received in the 
Personnel department. 
participants (i.e, 
Company B) wer 


ose accepting and re- 


in which the firm is locata 
es associated with worki 


RESULTS 


Table 2 presents the freq 


: King 
Juencies of backne 
out among colle 


ge seniors receiving positivt 
reinforcement (experimental group) and colleg 
seniors not receiving positive reinforcemen 
(control group) for Company A, Chi-squa 
analysis was used to test for statistically 5 
nificant differences, r 
In Table 3, the frequencies of backing oul a 
the experimental and control groups at pe 
pany B are shown, Once again, chi-sat 
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analysis was used to determine if a significant 
difference exists between groups. f 

re oye of Tables 2 and 3 indicates 
utilized ‘ pn reinforcement programs 
arene = ame companies had a significant 
cia hes n reducing the back-out rate. The 
Soine colege seniors backing out in the 
those eg ; — was significantly less than 
decision Cee group Ss who received no post- 
Gite ac ——— Thus, the empirical 
With anit in Tables 2 and 3 is consistent 

€ original proposition. 


Discussion 


Pi apap through a carefully planned 
tive at IS „program, personnel selectors ar- 
job Rar decision about which of the numerous 
i ‘andidates clearing all of the selection 
' ae are to receive employment offers. 
on qualified candidates supposedly re- 
College Lene offers. It has been found that some 
will lat seniors after formally accepting an offer, 
Biles rescind the previous acceptance. This 
Study a tendency was reduced in the present 
Ment hen systematic postdecision reinforce- 
a Programs were utilized. 
dee groups in both companies, the 
remaine | percentages during the investigation 
ad be : at approximately the same level they 
tain Lie to the study. The experimental 
ion in a showed a significant reduc- 
orceme me tendency to back out. The rein- 
influenci Program was equally effective ie 
Undergy, pi both technical and nontechnica 
Alu, aduates. 
rates ugh significant reductions in back-out 
orcement brought about through the pein 
Portant e strategy utilized, a number o i 
irst, j questions still remain ani i 
T it was not determined how man3 tele- 
€ calls or what type of reinforcement (i.e., 


I TABLE 2 
~~__Back-Our FREQUENCIES IN Company A 


Did c 
back out 


Did not 


back out backing out 


Group» 


Seed 


~berimental | 91 


o! | 2 2. 
Ntro! 
inn 83 10 10. 


“n A = 5.68, df = 1, p < 02 
in both groups. ` e 
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TABLE 3 
Back-Ovut FREQUENCIES IN Company B 


Go Did not Did % 
OUP back out | back out | backing out 
Experimental | 100 | 3 f 2a 
Control | o2 | n | 1068 

| Í 


Note.—x? = 4.90, df = 1, p < 05. 
àn = 103 in both groups. 


letters plus telephone calls, letters alone, tele- 
phone calls alone, or some other similar 
method) is most effective in reducing the back- 
out rate. For those organizations who believe 
they have an excessive back-out rate, it would 
seem worthwhile to investigate the effects of 
various reinforcement strategies upon backing 
out. Also, it was not determined what the best 
source of reinforcement would be. In this study, 
the personnel department initiated the rein- 
forcement. Perhaps, the prospective supervisor 
of the new employee (e.g. the production 
manager contacts incoming production per- 
sonnel) would serve as a more effective initiator 
of reinforcement. 

Second, it would be worthwhile to determine 
whether the positive reinforcement is related 
in any degree to the first year performance on 
and/or attitudes about the job of college gradu- 
ates. This can be accomplished by studying 
related to the control group 


these variables as h 
Ss during the first year 


and experimental group 


of employment. , 
Third, the exact reasons for backing out of a 


formal job ofier acceptance would provide 
valuable information to personnel depart- 
ments. The authors of the present study have 

backing out is caused by an in- 


inferred that g i 
tense state of dissonance caused by postdeci- 


sion anxiety. A more precise analysis, however, 
may find another explanation for the back-out 
phenomena. Interviews with seniors who have 
packed out could lead to programs which could, 
in some firms, almost completely eliminate 
backing out. 
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DISPLAY CHARACTERISTICS IN INSPECTION TASKS! 


ROBERT C. WILLIGES? ann HARRISON STREETER 


University of Illinois 


a inspectors were required to detect pin-hole defects in a visual scanning task 
isplay type (static vs. dynamic) and display arrangement (random vs ordered) 
were combined factorially in a within-S design. Static (inspector-paced) displays 
yielded more defect detections and fewer false alarm errors; whereas, deeby 
arrangement did not effect detections or false alarm errors. It was concluded that 
inspector-paced displays seem to be more beneficial than externally paced displays 


7 


when rapid scanning rates are required. 


is a tial inspection is a task that man 
Industrial my oa lsg upon:te perform m modern 
& decision as ate Inspection is essentially 
meets a s aa wW hether or nota particular item 
is mime ‘ified standard. Although this task 
Perations Rees industrial manufacturing 
Conducted’ urprisingly little research has been 
Performan on factors which affect inspector 
Onomy ef a Harris (1969) provided a tax- 
and the =a he various types of inspection tasks 
IN these aes behavorial elements involved 
actors Pr ks. It is important to understand 
affect chee in this taxonomy which 
Numan ins mance before the capabilities of the 
anq E can be adequately established 
nly bh age 4 increased. ; 

“esponses (Ba y, such variables as observing 
Complexity adalamente & Ayoub, 1969), task 
(Wallack > (Harris, 1966), costs and payofis 

ack & Adams, 1969), and defect rate 


arri i 
Mental i have been submitted to experi- 
ey ristics of the 


better inspector accuracy than those in which 
items are presented randomly or in static mode. 


METHOD 


Task. The inspection task involved detecting pin-hole 
defects in }-in. diameter transparent yellow plastic 
discs. The discs were displayed in groups of 600. 
Seventy-five discs with pin-hole defects appeared ran- 
domly within each group. 

Each group of discs was displayed in two ways—a 
static display with inspection discs mounted on a flat 
6 X 40 in. light table, and a dynamic display with 
inspection discs mounted on a moving translucent belt 
on a drug capsule sorting machine. The belt moved at a 
constant speed displaying approximately 100 discs at 
a time and presented the entire group of 600 discs in 20 
sec. Both types of displays were lighted by fluorescent 
tubes mounted underneath the white lucite display 


surfaces. 

Two arrangem 
displays were employ! 
pletely random with t 
overlap. The other arrangement was or 
X 75 matrix on 3-in. centers. 


Subjects and design. Eight female undergraduate stu- 
of Illinois participated in the 


ents of the 600 discs mounted on the 
ed. One arrangement was com- 
he restriction that no two discs 
dered in an 8 


display, a estigation. Characte A uie 

. inspection items and their subse- dents at the niversity C 

uent effects on i i fe > have experiment as part of a psychology course requirement. 

received mnspectar’ per armante = None of the Ss had previous experience as an industrial 
This ši experimental treatment. inspector. 

udy dealt with simple visual scanning A within-S factorial design consisting of two factors, 

i i r angeme! » Wa b a 

display arrang nt and display type, Was employed. 

‘1 gement were random and 


he 

study pst: 

Study was to investigat 
€ ing arrangements which could structure 

te. Pection task either by providing @ pat- 


The two levels of display arran; > v 
Display type was either a static 


ordered configurations. À x ; 
(inspector-paced) or dynamic (E-paced) display of 


discs. a , , 
to beginning the inspection task, 


Procedure. Prior z s 
mples of disc defects and was in- 


each S was shown exa 
she would be allowed 20 sec. to inspect 600 


ern 
(9) Š x 
§ Scan : N Š 
*Pection T ning or by pacing the rate of in? structed that s ct 60 
spectig t was hypothesized that @ visual — Giscs for similar defects. Each S then inspected discs in 
+, Hon task whi array four factorial combinations of display 

of ask whick 5 rdered array each of the four factorial com inations of display type 

h presents an a A and display arrangement. Order of presentation of the 

vide 
balanced across Ss. 


Items ; 
= NS in g Cae 
~ 5 1 a dynamic display should pro 


i 
tis. arat 

Mitras ar ae 3 
Leo} ticle was accepted for publication by the 


conditions was counter! 
Each S was given two inspec 
four experimental conditions, t 


tion trials on each of the 
he first trial serving as 
layer of clear acetate 


ng Edit 

` Requeste a? Dr. K =, Clark. : ae ; 
Willi a ests for Eht. aged to Robert c. practice. To facilitate scoring, a i 
i t oi sy ‘Was laid over each display and the S used a china marker 
É ver any disc she thought 


q S, i 
ology y niversity of Illinois, Departmen 
ampaign, Illinois 61820. 


pencil to mark on the acetate 0 
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TABLE 1 


AVERAGE FREQUENCY or DEFECT coreg AND 
FALSE ALARM Errors Over Brock or 25 
Derects ror Two Types or DISPLAYS 


| 


| r | False 
Display type Detection | alarms 
| 
raat 11.08 | 88 
Dynamic | 
Static 14.97 | oa 


was defective. Each S completed all of her trials within 
a single experimental period. 


RESULTS 


Analyses were conducted on both the num- 
ber of defects correctly detected and the num- 
ber of false alarm errors (marking a disc 
defective when it had no pin-hole defect). Since 
it was noted that most Ss did not inspect all 
the discs under static conditions, the results 
were analyzed over three blocks of discs, each 
containing 25 defects. A 2x 2 X 2X 3 re- 
peated-measures analysis of variance design 
consisting of display type, display arrange- 
ment, trials, and blocks was used on both 
dependent variables, 


False Alarms 


The only significant effect on the analysis 

false alarm errors wa 

display type (F = 19.50, df = 1, 7, $ < 01). 
false alarms are 

shown in the right portion of Table 1. Overall 


ore false alarms 
ynamic displays then under 

static displays, 

Detections 


Significant main effects on detection per- 
formance were f 


ound due to display type (F 
= 16.15, df= 7, p< :01), blocks (F 
= 21.54, df = 2, 14, p< :001), and trials 
(F = 31.03, df = lip< 001). The Display 
Type X Blocks interaction (F = 15.90, 4 
= 2, 14, p< -001); the Display Type X Dis- 
play Arrangement x Block: interaction (F 
= 5.55, df= 2, 14, p< 05); 


X Thirds interaction (F = 13. 
É < .001) were also significant, 

- Figure 1 depicts these Significant effects, 
Static displays resulted in higher frequencies of 
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defect detections than dynamic displays. Tha y 
mean detection frequency for each display ty a | 
is shown in the left portion of Table 1. Overal i 
there was a decrease in detections over tria 
blocks. This decrease, however, was due pri 
marily to static displays. Inspectors using dy- 
namic displays maintained an inferior but ceni 
stant detection rate whereas inspectors using 
static displays showed superior detection rate ; 
over the first 50 defects, but detected fewer 0 
the last 25 defects (Newman-Keuls, p < Oh 
Obviously, the latter result is due to the ue, 
that most inspectors using static displays ee 
not complete the inspection of all the discs i 
the time allotted. a 
The main effect of display arrangement W E 
not significant (p > .05). Figure 1 shows mi 
the only effect of display arrangement appearee 
when ordered arrangements were combine 
with static displays, Under this combinatioly 
detections decreased over each block of defec 
(Newman-Keuls, p < .05). all 
Figure 1 also clearly shows that overa i 
more defects were detected during the erpa 
mental trial than during the practice trial. T y 
improvement, however, is restricted priming 
to ordered arrangements when dynamic E, 
Plays are used and random arrangements WP 
Static displays 
P < .05). In or 
practice effect 
trials, the data were grouped into the orde , | 
| 
P 


. aie an 
Presentation of conditions (four levels) 4 
analyzed by means of 


analysis of variance, N 
Practice effects resulted f 
detection or false alarm 
Sequently, any 
to be restricted 
display types and 
overall practice, 


z oul: 
are employed (Newman Ke 
der to test for a possible ove ion 

during the eight inspect 


a repeated-measi i 
o significant pete 
rom analyses of a 
S data (p > .05). ee 
inspection improvement see” 
to particular combinations 


3 th a 
arrangements rather 


Discussion 


Structuring according to display arrange, 
ment seems to have little effect on insper gn 
performance. In fact, many Ss after comple ar? 
of the experiment stated they were not aW‘ a 


that discs had been displayed in both a ra? due 
and ordered fashi 


( 
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25 


20 


FREQUENCY OF DEFECTS DETECTED 


r—— PR ACTIE =A 
2 3 
BLOC 


Fic. 1. Frequency of defect detectio 
and experimental trials as 


Seems to h; 
ieee cee than enhance inspector 
ternally “sg bs hen inspectors are forced ex- 
Tate, they as their inspections at a rapid 
errors and f mse at the expense of increased 
ne ex Poa er detections overall. ; 
Spectors tenes of these data is that m- 
Speed, Im o phasized accuracy rather than 
erion of a in inspection tasks is the cr 
Paced, į accuracy. Consequently, when not 
Slower a will perform at @ much 
Many pa but with more accuracy. 
ast rate dos tasks, however, demand a 
splay a inspection and operate on @ paced 
ajor ae assembly line inspections). The 
Ectors ilt of this study suggests that in- 
àccordin T indeed, pace their inspections 
e N the display presentation rate, but 
Per lotma: result may be inferior inspection 
spect ee when compared to alternative 
these p "paced displays. The implication of 
tee for the design of inspection tasks 
Ore pen gt display seems to be 
ay wh cial than an externally paced dis- 
€n rapid scanning rates are required. In 


a function oi 


x ORDERED 
o RANDOM 
— DYNAMIC 
—— STATIC 


y 
o 
o 


rm EXPERIMENTAL—— 


| 


addition, the results sug 
eeds to be examined—inter- 


procedure which n 
mittent pacing. An 
intermittently wou 
group or chunk 0 


own rate while each group i 


paced. When external pacing 
procedure might prove 


such a scanning 
advantageous. 


2 3 


KS OF 25 SUCCESSIVE DEFECTS 


n in successive thirds of the total task during practice 
f display type and arrangement. 


ggest a third scanning 


inspector being paced 


ld be presented a static 
f items to be inspected at his 


s being externally 
is required, 
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PUPIL DILATION AND ITS RELATIONSHIP TO INFORMATION 
PROCESSING DURING READING AND LISTENING 


RONALD P. CARVER? 


American Institutes for Research, Washington, D. C. 


Pupil diameter has been advanced as an index of inform 
question of whether pupil diameter indicates informati 
reading was investigated in three experiments involving 2 
passages of varying difficulty. A TV camera 
recorder, were used to measure the diame! 


Research has suggested that pupil diameter 
indicates the degree of mental effort or mental 
activity involved in such tasks as problem 
solving (Hess & Plot, 1964), word imagery 
(Paivio & Simpson, 1966), short-term memory 
(Kahneman & Beatty, 1966), and tone dis- 
crimination (Kahneman & Beatty, 1967). 
Beatty and Kahneman (1966) further contend 
that the pupil diameter is a valid measure of 
the momentary state of mental effort. Stated 
differently, Kahneman and Beatty (1967) 
have referred to the validity of the pupillary 
measure as an index of information-processing 
load. They explain this concept by reference 
to an analogy with the aggregate demand for 
Power (total amperage) in an electrical system. 

It would be helpful in research on learning 
from connected discourse if there existed an 
objective indicator of the moment-to-moment 
information-processing activity of individuals, 
Presently, there is no method for either 
studying or controlling for these variables, 
Psychometric tests are employed to give post 
hoc estimates of these activities, but the 


1 This research was supported by the United States 
Office of Education, Re; 


gion III, under Grant No, 
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ation-processing load. The 
on-processing load during 
4 college students reading 
and monitor, together with a video 


ter of the pupil under a reading condition 
e was obtained which would suggest that 


esults was investigated 
ening to tape recordings 
uggested that the pupil 
ze cannot be used as an 
is, in fact, processing the infor- 
sedly engaged in reading 
y that dilations occurred 


imprecision of these methods make adyan 
ment in theory and practice extremely difficult: 
The major purpose of the following reseni 
was to investigate pupil dilation as a possib i 
indicator of the information-processing activ 
ties of individuals while reading and listening. 
A minor purpose of the research was he 
investigate whether the difficulty of t e 
material would be related to pupil size. Pay” i 
Parry, and Harasymiw (1968) studied puP 
dilation as a measure of the difficulty ° 
multiple-choice test questions. They stat? 
that pupil dilation is a sensitive and appare?! 
reliable indicator of internal informatio” 
Processing, and Warrants further study. The 
interpreted their results as suggesting tH 
pupillary dilation is an indicator of itë i 
difficulty but is not as good as another indicat? 
they studied (i.e., the time taken to comple” 
the item). These results do suggest, howe¥’ 
that the difficulty of the reading or listen” 
task may also affect the Pupillary dilation- , og 
In general, the Purpose of the follow’ 
investigate pupil dilate 
sing and storage 0 rse 
ained in connected discot 


po? 
i » the effect of reading " jes 
pupillary dilation 


p x 3 set 

y Was investigated in @ ted 
of experiments an 
to listening for th 


d then the mode was shi 
€ final experiment. 


PupIL DILATION AND INFORMATION PROCESSING 


EXPERIMENT I 


The major purpose of Exp. I was to deter- 
a effect of reading upon pupil dilation. 
tt ihe pt was to investigate the effect 
pupil a ty of the reading material upon 
Eio = loi Since dilation 1s always in 
brankan a control condition, the selection 
eet ome Ame control was critical. Three 
ONE a png itions were used: fixation upon a 
Aton a pproximatey 11 in. away, fixation 
Ta rae a of a series of numbers, and fixation 
whict aOR of the words in a reading passage 

ch had been turned upside down. 


Method 


a Eight college student volunteers were paid 
Gon hr. of participation in the experiment. 
n a ni. The pupil was videotaped, using @ 
Model W hich included a Panasonic TV camera, 
Makro-ki V-220P; an adjustable extension tube on & 
le Oe lens (1:2.8/90) manufactured by Heinz 
Video oci a Lin. video recorder, Panasonic 
(two 15 aster, 2100 SD; two adjustable desk lamps 
position: fluorescent tubes in each lamp); 2 hea 
forehead o apparatus, which provided @ stationary 
device foi aie and an adjustable chin rest; and a clamp 
The anh olding the stimulus cards. y 
orizontal camera was mounted on @ 22} x36 in. 
Was ma surface, 37 in. above floor level. The camera 
Same hoy nted so that its horizontal position was in the 
Oa e nta] plane as the eyes and at a 25° angle 
camera S perpendicular to the left eye. The eye and 
Mountin lens were approximately 13 in. above the 
the left e surface and the camera was focused upon 
€ach side 7 The fluorescent lamps were positioned, on 
Position pe the head so that each lamp was 17 a vertical 
line of si in, from the eye and at a 45° angle from the 
Mat sight, 
white ral The stimulus materials were 5X 8 in. 
number a cards which had been cut to 5 X 9 m. The 
Yped oi was a stimulus card with the numbers 1-9 
Card, th he card. The 1 was on the left part of the 
left, “and 2, top middle, the 3, top right, the 4, middle 
Points 6 so on. The numbers thus provided fixation 
Center n the card for eight border positions and one 
maT Position. 
containin were eight diferent reading „cards, each 
ante eal g one 150-word reading passage The passages 
ale d | 
rae difficulty by Miller and 
P} i selected to represent four 
dificult | Aquino (1969) has 
~ Dara, evel of these passages and has pw 
tion jp e2Phs in their entirety. “Their order of presenta- 
number at ay article are used as an identification 
aragra h en referring to specific passages- The eight 
Study H s (two at each difficulty level) used in the 
Aquing “¢,ctow" in Table 1 represented by their 
identification numbers and corresponding 
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TABLE 1 


AQUINO IDENTIFICATION NUMBERS AND IDENTIFYING 
Cope DESIGNATIONS FOR THE EIGHT PASSAGES 


Very dificult 


| Set 1 | Set 2 
Difficulty | 
3 i Identifi- : Identifi- 
Aquino | Cati Aquino eni 
cation 
| no. code no. ae 
Very easy oe i BA 10 1B 
Materatelyeasy | 22 | 28 5 2B 
Moderately difficult | 27 3A 32 3B 
| 33 | 4A | 6 4B 


number, and letter identification used in the present 


study. 

The comprehension tests for these reading passages 
were Cloze tests (see Taylor, 1953). Ten words were 
arbitrarily selected in each passage as test items and 
these words were deleted when the passage was 
retyped as a test. The items were all selected from the 
lexical words (i.e. nouns, adjectives, adverbs, and 
verbs) rather than from the function words (i.¢., 
articles, conjunctions, prepositions, and pronouns). The 
items were also selected to be approximately equally 


spaced throughout the passage. 

Instructions. The following instructions were read to 
S at the beginning of the experimental session: “This is 
an experiment to investigate the function of the eyes 
when reading. You will be asked to read some short 
paragraphs and then take a sentence completion test 
on the paragraphs.” 

Procedure. Upon entering the laboratory room, S was: 
(a) instructed to sit in front of the experimental 
apparatus; (b) given the instructions concerning the 
nature of the experiment; (c) asked to place his chin 
in a chin rest, and (d) asked to fixate upon the number 5 
on the center of the card in front of him. If necessary, 
E then adjusted: (a) 


the chin rest; (b) the height of 
the chair; or (c) the foc! 


us of the television camera. 
The E focused on S’s left eye so that its image was in 
the center of the TV monitor. 


After adjustment! 
away from the app 
preview of the practice procedur 
then asked to return to the chin r 
and position of the eye were readjus 
The S was instructed to close his eyes for a) 
3 sec. The Æ then activated a stopwatch, tedi 
to open his eyes, gerian me - 
taneously. The S fixated upon the number 5 in the 
center of the card (base-line fixation control condition, 
PR end of 20 sec., S was instructed to close his 
eyes and, after 5 sec., Was instructed to open them and 

0 of the numbers 1-9 on the card. The S 


ted to continue looking at each of the 
9 was reached, S was to return 


(eye-movement control condition, 


EMC). 
At the end of 20 sec., S W 
The £ then removed the nu 


as signaled to close his eyes. 
mber card from the viewing 


T p“ 
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TABLE 2 


PASSAGE PRESENTATION ORDER FOR A 
Set oF Four Ss 


Presentation order 


| 
mc lelsitatls lets te 

j | 

| 3B | 4B 

A | 2A | 3A | 4A | 1B | 2B | 

2| 4a | ta 2A | 3A | 4B | 1B | 2B | 3B 
3 | 3a | 4A 1A | 2A | 3B | 4B | 1B | 2B 
3 | 2A | 3a 4A | 1A | 2B | 3B | 4B | 1B 


position. The S was then instructed to open his eyes and 
begin reading immediately (condition, READ). 

When S finished the passage, he was instructed to 
close his eyes. The Æ recorded the reading time with 
an electric timer. With S’s eyes closed, E turned the 
passage card upside-down and then Instructed S to 
open his eyes and look at each word on the card as 
though he were reading (upside-down looking control 
condition, ULC). That is, S was t 
left corner, look at each upside- 
to right and continue down each 
the last word was looked at in 
corner of the card. The S was a 
trying to recognize the meanin: 
words while he was looking at them, 

The S was instructed to close his eyes when he 
finished reading. The £ used a timer to record the dura- 


ing upon each word. These pro- 
-time durations which were 

e corresponding reading-time 
After the ULC 
away from the apparatus and t; 


very difficult, moderately diff 
and very easy, 
his particular p 
some idea of wl 


cult, moderately easy, 
The S was told the difficulty level of 


he would have 


general design for the 
study. The Ss Were tested individually and were 
assigned sequentially to the four Latin-square Passage 
ented was always the 


lowing seven were experi- 
mental passages. However, in the data analysis, 


Passages 2-4 were also considered by E as Practice 
passages to allow anxiety and the learning to learn 
effect to stabilize. 

Measures. The Ss were sched, 


: uled at 2-hr, intervals 
and the hour between Ss w 


as used to record the data 


Ronatp P. CARVER 


from the videotape. Three data points were ampi 
from each of the four conditions (B FC, EMC, R $ d 
and ULC) for the four paragraphs (1B, 2B, 3B, m 
4B). Thus, a total of 48 pupil diameters was measu 
h S. art 

sampling procedure for each condition a 
follows: BFC-Sample 1 was taken 10 sec. from eh 
beginning of the condition, Sample 2 was taken el eas) 
imately 2 sec. later, and Sample 3 was taken app ne 
imately 2 sec. after Sample 2. EMC- ~Sample 1 by; 
taken from the first fixation on the number 5 aa 
the 10-sec. point in the condition interval er 4 
completed a cycle of looking at 6, 7, 8, 9, 1, 2, 3, bats 
before the next sample was taken); Sample 2 was a 3 
from the next fixation on the number 5, and Sampie 5 
was taken from the following fixation on Lape: 
after Sample 2. READ—Sample 1 was taken aom, a, 
first fixation in the center of the monitor screen fo À 5 
ing the 10-sec. point in the condition interval, Sane 
was taken when S had finished the line he was rea ie 
for Sample 1 and had again reached the center = 3 
screen while reading the following line, and Samp pes 
was like Sample 2 except the measurement was ied 
from the following line. ULC—Samples 1-3 were tal 
in a manner exactly like those in the READ conato 

The pupil diameter was measured directly from ae 
stop-action image on the monitor screen, The ude 
ment was accomplished by affixing a transparent Fe 
(10 squares per inch) to the monitor screen. ei 
horizontal coordinate which was tangent to the UPPE; 
edge of the pupil was observed and recorded by tto 
likewise, the horizontal coordinate which was tangen 6 
the lower edge of the pupil was also observed ve 
recorded by £, ‘Thus, the pupil diameter was always by 
vertical diameter and this value was calculated ek 
Subtracting the value recorded for the lower edge fro 
the upper edge, i 

f any planned measurement coincided with 

immediately followed 
reversed in slow moti 
taken immediately preced 
l dilation in 
» all measurer 


als from approximately 2S t 
of the pupil in millimeters was tbe 
estimated, since the distance between the eye and 18 
camera varied with the Contour of S’s head; the aaa 
therefore, varied slightly between individuals. Howe¥""? 


. . aie 
by focusing upon circles of known diameter placed r he 
normal position of the 


system was estimated, 


the monitor screen was thus estimated to equal -0 on 


in., or 14 mm., on the actual eye. The magnificatio e 
the monitor was therefo; 


i re approximately 18 times sas 
actual size of the pupil i 


A diameter change of .1 i363 


for most individuals. I the 
impossible to estimate the horizontal tangent to oft? 
Pupil closer than the nearest .1 in, and, ther? of 


- of pupil diameter. vai for 
ee samples had been takrnctod 
the median value was se 


« 
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eee the condition for additional data analyses 
ie = Mee individual reading each passage there was 
the papi ; lameter for each of the four conditions in 
Badin analysis). Since slight changes in the head 
Stated a movement in the apparatus often neces- 
trial ene at the beginning of each passage 

freee recent could not be directly compared 
within n or between passage conditions 
pupil dia ui uals. In order to make these comparisons, 
EMC ee, under each of the three conditions— 

(iae a , and ULC—were adjusted for base-line 
fhe BFC. by subtracting the pupil diameter under 

condition. 

the aoe to the pupil diameter measurements were 
thes ae pa reading time data and the test scores on 
iligeviain sepa These data were analyzed solely to 
was eea a ner the difficulty of the reading passages 
scares: na in reading durations and comprehension 
down wor T times recorded for looking at the upside- 
recorded a X ere not analyzed. These durations were 
to E releve y to provide additional immediate feedback 
ments un m to the similarity between the eye move- 

tiñe der this condition and the eye movements 

ng the reading condition. 


R 
esulls and Discussion 


ma mean reading times for the four passages 
eased with the difficulty level of the passage 
tnd” LOS mg) athe mean une 
Correct oh the e ily and panni 
with the the comprehension tests decrease! 
gpg Me dificulty level (1B, 9.6; 2B, 6.9; 
the difè 2B; 2.4). It therefore appeared that 
Mani iculty variable had been adequately 
Ppulated. 

è pea 3 contains the mean pupil dilation for 
one ey Passages under the two control and 
bin oa conditions. There were only 

hé Si dilations which were greater than 
anq etaa of the measurement, -10 in., 
differen ARPS dilation was only .20 in. These 
size of a are quite small in relation to the 

e dilation, approximately 50 mm., 


re 
Ported by Kahneman and Beatty (1966). 


50-mm. dilation would result in a change 
itor in 


the pProximately .36 in. on the TV mon! 
pr esent study. 

coke point in the data analy’ 
Most of F variable was discovered. Al = 
“enter of e measurements were made 1n z 
ion wa the screen, some were not. A cons a 
the ne discovered between the position © 
That ae on the screen and the pupil se 
Pupil P the screen was distorted so that jea 
aana slightly in diameter as i 
in position on the screen from bottom 


sis, & CON- 
Ithough 
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TABLE 3 


Mean Purim DILATION IN RELATION TO Base LINE 
FOR THREE CONDITIONS AND 
Four READING PASSAGES 


Conditions 
Passage | | 
EMC READ | ULC 
(Control) (Experi- | (Control) 

| mental) | 

| | 
1B a | o | 0 
2B 06 | dt | 08 
3B .01 | Bol | .02 
4B a | 20) a 
: 06 j x | .05 


Note.—N = 8; Mean values are in inches of dilation as 
measured on a TV monitor which had magnified the pupil 


approximately 18 times. 


to top. During the reading of the more dificult 
paragraphs, the individuals had not progressed 
as far down the screen during the first 10 sec, 
of the READ condition as they had during 
the less difficult passages. Thus, these latter 
pupil dilations tend to be larger. 

A correction factor was formulated by using 
a test circle of consistent diameter. When the 
correction factor was applied to the data, 
the differences in Table 2 became less marked. 
These corrected values are presented in Table 
4. Notice that the largest dilation was only 
17 in. and that there appeared to be no 
trend for the dilation to increase as the 
difficulty of the passage increased. Further- 
more, the mean of the mean differences for 
each of the three conditions were all approx- 
imately equal. Thus, there appeared to be 
little evidence to suggest that the pupil dilated 
while the individuals were processing informa- 
tion during the reading of the paragraphs. 

It did not appear fruitful to try to replicate 
the results in Exp. 1 under precisely the same 


experimental conditions. 


EXPERMENT II 


In order to be relatively confident that the 
motive-incentive conditions were not a major 
variable affecting the pupil dilation, the 
instructions in the following study were 
modified in a manner designed to increase 
motivation. Furthermore, the measurement 

rocedures were modified slightly to further 


control for the screen distortion. 
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TABLE 4 


Mean Puri DILATION IN RELATION TO Base Line 
FOR THREE CONDITIONS AND FOUR READING 
PASSAGES AFTER CORRECTION FOR SCREEN 


DISTORTION 
| Conditions 
| | | 
Passage | EMC TAR | ULC 

| c Aperi (Control) 

| (Control) | mental) 
w | æ | g 07 
2B | .09 | 17 | 14 
3B | .05 | .09 | -06 
4B | Al | -16 | .06 
Xx | 0 | 2 .08 


Note.—N = 8; Mean values are in inches of dilation as 
Ean on a TV monitor which had magnified the pupil 
approximately 18 times, 


Method 


The method was exactly the same as in 


Study 1 
except for the following three changes: 


1. The instructions read to each S were changed to 
increase motivation by providing S with knowledge of 
results and by indicating that S’s performance might 
be categorized by E as bad. The instructions were: 
“This experiment is designed to investigate the function 
of the eyes during reading. We are interested in the 
differences in function between good and poor readers, 
You will read a number of short paragraphs and then 
take sentence completion tests on these paragraphs. 
I will not comment on how well you did each time, but 
I will grade each test immediately after you have 
finished so that you will know how may blanks you 
filled in correctly.” 

2. The E marked the incorrect responses in the 
Presence of S immediately after he had finished each 
of the eight tests, 


3. The measuremen 


Results and Discussion 


Mean reading times for th 
again increased with the di 
the passage (.75, .83, .95, an 


© four passages 
ficulty level of 
d 1.33 min.) and 
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mean number correct on the comprehension 
tests again decreased with the difficulty level 
(8.8, 6.6, 4.0, and 2.2). Although the mean 
reading times were higher in Study 2 than in 
Study 1, the mean comprehension scores were 
lower. Thus, it appears that the group in 
Study 2 was probably slightly less able, 
rather than more motivated. These data 
suggest that Ss were probably doing their 
best in both Study 1 and Study 2. ol 

Table 5 contains the mean pupil dilations 
for the four passages under the two control and 
one experimental conditions. Notice that no 
mean dilation is greater than the accuracy © 
measurement, .10 in., and the average dilation 
over all four passages was equal, .05 in., for 
all three conditions. i 

These data further confirm the results © 
Exp. I and provide no evidence that the 
pupil dilates when individuals are process 
ing the information contained in connecte 
discourse. M. 

It did not appear to be fruitful to replic# 
these results using the same procedures. 


EXPERIMENT ITI 


The major purpose of Exp. III was to furthen 
extend the results of Exp. I and II u” 

slightly modified procedures, In this study)“. 
were not allowed to read at a rate of the! 
own choosing, but were given a short sat 
amount of time to spend reading, This P" y 
cedure was used to produce a consistent 6 
high level of information processing Joad 


s 
Conditions 
| 
Passage | Me READ | ULE» 
(Control) (Experi- (Con 
mental) | r 
1B 04 05 0 
2B ‘00 ‘02 oh 
3B 08 ‘09 a 
3 0 05 05 
x 05 05 : A 
a 
Note.—N = 8; Mean values are inches of aiai 4 
9] in inc 
proximately 18 times nitor which magnified the P 


| 


C'S 
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al levels of passage difficulty. That is, S would 
n expected to try to process as much informa- 
lon as he could during the time interval, thus 
keeping the load at a maximum during the 
entire 20 sec. 


Method 


Pha method was exactly like that in Study 2, with 
cere as that Ss were given only 20 sec. to read 
= and 20 sec. to look at the upside-down 


Results and Discussion 


Pe era j number correct again decreased 
53 os difculty level of the passages (7.5, 
Tombar. and 1.2), and, in each of the four 
of ETEA the scores were lower than those 
Sond i 1 and 2. Although it was never 
infor r questioned that Ss were processing 
mation while they were looking at the 
Words, it was possible that the items could 
es been answered correctly without reading. 
Sa the lower mean scores in Exp. ul provide 
tion oe that Ss were processing informa- 
und While they were fixating upon the words 
der the reading conditions. 
ae 6 contains the mean pupil dilation in 
thd on to base line for the three passages 
` a the two control and one experimental 
oe Although the mean of the four 
con =e aer dilations under the reading 
accura AS 15 in. was greater than the 
Sse ot measurement, -10 in., it was 
four a y equal to the mean dilation for all 
eS under the ULC condition, -14 ìn. 
T the results of Studies 1-3 do not 
the fs evidence which would suggest that 
is ass ormation being processed while reading 
Possi ociated with a greater pupil size. It is 
= that these results cannot be generalized 
ea mode of processing the information 
it ign in connected discourse. Furthermore, 
legate that some other variable is 
e with eye movements which con- 
è ated the data in these studies, such as 
an procedures. Therefore, it 
of aie desirable to investigate the effect 
anoth ening upon pupil dilation. Listening is 
Cont, er mode of processing the information 
as i in connected discourse. — 
tobe € advantage of allowing pupil dilation 
measured during the information-process- 


TABLE 6 


Mean Puri, DILATION IN RELATION To Base LINE 
FOR THREE CONDITIONS AND 
Four READING PASSAGES 


Conditions 
VARSAEE! EMC READ ULC 
(Control) | (Esperi | (Control) 
mental) 

1B os | a 15 

2B u | #2 20 

3B —o1 | .12 12 

4B ‘04 06 “10 

Xx 05 15 14 
Note.—N = 8; Mean values are in inches of dilation as 


measured on a TV monitor which magnified the pupil ap- 
proximately 18 times. 


ing activity without accompanying eye move- 
ments, as in reading. 


EXPERIMENT IV 


The purpose of the following study was to 
investigate the effect of listening upon pupil 
dilation to determine if the results of Studies 
1, 2, and 3 could be generalized beyond the 
reading mode of processing the information 


contained in connected discourse. 


Method 

The method was exactly the same as that in Exp. IL 
with the following exceptions: 

1. There were 20 Ss instead of 8. 

2, The eight passages were tape recorded by a 
professional announcer and S simply fixated upon the 
5 on the number card while he listened to the passages 
as they were being played. Thus, the previous READ 
condition was replaced by @ listening condition. 

3. The passage order was always the same for each 


S—1A, 2A, 3A, 4A, 1B, 2B, 3B, and 4B. 
nly one control condition for each 


4, There was 0) t & 
the 20-sec. base-line condition, 


passage and that was „l 
BFC. The other two control conditions were necessary 


for reading only. 


5, There were five samples taken from each of the two 


conditions—BF'C and listening. For each condition a 
measurement was taken at the instant that the eyelid 
opened to a position above the pupil, and at S-sec. 
intervals thereafter to 20 sec. (ie. 0, 5, 10, 15, and 
20-sec. points). The 0-sec. point was measured to 
provide an indication of the precision of the measure- 
ment procedures. 

6. The dependent variable analyzed was the pupil 
diameter during the listening sample minus the diam- 
eter during the corresponding control sample. For 
example, the pupil dilation at the 5-sec. point for the 
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TABLE 7 


Mean Pur DILATION IN RELATION TO BASE LINE 
FOR THREE SAMPLING POINTS AND 
FOUR LISTENING PASSAGES 


Sample points (in sec.) 


Ex .05 | .02 


P 

Passage 3 “ | Be | p | 
iB | oœ | o7 | 00 | o | 03 
2B | wiz 04 | 01 | 01 04 
3B | 04 | 00 | 00 | .00 01 
4B | 04 —.04 | 01 06" | .02 
< | ‘00 | ‘02 


te.—N = 20. 2 
Ae significant beyond the .05 level, 


control condition was subtracted from the diameter 
at the 5-sec. point for the listening condition. This 
measure of dilation was calculated for each of the 
four sample points (5, 10, 15, 20) and the four passages 
(1B, 2B, 3B, and 4B). 


Results and Discussion 


Mean number correct for the four passages 
were approximately equal to the means in 
Studies 1 and 2 (7.8, 6.8, 4.6, and 2.6). 

Table 7 contains the mean pupil dilations 
in relation to the base-line control (BFC) for 
four sampling positions and the four passages. 
Notice that only two of the 24 ¢ tests for 
dilation were statistically significant. These 
two statistically significant differences were 
probably chance occurrences, since neither 
the marginal means for sampling points or 
levels of difficulty suggest a trend. All the 
marginals are close to zero, and none are above 


the accuracy of measurement, .10 in. However, 


in order to be sure that the size of the mean 
dilations were not large in relation to their 
variances, an analysis of variance was con- 
ducted. An ABS design (see Lindquist, 1956) 
was applied with the A variable representing 
the four sample points and the B variable 
representing the four Passages varying in 
difficulty level. Neither the F representing 
the main effects of A and B, nor the F rep- 
resenting their interaction approached statis- 
tical significance at the .05 level. 

Thus, it appears that there is no evidence 
which would suggest that the pupil dilates 
while processing the information contained in 
aurally presented connected discourse, 
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In order to demonstrate that the procedures, 
techniques, and equipment were probably 
sufficiently sensitive to dilation had it occured, 
the mean pupil diameter was plotted (see 
Figure 1) as a function of the sampling point 
for both the experimental and control condi- 
tions. These values represent an averaging 
over all four of the listening passages. Notice 
that there was a large drop in pupil diameter 
from the 0 point to the other four sampling 
points. Figure 1 graphically depicts 2 the 
potential amount of dilation and the precision 
of measurement procedures as well as the 
essential equality of pupil size in the control 
and listening conditions. F 

In order to estimate the actual size of pupil! 
dilation that these procedures could have 
detected as statistically significant, further 
calculations were made. For the data collected 
under the 5-sec. sample point and the 4 
paragraph, the standard deviations unde" 
the listening (48 in.) and control (.57 m. 
conditions, together with the correlatio? 
between these two conditions (r=.96), were 
used to estimate the mean difference thas 
would have been statistically significant F 
the .05 level. This mean difference transformé 
into actual pupil size was 09 mm. That I$ 


4.2 


*—* CONTROL 
+---+ LISTENING 


3.8 


3.6 


3.4 


MAGNIFIED PUPIL DIAMETER (INCHES) 


o 5 


10 15 
ELAPSED TIME (SECONDS) 
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it was estimated that a pupil dilation of 
around 09 mm. would be detected as statisti- 
cally significant in Exp. IV. 


Discussion 


oe Studies 1, 2, 3, and 4 suggest 
or wke ere does not dilate while reading 
That is S is listening to connected discourse. 
b at is, it does not appear that pupil size can 
cn bie as an objective indication of whether 
‘tee ee individual is, in fact, processing the 
Calle oe contained in connected discourse 
ing i ge engaged in reading oF listen- 
Der sa ivities. What appeared at the outset to 
investi potential means of controlling and 
in interact the moment-to-moment changes 
reading mation processing and storage during 
ical spe listening has not provided empir- 
Th ata which substantiated this potential. 
oe data collected were not designed to 
the saad pi evidence with regard to 
inform Sa of pupil dilation as a measure of 
data z lon-processing load. However, these 
It ain indirectly relevant to this concept. 
Were pears certain that Ss in the four studies 
Het rate Ducat canoe during the exper- 
reasonal — Furthermore, 1t seems 
Processin e to infer that the information- 
and tee load was greater under the reading 
fontol. ene conditions than under the 
the fai] conditions, Thus, an explanation for 
in ie to find dilation would seem to be 
er, 
ae e Planation for these results could be 
minimal information-processing „load iA as 
Not preci and the research techniques ae 
Small ais enough to detect the corresponding 
ausibl ilation. This explanation 15 hig y 
urin i It seems reasonable that the loa 
Tea S he processing of information during 
to Ta listening may be small in rl 
number ag during the storage of a a rs 
Since th (see Kahneman & Beatty» -ted 
ere, the conduct of the research reportes 
ents |} Tesults of other pupil-dilation exp “ 
ahne: have become available, for examp. a 
dilations > & Peavler, 1969, who report aon 
around of very low orders of magnitu 
06 mm.). Kahneman and Peavler have 
a or in their 
Toshi is, anxiety or emotion. It 5 quite 
€ that a more adequate theoretical ex- 
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planation for pupillary dilation is not level of 
information-processing load, but the anxiety 
that many times accompanies the higher levels 
of information processing. Kahneman, Peavler, 
and Onuska (1968) report dilations up to 30 
and .40 mm. for college students during the 
storage of a four-digit number. Surely, the 
degree of information processing involved in 
this task would not be greater than the reading 
and listening activities involved in Exp. HIV. 
However, it is conceivable that the anxiety 
level when storing the four-digit numbers was 
higher than that in the present studies, since 
it would be embarrassing to fail to remember 
a four-digit number or to fail to report the 
numbers on cue. 

Kahneman and Peavler (1969) have inter- 
preted the results of recent pupil-dilation 
research as rendering its validity practically 
unassailable when used as a measure of mental 
effort or processing load. Yet, there appears 
to be no research reported which was explicitly 
designed to separate anxiety or emotion effects 
from information-processing effects. This situa- 
tion, taken together with the present failure 
to find a dilation during reading and listening 
which was as large as the dilation that accom- 
panies the processing of a four-digit number, 
seems to make it reasonable to withhold 
judgment concerning validity. It appears that 
the alternative hypothesis—that the anxiety 

ich often accompanies higher 


or emotion W comp 4 
„mation processing 1S responsible 


Jevels of infor I nsib! 
for pupil dilation—still deserves attention 1n 


future research. i f 
Another problem involved in the comparison 


of these results with previous research data is 
the use of the terms, information processing, 
processing load, mental effort, anxiety, and 
emotion. It is practically impossible to opera- 


tionally discriminate among these terms, and 
y „iminations are also difficult. 


conceptual discr | : 
Jt seems that a ion for the 
failure to find dil 


cise in t e 
to pe vation-processing load and procesa 
Joad. That is, mental effort may hs in : 
changeable with processing load as Kahneman 
and Peavler (1969) have suggested, but 
snformation-processing load may not be inter- 
changeable with these two concepts as the 
discussion of Kahneman and Beatty (1967) 
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would suggest. As an example of this distinc- 
tion, it seems reasonable that experimental 
conditions may make the level of processing 
load (or mental effort, or anxiety, or emotion) 
higher during the storing of a four-digit 
number as compared to the level during 
reading and listening. However, it does not 
seem reasonable that the information-process- 
ing load during the storage of a four-digit 
number would be higher than that during 
reading and listening. Before it is possible to 
be confident about the validity of pupil 
dilation as an index of a theoretical construct, 
considerable Progress needs to be made in 
both the conceptual and operational definition 
of terms. 

In the interpretation of the results of Exp. 
LAV, it is impossible to know whether small 
dilations accompanied the information-process- 
ing activities or whether none existed. In 
either event, however, it seems impractical, 
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VOCATIONAL INTERESTS OF POLITICAL SCIENTISTS? 
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Strong Vocational Interest Blanks were collected from 177 male members of the 
Midwest Conference of Political Scientists. Their interests were most like those of 
psychiatrists and psychologists, public administrators and social workers, librarians 
and musicians, and lawyers and author-journalists. When compared with sociolo- 
gists, political scientists had interests less like those of applied biological scientists, 
physical scientists, outdoors and skilled worker occupations, administrators, and 
businessmen. Similar differences were found between psychologists and political 


scientists. 


fia increasing percentages of undergraduate 
t Scie enroll in graduate schools, more of 
Majo may be uncertain about their choice of 
to sb Much of this uncertainty will be due 
fields, difficulty of choice between similar 
Certaj For example, a student may be quite 
a that he wants to do further study in 
oo sciences, but, within that general 
areas may have trouble deciding between 
Sie as sociology, psychology, political 
vocato or anthropology. Can a measure of 
tional fo, interests such as the Strong Voca- 
Such st nterest Blank (SVIB) be of help to 
Students? 
k T SVIB has demonstrated its usefulness 
tion L me among such varied occupa- 
al fields as the biological sciences and 
eal sciences, between art and business, 
Teas T social service and verbal-linguistic 
More di ut finer differentiations have be 
tom s ficult to make. SVIB data collecte 
bell io ciologists (Rossmann, Lips, & Cani 
Statisti ) indicated that while a number © 
etwe leally significant differences were ione 
Sists en profiles of sociologists and psycholo 
iS ir basic patterns were Very similar 
tu p resembled those of a group of a 
teach} S headed toward careers in coues 
ing (Rossmann & Bentley, 1969). 
ihe Some better scheme for differentiating 
discipli Soa interests of people in the ren 
“tae should be attempted. Kuder 
Outgoing article was accepted for publication by the 
Re 8 Editor, Dr. Kenneth E. Clark. i 
Rossman ests for reprints should be sent to Ee 
College SC Office of Educational Research, Maca 
> St. Paul, Minnesota 55101. 
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and Campbell (1966) have expressed concerns 
about the problem of building scales that 
discriminate between occupations while retain- 
ing occupational similarities. Campbell sug- 
gests that since the SVIB is generally used to 
indicate a general area of interest to the client, 
rather than a specific occupation, attempts 
at sharper occupational differentiation may 
not prove to be particularly fruitful. Kuder, 
on the other hand, feels that sharpness in 
differentiation may be lost by the exclusive 
use of the men-in-general technique. 

With the latter point in mind, we have 
extended the earlier sociologist research by 
collecting data from other groups of behavioral 
scientists. The present study presents data 
from political scientists and focuses both on 
their interests and the extent to which these 
interests can be differentiated from two other 
groups of behavioral scientists—sociologists 


and psychologists. 


METHOD 


brief questionnaire were mailed to 400 
ame Midwest Conference of Political 
jentists i ry 1967; completed forms were 
ame a es (61%) "These were screened to 
ho had less than 3 yr. experience 
hout PhDs (N = 23), or those who 
didn’t enjoy their work (N = 3). 
as done because earlier experience 
i tions had suggested that some mini- 
ba pone anys success and satisfaction should be 
al to increase the homogeneity of the sample. The 
final sample included 177 political scientists who can 
S described as PhDs with at least 3 yr. of experience 

they enjoy their work. ‘ i 
eae a describing the sample are included in 
Table 1. While the respondents were located in 24 
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TABLE 1 
Backcrounp Data on 177 PHD 
POLITICAL SCIENTISTS 
Data | N | % 
| 
Co ay 10 | 5.7 
31-40 56 31.6 
41-50 64 36.2 
51-60 31 17.5 
61 and over 16 9.0 
Date of degree 
Prior to 1946 | 36 20.3 
1946-1955 | 58 32.8 
1956-1965 g 83 46.9 
Employing institution saci 
University | 133 75.1 
Liberal arts college | 30 | 17.0 
State college | 12 | 68 
Nonacademic | 2 it 
Major job responsibility $ | 
Teaching 153 86.4 
Administration 18 10.2 
Research 4 2.3 
Nonacademic 2 | 1.1 


states ranging from New York to California and Texas 
to Minnesota, 83% were from the Midwest. In general, 
they were predominantly members of a teaching 


faculty in a university setting and had a median age 
of 43, 


ResuLTS 


Psychologists, public 
workers, librarians a 


and author-journalists, The interests of politi- 
cal scientists were least like those o 


i f physical 

Scientists, members of the skille and semi- 

professional occupations, and businessmen 
The responses i 


responses of Str 


were found on 103 of 
eds by 29 the analogous 
ng items found in the 
ann et al., 1967), There 
€ interests of Political 
entiated from those of 


number of differentiati 
sociologist data (Rossm 
is little doubt that th 
scientists can be differ 
men in general. 
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Basic Scales 


What does it mean conceptually for mem- 
bers of an occupation to have the kind of 
interest pattern described above? Campbell 
et al. (1968) have attempted to clarify inter- 
pretations of the occupational scales by de- 
veloping a set of basic interest scales, including 
in each, items which correlate relatively highly 
with each other. Norms for these basic scales 
were developed by scoring one of Strong’s 
early groups (Strong, 1931) which had been 
retested in 1966-1967 (as adults). 

When political scientists are scored on these 
basic scales, the four highest scales seem tO 
fit quite well the interests that might be ex- 
pected for a political science professor—an 
interest in teaching, speaking, writing, and the 
subject matter of law and politics. Scales 
falling at the other end of the continuum 
(mechanical, agriculture, sales, merchandising 
technical supervision, and business manage 
ment) support the previously mentioned ies 
jection of business and technical occupations: 

Inspection of individual items lends furthe™ 
Support to these findings. Eighty percent 07 
more of the respondents said they would lik 
to be a college professor, an author of a note 
an editor, a foreign correspondent, and i 
governor of a state, Equally strong was the! 
dislike for any occupation which involve 
selling (e.g, auto salesman, life insurance 
salesman, or real estate salesman), 


Scientists be diff 
behavioral scientists? 


Psychologists (Kriedt, 1 pp 
Sociologists (Ross 
Scales scored, difieren d 
Points (one-half stan e 


un 
me magnitude were fo ist 
ientists and psycholog 


—_ 


oe 


VOCATIONAL INTERESTS OF POLITICAL SCIENTISTS 


ges On 20 scales, differences of five 
eee ‘ore points or more were found 
etween political scientists and both psycholo- 

gists and sociologists. 
ite Lg gine seem to suggest that as 
have pee sa sociologists, political scientists 
a TE tage ees Sele ate applied bio- 
and einen physical | scientists, outdoors 
and bu ed worker Ocenpalons; administrators, 
tie Se men. These same differences hold 
= nen contrasting political scientists and 
ot clits but in addition, when compared 
titer hologists, political scientists have 
aithor, mep like those of librarians and 

e ee , , 

en eS E meaning is given to these differ- 
scientist ien these three groups of behavioral 
Scales Or are compared on the SVIB basic 
Politie 1 the (22 scales, differences between 
More ie Scientists and psychologists of 5 or 
Scales ian score points were found on 8 
‘ee, only one difference of this mag- 
Political vs found between sociologists and 
higher se scientists, Political scientists had 
cores than psychologists on three scales 


Publi i 7 sgi 
Se speaking, writing, and law/politics: 
© psychologists’ scores were higher on 


Sci 
tech- 


en z R p 
ce, mathematics, medical service, 
the 


nicą ini 

Hee Ce and mechanical. Thus, 

Verbal lin political scientists as being more 
/linguistic in orientation but considera- 

e Re A | s 

gist Ss scientific/mechanical than psycholo- 

S emerges, 

a < ri aye 

ank order correlation (rho) between politi- 

while 

ogists 


Cal sci A 
the citntists and psychologists was -76 


ithe eee holo: j 
Was .96 for political scientists and sociol 
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Discussion 


From the preceding data, it is apparent that 
sharp differences do exist between the interests 
of political scientists and those of men in 
general. The interests of political scientists 
also seem to differ somewhat from those of 
psychologists and sociologists (particularly the 
former), but an examination of the compara- 
tive mean profiles points up the difficulty of 
translating these differences into practical use- 
fulness for a counselor or individual student. 
If differences which can be helpful to a pro- 
spective graduate student are to be found, the 
development of a behavioral science men-in- 
general group may be necessary. 
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EFFECTS OF TEMPORAL PLACEMENT OF UNFAVORABLE 
INFORMATION ON DECISION MAKING DURING 
THE SELECTION INTERVIEW 


ROGER N. BLAKENEY? AND JOHN F. MacNAUGHTON 


University of Houston 


One hundred and twenty-six evening school management students rated an 
applicant for employment after listening to standardized audio tapes of a 


simulated interview in which the location and content of one piece of unfavor- 


For years, the inadequacies of the decision unfavorable decision (reject) should tend f 
made during the selection interview have been occur. However, if both occur at the sre 
discussed. However, only in recent years has time, then the effects should cancel each othe 
research into the decision-making Process and a marginal decision should result. d 
characteristic of this type of interview begun. Bolster and Springbett (1961) develope” 

Springbett ( 1958) varied the order of pres- interview protocols using favorable and W | 
entation of three kinds of information in the favorable items of information, Interviewe"s 
selection interview. He found that the first were given a test score to establish a favo” 
rating, whether based on the application form, able, neutral, or unfavorable set toward the 
Personal appearance of the candidate, or the applicant. Then they were asked to read E 
interview was significantly related to the final first item of information on the protocol a” | 
tating. Webster (1964) applied the “gating” rate the applicant after which they read tbe 
Phenomenon (Bruner, 1957) to provide one second item and rated again and so on Ther i 
possible description of the perceptual process results confirmed the lane fa primacy 
Mm an interview. Bruner uses the concept of effect, p ji 
gating to refer to the latter stages of decision so intel 
making during which an observer’s openness to Mrak a s Srei that forcing “had 
additional 8S, as Springbett (1958) ba? 
ae r = = oie reduced. Webster may affect the fin decision, She found A 

oncluded that if favorable data come early in i j t 
the interview and unfavorable data come i om a a ka Significantly changed bY ore 
then there is a tendenc: ee ee 


the final decision 
the content of t 


tant 
around the Conformation of the earl favorabl E oi 
. e i 
impression. The unfavorable data akoak th Wiener (1969) foung dogmatism, t ri 
a en sured by ee 
tend to be disregarded, 


Dogmatism scale oe jp 
ajor source of variabili joo" 
ms of interview informat! 


interviewers who obtained il 
be sent to Roger y, SCOres on the Dogmatism scale gave SP ter 
avioral Management Cantly less favorable ratings than did ' gje 
n, Cullen Boulevard, viewers whose sı 


were low, 
138 


true; if unfavorable data i í 
thi 
favorable data late in the i i ster: J ate 


1 Requests for Teprints should 
Blakeney, Department of Beh 
Science, University of Housto 
Houston, Texas 77004, 


a 


UNFAVORABLE INFORMATION DURING THE SELECTION INTERVIEW 


ae Primary purpose of the present study 
Mec. gece the effect of certain varia- 
Bogan, their interrelationships—on the 
view m maling process in the selection inter- 
Bitcem € major focus was on the temporal 
sede of unfavorable information in the 
a eo A secondary focus was on the con- 
i is or subject matter (work history, 
hich iy background, personal-social ) in 
bedded e unfavorable information was em- 
af th d. A third concern was the dogmatism 

e rater as measured by the Dogmatism 

) scale, = 


Subjects METHOD 


eo Were 126 evening students in Behavioral 
Ness Ad ent Science classes of the College of Busi- 
ranged Munistration, University of Houston. They 
students, T classification from juniors to graduate 
50 with The Ss were volunteers, male, and aged 21- 
of fup è median age of 25. They had at least 2 yr. 
experienc e JOD experience. None of the Ss had had 
nce as a personnel interviewer. 


Taped 7 nterview 


reconigstimmulus to which Ss responded was a tape- 
iat interview in which the temporal placement 

Was Fesai of a piece of unfavorable information 
Such int, ematically varied. The design resulted in 18 
rief eens Each recording began with the same 
establish roduction (approximately 2 min. long) to 
View wa a common set. The remainder of the inter- 
in Jen th, divided into three segments each 6-6} min. 
tent Sth. Each segment dealt with a different con- 
Sia „Work history, academic history, and 

fol a Social background. These segments were 
Y a common closing remark by the inter- 
he ee in a real interview, would have led 
e selling of the company. All of the inter- 
views, ich were simulated campus recruitment in- 
A used the same interviewee and interviewer. 

e AS, in fact, identical except for the order of 
the unf. © major content areas and the substitution of 
Since tha orable for the favorable bits of information, 
Same p> Were edited or pieced together from the 
master tape, 


Thre 
from © Pieces of unfavorable information, 


Views, 
tery;, 


one 


142 Sach content area, were used. These were all 
tion deat long. The unfavorable pieces of informa- 
alt with the following topics: (4) work his- 


tory 


acade V2? not liking supervisory responsibility ; b) 


tivities € history (A), too many extra-curricular ac- 
Socia} p Sulting in low grades; and (c) personal- 
Sa S), inability to get along with people. m 
unfavorable 18 interviews, one of these Lage = 
able j e informati a piece of favor- 

e ion replaced pi S dealing 


~~ a P 
With a Tmation of comparable length an 
The —_ Same content. 


e ` 
Order in which the three segments occurred 
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was varied systematically to produce all six possible 
orders, which are as follows: 


A wW PS PS A w 
wW A A w s W 
PS PS WwW A wW A 


Then, for each six orders of the three content 
areas, one of the unfavorable pieces of information 
was placed in one of three possible temporal loca- 
tions. This produced the 18 counterbalanced place- 
ments (u = unfavorable, f = favorable) : 


Au Af Af Wu Wi We 
Wr Wu WE Af Au Af 
PSu PSf PSÍ PSu 


PSf PSf 

PSu PSf PSf PSu PSf PSt 
Af Au Af Wi Wu wi 
Wt Wf Wu Af Af Au 
Au Af Af Wu Wi wi 
Pst PSu PSf PSf PSu PSf 
wf wi Wu Af Af Au 


Thus, only one item of unfavorable information 
occurred during each interview, replacing comparable 
favorable information. The interviews were, then, 
identical except that each contained one piece of 
unfavorable information, the placement of which 
was varied in counterbalanced order by content 


area and temporal placement. 


Procedure 

The 126 Ss were randomly arranged into 18 sets 
of 7 each. The number of Ss listening to a given tape 
at one time varied from two to four. One of the 18 
as played for each set of Ss. They were in- 
n to the tape in order to evaluate the 
an applicant for a “general manage- 
sition in a large industrial corpora- 
a rating form on which 


tapes w: 
structed to liste! 
interviewee as 
ment trainee po: 


tion.” The Ss were given a m 
nt on six characteristics and on 


ted the applica: tics 
R e aMi, The six, characteristics—each 
defined—were ambition, creativity, perseverance, de- 
cisiveness, social adjustment, and maturity. However, 
only the overall rating was used in the analysis. 
The Ss were also given the D scale either before 
or shortly after they made the ratings. 


Hypotheses 


The hypotheses of this investigation were as fol- 


lows: 
1. The earlier the unfavorable information comes 


in the interview, the less favorable the ratings of 
the interviewee will be; more specifically: 

(a) When the unfavorable information comes during 
the first third of the interview, the ratings of the 
applicant will be the lowest; (b) when the unfavor- 
able information comes during the second third, the 
ratings will be intermediate; (c) when the unfavor- 
able information comes during the final third, the 
ratings will be the highest. 


2, The ratings will be lowest where the unfavorable 
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TABLE 1 


Mean RATINGS OF TAPED INTERVIEWS 


ARRANGED BY TEMPORAL PLACEMENT 


OF UNFAVORABLE INFORMATION X CONTENT AREA ORDER 


Content area order TE. 
Temporal Row 2 
pement A PS W AWPS | PSAW | pswa WPSA | WAPS 
| | T: 2 Tape 17 | Tape 18 
i i Tape 5 Tape6 | Tape 11 Tape 12 Į Į ER 
ae 337 3.14 3.43 3.43 3.14 3.00 3.28* 
| 
r im i E a , r Tanchk Tape 14 
third Tape3 | Tape2 | Tape 9 Tape 8 Tape 15 Tape 1 sik 
pm 3.29 | 2.86 | 337 4.14 3.29 | 3.29 3.40 
inal third Tape 1 Tape 4 Tape 7 Tape 10 Tape 13 | Tape 16 ” 
re 4.43 3.43 | 4.14 3.86 4l | 42 4.05 
| 
Column X 3.76 315 | an 3.81 352 | 3152 3.58 
Note.—Rating scale of 1-7. Content area abbreviations: A = academic history, PS = personal-social, W = work history. 
a t = .223 for comparison between mean for first third and mean for 


information is embedded in the social-personal con- 
tent area and comes during the first third of the 
interview (on the assumption that the social-personal 
area was subject to more interviewer bias). 

3. There will be a negative correlation between 
D-scale scores and the degree of favorability of the 
interviewer's ratings. 

4. For the high-D Ss, the effect of the temporal 


location of the unfavorable information will be 
stronger than for low-D Ss. 


RESULTS 


The mean ratings of each tape arranged by 


the interaction of Temporal Placement X Con- 
tent Area Order are presented in Table 1, An 


analysis of variance (Winer, 1962) was per- 
formed on these 


TABLE 2 


SUMMARY oF ANALYSES oF VARIAN! 


CE: TEMPORAL 
PLACEMENT X CONTENT Arg, 


A ORDER 

Source of variance | SS df | MS | F 
Temporal placement (A) | 14.11 | 3 7.50 | 6.66" 
Content area order (B) 7.58 5 | 1.52 1.43 
AXB 6.27 | 10] 363| ‘59 
Within cell | 114.00 | 108 1.06 

Total 130.96 | 125 

*p <ð 


second third. 


the hypothesis. Neither the content area order 
nor the interaction of Placement X Order he 
Statistically significant, Therefore, the ori 
Were rearranged into Table 3 which presen 
the interaction of Temporal Placement 
Content Area. A second analysis of variance, 
Presented in Table 4, was performed sc 
temporal placement and content area as mea 
effects. Again, the temporal placement Wí 
the only significant effect. t 
Since the content area and order were n 
significant, they were not used in the ae 
analysis. The temporal placement and e- 
scale data were analyzed using multiple * 


f ct. 
scale as an interaction eien 
The D scale was also to haye been added 
as a linear effect, 


F he 
The first multiple regression used t 


h " cer 
dummy variable coding of the temporal pla” 
ment as the inde: 


pendent variable and the e 
ing as the dependent variable. The F = t 
from Table 5, was 4.08 which is significan ffi- 
the .01 level, The multiple-correlation ¢° 4 
Clent was .25 which Meant that the tem Pr gi 

nted for 6.25% of the Ye, 
: ngs of the unfavorable i? to 
mation. The multiple regression which weet! 
have added the D scale as a linear ® -p 


Proved unnecessary, The correlation bet™’ 
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TABLE 3 


MEW Rine oF TAPE 
fran RATING OF TAPED INTERVIEWS ARRANGED BY TEMPORAL PLACEMENT OF 
UNFAVORABLE INFORMATION X CONTENT AREA 


Temporal Content area 
placement | Xx 
f 7 Row X 
a | Academic | Personal-Social Work history 
a | if 
First thi | mi a j 
hird Tape 5 | Tape 6 Tape 11 | Tape12 | Tape 17 Tape 18 
3.57 aks 3.14 3.43 3.43 3.14 3.00 3.288 
See s z 35 3.43 | 3.07 
ond third Tape9 | Tape 14 | Tape3 | Tape 15 Tape 2 Tape 8 
| 3.57 | 3.29 3.29 | 3.29 2.86 | 4.14 3.40" 
| 3.43 | 3.29 3.50 
Final thi i | 
third Tape 10 | Tape 13 | Tape4 Tape 16 Tape 1 Tape 7 
3.86 | 4.14 | 3.43 4.29 4.43 4.14 4.05 
4.00 | 3.57 4.29 
Colu r | | 
S Nh X | 3.59 3.52 3.62 3.58 
i = 
pte Rating scale of 1-7. 


‘g 
=°.223 9 f 
for comparison between mean for 


th A 
fore. an and the D scale was —.01. There- 
effect in ould not have contributed as linear 
dummy hen ‘multiple-regression analysis using 
Mth ‘ariables. 
Pendent. irae multiple regression, the inde- 
Ment of egies were the temporal place- 
Scale a i nfavorable information and the 
fect. The e rater coded as an interaction 
Pendent ey again constituted the de- 
4 amg The results are reported in 
produc a only the first multiple regres- 
eee Ea results significantly different 
» the analysis was terminated. 


Sion 
Tom 


Discussion 


Theias 

m n 5 

e elie hypothesis in this study was that 

curred ag the unfavorable information oc- 
the interview, the more unfavorable 


© fat 
tings would be. This hypothesis was 


Stin TABLE 4 

MARY 

~ ee ANALYSIS OF VARIANCE TEMPO 
—__ SEACEMENT X CONTENT AREA 


RAL 


Sour, = 
Te ce of variance SS a | ms | F 
m d 
Con; Poral Placement (A 706 | 6.72* 
) | 14.11 2 | 7.06 |6- 


WX B aea (B) 


first third and for second third. 


generally verified with the exception of Propo- 
sition 5. With the unfavorable information 
occurring in the second third of the interview, 
the ratings were higher than when the un- 
favorable information came during the first, 
however, only slightly higher and clearly not 
significantly higher. 

According to Hypothesis 2, the ratings 
should be the lowest (most favorable) when 
e information was imbedded in 
d came during the 
This hypothesis 


the unfavorabl 
the social-personal area an 
first third of the interview. 
was not supported. 
Based on Hypot 
tion between the D 
and their ratings was eX 
sis was not supported. 


hesis 3, a negative correla- 
-scale score of the raters 
pected. This hypothe- 


TABLE 5 


SuaMary OF ANALYSIS or VARIANCE FOR MULTIPLE 
LINEAR REGRESSION WITH TEMPORAL PLACEMENT 
CODES AS INDEPENDENT VARIABLES AND 
APPLICANT RATINGS AS THE 
DEPENDENT VARIABLES 


ss | af | MS| F 
Due to regression ! 8.87 2 aos 4.08* 
Deviation about regression 113.83 | 123 | 1. 

Total 142.70 | 125 


Source of variation 
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TABLE 6 


SUMMARY OF ANALYSIS oF VARIANCE FOR MULTIPLE 
LINEAR REGRESSION WITH TEMPORAL PLACEMENT 
AND D SCALE AS AN IN TION EFFECT 
AND APPLICANT RATINGS AS THE 
DEPENDENT VARIABLE 


Source of variation SS df | MS| F 
Due to regression | 9.66 | 5 | 1.93 | 1.74 
Deviation about regression | 133.05 | 120 | 1.11 


Total | 142.71 | 125 


Note.—The independent variables were temporal placement 
and the D scale as an interaction effect, 


According to Hypothesis 4, the effect of 
temporal location of the unfavorable informa- 
tion would be stronger for high-D Ss, This 
hypothesis also was not supported. 

The primary conclusions for this study 
would appear to be: (a) Temporal placement 
of the unfavorable information is an impor- 
tant effect, confirming the conclusion of Bol- 
ster and Springbett (1961), (b) Temporal 
placement accounts for only a small percent- 
age of the variance in the ratings. (c) The 


first portion, Suggests a need f 
of the applicability of B 
in a decision sequence 


or further study 
runer’s (1957) stages 
to the selection j 


A nter- 
view. 

The findings of this study are in agreement 
as well, with Bolster a z 


nd Springbett’s (1961) 
conclusion that there was a modified 


5 primacy 
effect, even though their study involved rat- 
ings based on written information, If the 


rimacy effect operated in the usual man 
pi y ner, 
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and in accordance with Bruner’s (1957) 
stages, all three propositions of Hypothesis 1 
should have been verified. It would be piste 
teresting to know if this is an effect of having 
used taped interviews instead of written in- 
formation. It might also be a function of not 
having used intermediate ratings as Crowell’s 
(1961) study suggests, } 
The rejection of Hypothesis 3 is not in 
keeping with Wheeler’s (1969) findings of 4 
negative correlation between the D-scale score 
of the raters and their ratings of specific pieces 
of interview data, This could have resulted 
from the fact that Wheeler’s (1969) Ss were 
older and more experienced than those of the 
Present study. There is a need for additional 
research to unravel the relationship betwee" 
these variables, 
The time required to develop tapes such as 
those used in this study is one of the major 
limitations of the use of audio tapes. How- 
ever, the use of recordings of simulated selec- 
tion interviews Provides a way to contro 
many variables, such as content, voices, 4” 


. in- 
so on, that cannot be controlled in a real 1 
terview, 
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NON PROFESSIONA 


ALAN W. LAU? axp NO 


tories used for professional occupations, 
Inventory scores received by Navy enli 


reliability of individual scores, parallelin 
Vocational Interest Blank. Comparisons 


age to the measurement of in- 
tions, mer hong men in professional occupa- 
been” vi a in nonprofessional occupations have 
Ported a ually neglected. Clark (1961) re- 
E sone effort in this area when he 
recruit cl an interest inventory for Navy 
ventory = a and assignment. This in- 
tory (Nvip Navy Vocational Interest Inven- 
erentiated h emphasized activities which dif- 
rades and etween men in various skilled 
Sis was echnical occupations; less empha- 
Activities Toes on professional or unskilled 
Published h Clark and Campbell (1965) 
fatlier iny, an abbreviated version of the 
Nterest Sima the Minnesota Vocational 
i ventory (MVII), for use in voca- 


tio 
Nal guj 
u è 
Patong» idance among nonprofessional occu- 


Items 
Ctween 
8toup 
8toups 


from the NVII which differentiated 
the interests of a general reference 
(Navy men in general) and criterion 
of men who were members of specific 


aca 


hors 


1 Th 
e ini 
and qo „Pinions expressed are those of the autho 
i 


on J 
bartment t necessarily reflect those of the Navy 


“R 
Lag S 
, 
Seay, 


*Seare| 
te 


hehe for reprints should be sent to Alan at 
tch Psychologist, Personnel Measurement 
A Department, Naval Personnel and Training 
Laboratory, Department of the Navy, San 
‘Tlifornia 92152. 

ee the Minnesota jape h 
vas done to simplify the wording of e 
ral ite S, and 32 triads were eliminated. he 
ms inet ed in the area scales of the two inven- 

© identical, 


D; 


> 


some 
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STABILITY OF VOCATIONAL INTERESTS WITHIN 


L OCCUPATIONS ' 


RMAN M. ABRAHAMS 


rav Dorsi ini 
Naval Personnel and Training Research Laboratory, San Diego 


The use of empirically derived occupational interest scales assumes that indi- 
apas scores and criterion group characteristics remain stable over relatively 
ong periods of time. While these assumptions have been tested with inven- 


they have not been tested in the non- 


professional interest domain. This study compared Navy Vocational Interest 


sted men in a varicty of occupational 


specialties with their retest scores obtained 5 yr. later. Results show substantial 


g reliabilities obtained with the Strong 
of interest profiles of criterion groups 


tested in 1951 with those of men entering the same specialties 13 yr. later also 
showed considerable stability. Findings should be generalizable to the Minnesota 
Vocational Interest Inventory, a revision of the original inventory. 


Navy occupational specialties were selecte 
for scoring keys. Using this rationale, Clar 
constructed a total of 19 occupational key 
In addition to these empirical occupation: 
keys, Clark developed homogeneous (or area 
scales. The area scales draw together item 
with a common core into homogeneous group 
ings, and provide relatively pure measures 0 
interest traits or factors. 

The use of empirically derived occupationa 
scales requires two assumptions of stability 
First, since scales are constructed on indi 
viduals tested as members of occupationa 
s assumed that they would hav 
ally the same had they beet 
tested prior to occupational choice. In othe 
words, it is assumed that individuals’ score 
remain stable over time. The second assump: 
tion is that men entering a given occupatior 
several years after scale development have the 
same interests as the original criterion group 
For instance, if mechanics tested several years 
ago differed consi 


derably from mechanics 
tested today, scales constructed on the earlier 
group would be inappropriate. 

These assumptions have been adequately 
ed with inventories used for professional 
occupations. However, with nonprofessional 
occupations these assumptions have not been 
examined. The only reported study on relia- 
bility of the NVIT scales, though rather lim- 
ited by the short retest interval of 30 days, 
indicated fairly high reliability for a sample 


groups, it i 
responded essentl 


test 


TABLE 1 


NUMBER OF Men by INITIAL SCHOOL ASSIGNMENT 
Wao Returnep NVIL 


Sent | 


Returned 
Rating NVI | NVI 
Hospitalman | 26 | 20 
Electronics Technician | 22 20 
Yeoman | 20 | 18 
Storekeeper | 12 | 8 
Engineman 19 | 18 
Machinist’s Mate | 31 27 
Submarine* | 125 | 97 
Total | 255 


| 208 


Note.—NVII = Navy Vocational Interest Inventory. f 

a Of the 97 men in the Submarine school, 62 were in mechani- 
cal occupational specialties; 31 were in electrical-electronics 
specialties; 4 men were in other miscellaneous specialties. 


of 98 trade school students (Clark, Gee, & 
Perry, 1953). 

The bulk of research with the inventory has 
been toward establishing the ability of scales 
to differentiate men in different occupations. 
Other relevant research has indicated that 
NVII test scores are related to Navy techni- 
cal (Class A) school achievement (Clark et 
al., 1953; Spies, 1966), job performance of 
IBM workers (Ghei, 1960), and job satisfac- 
tion of Navy Yeomen (Perry, 1960). Recent 
research conducted by Lau (1969) has shown 
that when Navy enlistees are engaged in voca- 
tional activities which are consonant with 
their interests, they are more likely to reenlist 


and to receive higher performance ratings and 
technical school grades. 
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While a good deal of research has been 
directed at validity, the two assumptions of 
stability have been virtually ignored. Conse- 
quently, the present study sought to furnish 
evidence on the stability of individual scores 
and of occupational groups on the scales of 
this inventory. This study examines the 5-yr. 
test-retest reliability of the occupational and 
area scores of the NVII and the stability of 
occupational scale profiles for men entering 
occupational groups 13 yr. apart. If it can be 
shown that scores are reasonably stable and 
the original scales are still appropriate, the 


inventory may be used with more confidence 
in vocational counseling. 


METHOD 
Subjects 


In 1964, the NVII was administered to 1,009 re- 
cruits at seven technical Navy schools prior to the 
beginning of training. These seven schools were Hos- 
pitalman, Electronics Technician, Yeoman, Store- 
keeper, Engineman, Machinist’s Mate, and Submarine. 
The interest test was administered on an experi- 
mental basis and scores were not used for the as- 
signment of men to the various schools. 


Procedure 


By searching Navy personnel records in 1969, 255 
of the original testees were identified either as reen- 
listees or as men who had extended their first 
enlistments. These men were followed up, and NVI 
booklets were mailed to them at their duty stations- 
Completed inventories were received from 208 men 
(82%). Table 1 presents the number of men by ini- 
tial school assignment who returned the inventory- 


TABLE 2 


MEANS AND STANDARD DEVIATIONS on RELEV. 
For CLARK’S CRITERION Grou 


ANT OCCUPATIONAL Krys 
PS AND THE 1964 


OCCUPATIONAL GROUPS 


m No. of Original criterion groups 1964 Occupational groups 
4 items a — ai = 
N x SD N x SD 
Hospitalman 69 230 ae 
Electronics Technician 71 Pa aoe E 4 es Tae E? 
Engineman 70 237 59.1 6.6 113 S 6.5 
Machinist’s Mate 61 254 608 o4 i BA a 
Storekeeper 64 146 653 103 si E He 
iain as ia! 67.1 11.0 99 71.9* 8.1 
e Bee have been transformed to standard scores (X = 50, SD = 


* The difference be 


e of 
10) based on the standardization samp! 
tween mean scores of the original and 1964 groups is significa 


nt at p = 01. 
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BM 
BT 
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DC 
EM 
EN 
ET 
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FS 
GM 
HM 
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Fig, 
Vocational Int 


di 
Mean profile on Navy 
Hospitalmen. 


Nvir 
1 se 
rele tPational key obtained on nine area scales and 
men nt key (i eys. From Clark’s original work, a 
for M that ie, a key developed specifically for 
ll but th occupational specialty) WS available 
e Submarine school. 


RE 
ESULTS AND D1scUSSION 


hirt 

een-V, 

The § Year Occupational Group Stability 

y ir i i 

Whether pt alysis addressed the question of 
y occupa- 


i er š 
noha ee — of men in Nav) I 
Sey ne i ave changed since Clark’s origi- 

“"Patio ere collected. If the interests © 
nal groups have substantially 


+5 
erest Inventory 
(Table entries 


—— Clark Sample (N=25) 
—— — 1964 Sample (N=266) 


(rho = 83) 


+30 


+20 +25 


ional scales for 1951 and 1964 


+10 +15 


occupat: 
are in raw score units.) 
tional keys would 
fferentiating men 
] specialties. 


changed, the original occupa 
not be of much value for di 


currently in these occupationa 
To examine this question, mean scores on 


occupational keys obtained by men in the 
occupational specialties tested in 1964 were 
compared with scores obtained by men in the 
same specialties in 1951.4 For this analysis, 


e Clark had not ¢ 


a 
4 Sinci onstructed scales for men 


in the Submarine school, these men could not be 
included in this analysis. Area scale scores obtained 


by Clark’s original criterion groups are also not 


available. 
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mean profiles for small randomly selected 
samples of men in the six specialties were ex- 
tracted from Clark et al. (1953) and com- 
pared with the mean profile for men in the 
same specialty in 1964. 

The 1951 and 1964 profiles revealed a 
great deal of occupational group stability 
within each of the six occupational specialties. 
Figure 1, which shows the profile for Hospital- 
man samples tested 13 yr. apart, is presented 
as an illustration. To measure the degree of 
profile similarity, Spearman rank-difference 
correlations for each of the six Navy occupa- 
tional groups were computed. Profile correla- 
tions were as follows: Hospitalman (p = .83), 
Electronics Technician (p = .93), Engineman 


(p = .95), Machinist’s Mate (p = .91), Store- 
keeper (p = .91), and Yeoman (p = .61). 

Scores earned on all occupational scales by 
the total sample of men tested in each of the 
original criterion groups were not available. 
However, Clark et al. (1953) does present 
each criterion group’s score on its own scale. 
For a further stability assessment, mean scores 
on their own scales (i.e., the scale constructed 
on this group’s responses) for Clark’s criterion 
group samples are compared with those of 
men in the 1964 occupational specialties in 
Table 2. 

While four of the six comparisons yield sig- 
nificant differences, only two show the direc- 
tion of change that might cause concern, that 


Test-Retest 
Reliability 


Aviation Machinist's Mate (AD) 
Boatswain's Mate (BM) 
Boilerman (BT) 

Damage Controlman (DC) 
Engineman (EN) 

Gunner's Mate (GM) 

Machinist's Mate (MM) 

Aviation Electron 


(AT) 
Electrician's Mate (EM) 


ics Technician 


Electronics Technician (ET) 
Fire Controlman (FC) 

Radarman (RD) 

Radioman (RM) 

Communications Technician (CT) 
Quartermaster (QM) 

Storekeeper (SK) 

Yeoman (YN) 

Food Serviceman (FS) 


Hospitalman (HM) 
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all Navy occupation 
[X = 50, SD = 10] 
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Test-Retest 
Reliability 


Mechanical 
(22) 


Health Service 
(19) 


Office Work 
(20) 


Electronics 
(17) 


Food Service 
(20) 


Carpentry 
(18) 


Sales-Office 
(17) 


Clean Hands 
(15) 


Outdoors 


(15) 


0 


s 3. Mean test and retest profile of N: 
Navy occupational samples. (W = 208; 
each area scale is given in parentheses.) 


dace creases. For Electronic Technicians the 
ess e from the original criterion group 1% 
for M an one-half of a standard deviation and 
viati achinist’s Mates about one standard de- 
conside, These comparisons, however, may be 
One ered rather demanding, since they isolate 
no a from a large set of scales and give 
exam ication of overall profile similarity- For 
n ple, the comparison presented earlier for 
original sample of Machinist’s Mates with 
fer new sample yielded a rank-order correla- 
of .91 between profiles. 
ing otter analysis may be made 
€ ability of each relevant sca 


by examin- 
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Test 
Retest 


15 


y area scales for all 
number of items in 


5 10 


in raw score units; 


ferentiate the 1951 and 1964 occupational 
groups from the original reference group of 
Navy men in general. The percentage overlap 
between the distributions of the reference 
group and each of the original and new occu- 
pational groups on their own scale was com- 
puted. For the 1951 groups the average per- 
centage overlap with the reference group is 46, 
compared to 48 on the new groups. 

These results taken together strongly sug- 
gest that the original occupational keys are 
still appropriate for measuring the interests of 
men in the same occupations. However, before 
this conclusion can be made, stability of indi- 
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STABILITY OF VOCATIONAL INTERESTS 


vidual scores over time on the same indi- 
viduals must be examined through test-retest 
correlations. 


Five-Vear Scale Reliability 


The next two analyses sought to assess the 
reliability of scale scores over a 5-yr. interval. 
The first analysis compared test and retest 
Scores for the entire sample, while the second 
analysis examined scale reliability within oc- 
Cupational specialties. 

Figure 2 presents a test and retest mean 
Profile and scale reliabilities on the occupa- 
tional scales for the entire sample. Figure 3 
Presents parallel information on the area 
Scales. Further, the last column of Table 3 
Provides test-retest means, standard devia- 
tions, and reliabilities for the entire sample. 
Clearly, there is considerable stability for both 
Occupational and area scales, Occupational 
scale test-retest reliabilities range from .54 to 
‘82 with a median value of .70. Area scales, 
which generally contain about one-third the 


` Number of items in the occupational scales, 


range in reliability from .50 to .81 with a 
Median of .62. The median 30-day reliabili- 
ties reported by Clark et al. (1953) for occu- 
Pational and area scales are .82 and .83, re- 
Spectively. While the 5-yr. reliabilities are, as 
Would be expected, considerably lower than 
the 30-day values, the median occupational 
Scale reliability is comparable to the median 
Value of .68 obtained with the SVIB over a 
3-yr. interval on 189 Harvard freshmen 
(Campbell, 1966). To insure the generality of 
the NVIT test-retest results, a parallel follow- 
Up of men who left the Navy is in progress. 
_The final analysis examined the scale relia- 
ilities within occupational specialties. | For 
this comparison, the constructed occupational 
Sroups were mechanical (V = 109), electrical- 
electronics (V = 52), clerical (V = 27), and 
health service (N = 20). The results of this 
analysis, including means, standard deviations, 
and test-retest correlations for all scales and 
Stoups are given in Table 3. 
__ These data indicate several interesting find- 
ings, First, the test-retest correlations within 
Occupational groups are generally lower than 
for the total sample. Inspection of the stan- 
dard deviations reveals, as would be expected, 
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lower variabilities within occupational groups 
than for the total sample, thereby restricting 
the magnitude of test-retest correlations. Sec- 
ond, inspection of test and retest mean scores 
on the relevant area and occupational scales 
within each occupational group shows a con- 
sistent and significant decrease at the time of 
retesting for all but the mechanical group. 
Concomitantly, on over half of the relevant 
scales, variability significantly increases after 
5 yr. in the occupational specialty, suggesting 
that actual experience in a specialty expands 
the range of interest in that work. 

A reasonable hypothesis for such increased 
variability and decreasing means on relevant 
scale scores seems to lie in the trial and error 
process of ultimate vocational choice. A group 
entering a given occupation presumably would 
be subjected to common occupational stereo- 
types, resulting in their being somewhat 
homogeneous in their attraction to that field. 
Actual exposure to the work itself provides a 
more meaningful basis for determining likes 
and dislikes. With experience in place of 
stereotypes, it would be anticipated that some 
men would be disappointed and disenchanted 
while others would be satisfied; thus, increased 
variability and decreased magnitude of scores 
on that occupational scale would be expected 


upon retesting. 


CONCLUSIONS 


With respect to the stability of nonprofes- 
sional interest scales and groups, the follow- 
ing conclusions seem warranted: f 

‘te The reliability over extended time inter- 
vals of nonprofessional occupational interest 
scales parallels that obtained with profession- 
ally oriented measures and appears to be high 
enough to justify their use in counseling. 

2. The interest profiles of men who entered 
the same nonprofessional occupations 13 yr. 
after the initial scale development were quite 
similar to those of the original criterion 
groups. It appears that these scales, con- 
structed several years ago, are still appropriate 
for groups tested more recently. 

3. Actual exposure to the work of an occu- 
pational specialty appears to lower, slightly, 
the general level of interest and increase the 
range of scale scores. 
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ASSIMILATION OF NEW 


MEMBERS IN A LARGE 


AND A SMALL CHURCH* 


ALLAN W. WICKER? anp ANNE MEHLER 


University of Wisconsin, Milwaukee 


Consistent with behavior setting theory, 


new members of a small church (338 


members, 2 = 26) reported greater assimilation than new members of a com- 
parable large church (1,599 members, 72 = 40). The assimilation measure was a 


composite based on participa’ 
ticipate, sense of belonging to 
bers and officers. College students as 
the small church (7 = 
reported assimilation. Th 
new vs. established members) ©} 
records of attendance and contr 
activities by members of the sma! 


The present article is one of a series of 
studies examining various aspects of Barker’s 
(1960, 1968) behavior setting theory. Accord- 
ing to the theory, the behaviors and experi- 
ences of members of voluntary organizations 
are influenced by the degree to which an 
Organization’s activities are undermanned. In 
an undermanned activity there are many jobs 
to be done relative to the available population, 
and thus if members are to obtain the satis- 
factions the activity provides, they must work 
harder, engage in a wider variety of tasks, and 
assume more responsibility than would be the 
Case if there were an abundance of personnel. 
Greater responsibility for the activities leads 
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21) or the large church 
e 2X2 analyses of variance (large vs. small church, 
f self-report data on participation and church 
ibutions revealed greater support for church 
Il church and by established members. 


tion in church activities, felt obligation to par- 
the church, and acquaintance with church mem- 
ked to role play new members of either 


(n=21) did not differ in 


the members to experience involvement, chal- 
lenge, and self-worth. Also, members in under- 
manned activities are likely to recruit others, 
to lower requirements for admission, and to 
share their responsibilities. 

Research on the theory has dealt with high 
school extracurricular activities such as school 
plays and athletic contests (Baird, 1969; 
Barker & Gump, 1964; Wicker, 1968, 1969a; 
Willems, 1967) and church activities such as 
worship services, business meetings, and social 
events (Wicker, 1969b). The research has 
shown that the activities of small schools and 
churches are undermanned relative to those 
of large schools and churches. Consistent with 
the theory, members of the small organiza- 
tions tend to show more behavioral support 
for their organization’s activities (e.g., they 
serve in more positions of responsibility, con- 
tribute more money, attend more frequently) 
and they tend to have more feelings of self- 
improvement, challenge, importance, and 
concern about activities. 

One aspect of behavior setting theory which 
has not been previously studied is the effect 
of undermanning on the assimilation of new 
members into an organization’s activities. The 
present study examines this relationship in a 
large and a small church. It would follow 
from the theory that the need for additional 
personnel in the activities of the small church 
would lead members to seek out potential 
members and to make them feel welcome and 
perhaps even obligated to participate. Also, 
the newcomers would be rapidly drawn into 
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the relatively large number of positions of 
responsibility available to them. Increased 
participation would lead new members to meet 
and interact with many of the church’s mem- 
bers and officers. And later, new members 
themselves might recruit others to participate. 
In contrast, in the large church which had 
a much larger pool of available personnel to 
carry out its activities, there would be less 
need for members to recruit new members, 
and the newcomers in this situation would feel 
less welcome and obligated to participate. 
There would be fewer positions of responsi- 
bility open to the new members, and they 
would be less likely to serve in them. Their 
acquaintance with other members and officers 
would be less intimate, and they would feel 
little pressure to recruit others. 

The present study also examines in a fac- 
torial design the relationship of church size 
and of recency of membership to members’ 
support for church activities. A previous study 
of church members who were selected without 
regard to length of membership (Wicker, 
1969b) provided a source of data on longer 
term or established members with which to 
compare the present data on new members. 
Thus, it was possible to test main effects of 
size and recency of membership, as well as 
their interaction. While Barker (1960) does 
not predict an interactive effect, it would be 
of interest to determine, for example, whether 
the difference between levels of support for 
church activities by new and old members 


is the same or different in large and small 
churches. 


METHOD 
Subjects 


Data were obtained from members of a sm 

) J all 
United Methodist Church (338 members) and from 
a comparable large church of the same denomination 


(1,599 members). Both churches were located in 
metropolitan Milwaukee, Wisconsin. 


opo! More complete 
descriptive data on the churches, and evidence that 
the activities of the small church are undermanned 
relative to those of the large church are presented 
in Wicker (1969b). In each church, membership 
records were used to identify all married couples 
who had joined the church between 3} and 30 mo. 
prior to the investigation. Persons who had par- 
ticipated in a study conducted a year earlier were 
not included. There were 30 people in the small 
church and 81 in the large church who met the 
criteria. Thirty persons were randomly eliminated 
from the large church, leaving 51 in the sample, 
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The Ss were first contacted by a personal letter 
from their church pastor; questionnaire data vere! 
obtained in group meetings at the churches or on 
vidually at Ss’ homes. Questionnaires were completed 
by 26 Ss from the small church (87% of those 
contacted) and by 40 Ss in the large church GEM: 
The difference in participation of the members pi 
the two churches does not approach significance 
(x2 = .89). The average length of membership for 
the small church participants was 15.0 mo., Come 
pared to 17.8 mo. for the large church. The he 
samples were highly similar in average age, num a 
of children, sex distribution, percentage who had 
attended another church in the 2 yr. prior to joining 
the present church, reported attendance at the pre- 
vious church, responsibilities held in previous church, 
and satisfaction with previous church. 


Questionnaire Material 


1. New member questionnaire. Questions dealt 
with the following: demographic variables (ages 
sex); involvement in and satisfaction with the 
church that S previously attended (if any); yall 
of importance of various factors in choosing 1l 
church; beliefs about differences in large and sma” 
churches; and a series of questions to measur 
assimilation into the church. Six of the assimilation 
questions dealt with three activities common to bot 
churches: church choir, social clubs for are 
couples, and the women’s or men’s organization. pi 
cach of the three activities, Ss were asked how ae 
quently: (a) they had been invited to join ny 
attend, (b) other members had offered to accomp ad 
them, (c) they had attended, and (d) they the 
asked someone else to attend. Also for each of 


; 
three activities, Ss indicated on 4-point scales re 
welcome they felt and the extent to which they, $ 
obligated to participate. For the above questions, 
responses were summed across the three activit 


see . inet 
An additional index of felt obligation was obt 
from Ss’ respon: 


ses to questio whether t ch 
felt they should help by catalina ena ate chute 
school class, working in other church activities, Pe 
an officer in a church Organization, and mak! 
monetary pledge to the church, The index W, ns 
number of “yes” responses to the four quest g 
Finally, a g jon ” 


wil 
„gencral question about assimilatio™ art 
the following: “T 


© what extent do you feel a} 
of, or close to, this church and its members 

6-point graphic rating scale was provided. 5s 
2 Acquaintanceship range questionnaire. T urh 
were given a list of some members of their mnie 
and were asked to indicate if they (a) recok @ 
the name, (b) knew the Person personally, an list 
addressed the Person by his first name- Bat i 


5 


% of t 
bership) were listed a ecard oa 1 shi) 
rs (also 5% of the membé 
me functions were serve 
€ two churches; the ae 
d offices in several case 


i 
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OUPS 


3. Participation questionnaire. The questionnaire 
contained a list of all the activities which had 
occurred in each church during the 15-wk. period 
Preceding the testing sessions. The Ss checked the 
activities they had attended; indicated the number 
of hours spent in each; and noted whether they had 
held a major responsibility (leader category), a 
minor responsibility (worker), or merely attended 
(member). (See Barker, 1968, Barker & Gump, 1964, 
and Wicker, 1969b, for a fuller description of the 
responsibility categories.) 


Role-Playing Subjects and Procedure 


In an attempt to learn if there is a generally held 
view that church size is related to new members’ 
behaviors and experiences in church activities, the 
Present questionnaires were also administered to 
college students who were asked to role play being 
a new member in either a large or a small church. 
Two forms of the questionnaire, one for the large 
church and one for the small church, were prepared 
for the role-playing Ss. Each form was completed 
by 21 undergraduate students at the University of 
Illinois, à 

Information about the church and the ave 
member was provided in the instructions. 
trate, here is the description of the small 


Imagine that you are married and have two 
children. About 2 yr. ago, you moved to an 
industrial suburb of a metropolitan area of ap- 
Proximately one million people. About a year ago, 
you and your spouse joined a nearby Methodist 
church having around 300 members. The church 
Pastor is well liked by the church members, and 
he is known as being a good speaker. He is in 
his 60s, and has been assigned to the church for 
2 yr. The present church building was constructed 
Several years ago. 


rage new 
To illus- 
church: 


questions on as- 


Both forms contained identical 
he new-member 


Similation taken verbatim from t n 
questionnaire, Questions on acquaintanceship and 
Participation differed on the two forms, as they did 
on the original questionnaires. To obtain informa- 
tion on acquaintanceship range, Ss were told “ 
Suppose they had been given a list of 12 Se 
church form) or 16 (large church form) church 
officers, and then to estimate the number of these 
officers whose names they would recognize, waon 
ey would know personally, and whom they wou 
Address by the first name. The same questions were 
asked about members, with either 17 (small church 


form) or 80 (large church form) being the So 
mem i a articipa’ 
shure bers specified. Data on Pine ‘Ss how 


activities were obtained by 
me of the 42 (small church form) or 104 a 
d urch form) activities occurring iN the ¢ rA 
ee a 34-mo. period they would have atten he 
=a number of hours they would have spent m a 
poe ties during the period, and the Domber i 
= in which they would have had a responsible 
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Analysis of Questionnaire Data 


A preliminary correlational analysis on data from 
the church members indicated that all of the present 
assimilation measures were positively correlated, with 
the highest coefficients in the .80s. It was therefore 
decided to treat the various measures as items in 
an overall test of assimilation of new members by 
calculating composite scores on which group com- 
parison would be made. Scores on each of the 12 
assimilation measures listed in Table 1 were con- 
verted into standard scores. Composite assimilation 
scores were obtained by summing across the 12 
standard scores for each S. 

Two ż tests and a 2 X 2 analysis of variance were 
employed to compare groups on assimilation. One t 
test compared the new members of the large church 
and the small church, while the other compared Ss 
in the large church and the small church role-playing 
conditions. The analysis of variance tested the main 
effects of type of S (church member or role player) 
and church size (small or large) and the interaction 
of the two variables. Composite assimilation scores 
were calculated separately for each analysis using 
standard scores based on data from all Ss involved 
in the comparison. For example, in the comparison 
of members of the small and large churches, the 
standard scores making up the assimilation composite 
were computed using the means and standard devia- 
tion of the distributions for members of both 
churches. Thus, composite scores indicated degree of 
assimilation of each S relative to all church members, 
and not just relative to members of his own church. 
For the analysis of variance, the distributions used 
for calculating scores were from data for all Ss (both 
church members and role-playing Ss). 

Since complete data were required to sum across 
the measures, it was necessary to drop from the 
analysis several Ss who had not completely filled 
out the assimilation questions. Of the church mem- 
bers, 5 from the small church and 4 from the large 


church were eliminated, leaving ms of 21 and 36, 
respectively. Of the role-playing Ss, one small church 
S were dropped, leaving ms 


1 one large church drop 
a for each condition. Examination of the data 


suggested that elimination of these Ss had a negligible 
effect on group means. It was also necessary to omit 
the measure of feeling welcome, since a number of 
church members who had not attended the activities 
did not indicate how welcome they felt to attend. 


Data from Church Records 

Frequency of members’ attendance at Sunday wor- 
ship service was determined from church records; in 
each church a sign-up sheet is passed along the pews 
during each service, Also available from church 
records were the amounts of money pledged and 
contributed to the church by S and his spouse (a 
combined figure for both) during the fiscal year in 
which the study was conducted. 


Comparison of New Members and 
established Members 


Data from the earlier study of members of the 
two churches (Wicker, 1969b) were employed in the 
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analysis of the effects of church size, Ts a 
membership, and their interaction oñ agen i 
church activities. In an attempt to make tl : = =. 
of the two studies yo es ee 
steps were taken: (a) new me A eee ger 
study who did not have a minor chil ‘ 

i i rior study had at least one 
ee o ta the earlier study Tophan 
been members for 30 mo. or less when they Provided 
data were assigned to the new member cag 
leaving in the established group only persons w é 
had been members longer than 30 mo. The ns an 
mean length of membership for the resulting four 
groups were as follows: small church, new members, 
n= 30, F= yr.; large church, new members, 
REI X 14 yr.; small church, established mem- 
bers, n=18, X=7.4 yr.; large church, established 
members, n = 33, X = 11.8 yr. 

New and established members were compared on 
their participation in church activities, Sunday wor- 


TABLE 1 


MEAN SCORES ON ASSIMILATION MEasy. 


RES FOR 
CHURCH MEMBERS 


AND ROLE-PLAYING Ss 


Church Role-playing 
members Ss 
Measure TERP OE a a 
Small | Large | Small Large 
church | church | church church 
No. of invitations to join or z -a 
attend three activities 5.7 5.5 8.3 8.2 
No. of offers to take S to 
three activities 3.0 1.4 5.8 7.2 
Frequency of attendance at 
three activities 4.7 1.6 5.8 8.1 
No. of times S has asked 
oth o attend three 
acti les 2.8 38 4.0 4.7 
Felt obligation to participate 
in three activities (larger 
no, indicates greater felt 
obligation) 6.6 5.4 6.1 7.5 
Felt obligation to support 
four church activities 
tyes” answers to four 
questions) 2.7 2.0 2.7 2.2 
ense of belonging to the 
church (1 = not at alla 
pare = very mucha 
-Dari 42 | 3.6 5 
No. of church officers known "a +a 
personally or addressed by 
irst name 5.1 
No. of church members se na s6 
known personally or ad- 
dressed by first name 1.0 2.6 8.7 19.5 
No. of activities attended 3.6 2.8 14.6 18.4 
No. of hours spent in i = 5 
activities 5 E e 
No. ofactivitiesinwhiens | 165 | 141 | 292 | 24.5 
wasa worker or leader 2.0 8 7.7 6 
Composite assimilation score| 2" = ia 
based on the 12 measun 8: 
t tests jx m 
Composite assimilation score| S% | -1.7 | —.on 2 
based on the 12 Measures: 
eet Ste] 3.9¢] 6,26 
Note.—For chure 


hh members in sm: meh 5, 
all = 
large church, » = 36. Both groups Small church, » = 21 


` ; in 
of 20. role-playing Ss had ao 
a1 for diff 


4, df = 55, p > 05 members 
‘tween means fi 

nditions = -70,df = 
e Fs for anal of variance 
nsi type of S, F = 28.47, df 
"Sdf = 1,93, p < 05, 


Sh df = 1,03; 
$ Interaction, 
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ship service attendance, pledges, and contritione © 
limitation of the comparisons is that the te re 
on most of the new members (all except cay ee 
the earlier study) were obtained 1 yr. after gee i 
measures on the established members. es pe 
exception is data on members’ pledges, w ae 
for the same period. Both churches are re see 
stable, however, and identical periods of time 

used for the 2 yr, 


RESULTS 


Group means for the 12 assimilation pa 
sures and for the composite scores are shoy m 
in Table 1. As expected, new members fro n 
the small church were significantly higher i 
the assimilation composite than new ere 
from the large church (t = 2.14, df = é 
p < 05). Moreover, there was no differen 
between the mean assimilation compon 
scores for the large and small church PA 
playing conditions (t=.70, df= 38, vail 
Suggesting that the present results for errs 
members are not due to any widely held V! rs’ 
of the effects of organization size on membe 
experiences, as- 

All four groups were compared on the 
similation composite using an unwels ce 
means analysis of variance for unequal). 
frequencies (Winer, 1962, 222- ize 

ere was no overall effect on church w 
(F < 1), but the type of S main effect 93; 
highly significant (F = 28.47, df=) ting 
P<.01), with role-playing Ss Epo ett 
greater assimilation than church mem 5 
The interaction of church size and type ° 3, 
was also significant (F= 4.51, d= h tio? 
P<.05). Figure 1 shows that assimila the 

y church members is greater P put 
small church than in the large church | is 
for role-playing Ss, greater assimilati Jr 
large church conditio ests 
the analysis of variance SUE? to 
ole-playing Ss were not 2°, gia 


i À im 
duplicate either the overall level of 25% 


tbe 
tion of new members or the direction °; the 
difference jn assimilation of members 
small church and the large church jn 
Examinati 


ch Den 
Corresponding large chur ce pti 
for 11 of 12 Measures, The single ©, pe 


znow” 
was number of church members kn? 
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+8 @—— CHURCH MEMBERS 
Grosa O ROLE PLAY SUBJECTS 
+6 ae 
= aa 
g 
O 
$ +2 
O 
O 
Š o 
K 
t 
x 
So 
= =R 
yn 
x 
= -4 
S 
-6 
-8 
SMALL LARGE 
CHURCH CHORE 


Fic. 1. Mean assimilation scores for church members 
and role-playing Ss. 


Sonally or addressed by the first name, and 
this result is somewhat ambiguous since 
Members of the small church were given a 
Much smaller list of names (17, compared 
to 80 for the large church). Role-playing Ss 
'n both conditions generally had higher scores 
than the church members, and the small 
church mean was higher than the large church 
Mean for only 6 of the 12 measures. 
Comparative data on support for church 
activities by new and established members in 
the large church and the small church are 
reported in Table 2, along with results of 
unweighted means analyses of variance. Con- 
sistent with earlier research (Wicker, 1969b), 
church size is significantly related to the 
range of activities entered (p < .01), respon- 
Sibilities taken (p < .05), worship service at- 
tendance (p < .01), pledges (Ż < 01), and 
Contributions (p < .06). For each measure, 
Steater support is shown by members of 
the small church. The table also shows that 
new members attend fewer activities (Ż < 
01), and a narrower range of activites 
P < .01), serve in fewer positions of leader- 
ship (p< .05), and attend worship Serv 
ss often (p < .10) than persons who have 
cen members for more than 30 mo. For none 
Of the measures is the interaction of size 0 
church and length of membership significant, 
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suggesting that the difference between new 
and established members’ support for church 
activities is the same in the large church and 
the small church. 

Discussion 


The present results on new members’ 
assimilation into the two churches supports 
the hypothesis derived from behavior setting 
theory that assimilation is facilitated in 
organizations whose activities are relatively 
undermanned, that is, small organizations. 
When compared on composition assimilation 
measures based on reported participation in 
church activities, felt obligation to partici- 
pate, felt closeness to the church, and range 
of acquaintance with other members and of- 
ficers, the members of the small church 
showed greater assimilation than the members 
of the large church. 

The fact that college students role playing 
members of small and large churches showed 
no difference in assimilation tends to discount 
alternative explanations of the obtained dif- 
ferences for church members, such as demand 
characteristics, questionnaire wording and 
format, and culture-wide views about the 
effects. of organization size on members’ 
experiences. 

The factorial analysis of members’ support 


for church activities generally indicated 


TABLE 2 


COMPARISON OF New AND ESTABLISHED MEMBERS 
IN A LARGE CHURCH AND A SMALL CHURCH 


Small church» | Large church 


Measure* i stab- E 

Me sor || see sh 
Tae mem- | mem- mem- 
erse | Bers | bers | ‘bers 
5.5 10.2 3.4 aS 


~ No. of activities attended 

2. No. of kinds of activities 
attended 

, No. of activities in which 
S was a worker or 
leader i 

4. Sunday worship service 

attendance 


4.6 6.4 2.4 47 


2.8 4.6 1.1 2.8 
19.9 26.2 14.1 14.5 


Pledged contribution ($) | 173 | 258° | 145 | 141 


5. f À 
6. Actual contribution ($) 126 187 109 109 


Te ns. 

(two-tailed) for small versus larg 
churches were: p < .01 for TETERA 2, 4, and sip ius EE 
Measure 3, and p < .06 for Measure 6; Measure 1 was ns. 
nificance levels (two-tailed) for new versus established 
churches were: p < .01 for Measures 1 and 2; p < .05 for 
Measure 3; and $ < .10 for Measure 4; Measures 5 and 6 


were ns. 


Significance le 
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greater support by members of the small 
church and by persons who have been mem- 
bers longer than 30 mo. The significance of 
the effects of church size is consistent with 
earlier research supporting behavior setting 
theory (Baird, 1969; Barker & Gump, 1964; 
Wicker, 1968, 1969a, 1969b; Willems, 1967). 
The relationship of recency of membership 
and support may be at least partially due to 
the fact that not all new members remain in 
the church; some become inactive and drop 
out. Thus, the established members belong to 
a more select group which has shown suffi- 
cient interest to remain in the church longer 
than 30 mo. Also, the established members 
are probably less geographically mobile than 
new members. 

A question can also be raised as to whether 
new members select churches of the size which 
makes demands consistent with their willing- 
ness to expend effort, since they tend to 
show the same church size differ 
port as do established members ( 
Data collected in the present s 
reported above indicate that 
of large and small churches a 
bers of small churches, 
of large churches: 
(b) spend more ti 
(c) are more regul 
(d) work harder to 
have a closer rel 


ences in sup- 
see Table 2), 
tudy but not 

new members 
gree that mem- 


e data suggest that church 
members have accurate perce; 


respectively, by 
church, friendliness of m 
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service, church school, youth program, church 
facilities, involvement of the church in social 
issues, and social activities.) Also relevant 1 
the selectivity question is the fact that 427o 
of the members of the small church and 25% 
of the members of the large church reported 
visiting another church before deciding to 
join their present church. In sum, while 
beliefs regarding the correlates of church size 
probably do have some influence on the choice 
of a church, the immediate situation of under- 
manning or overmanning is undoubtedly also 
a critical determinant of levels of support. 
Some unanswered questions regarding the 
influence of church size on members’ levels 
of participation and the assimilation of new 
members may be clarified by a study 3” 
progress on the merger of a church having 
400 members and a larger church of 950 
members. Data on levels of participation þe- 
fore and after the merger and feelings O 
being welcome and obligated to participate 
will permit a test of the effects of church size 


. H i e 
largely uncontaminated by the selective choic 
factor, 
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DOUBLE CROSS-VALIDATION OF A WEIGHTED 
APPLICATION BLANK OVER TIME 


DARRELL E. ROACH?! 


Nationwide Insurance, Columbus, Ohio 


This research is concerned with a second cross-validation of a weighted application 
blank for predicting tenure of clerical employees. The sample consisted of em- 
ployees hired during a later time period than the initial and cross-validation 
samples. The blank was developed and cross-validated in a previous study. Applica- 
tion blanks for 154 clerical employees hired during late 1953 and 1954 were scored 
using the weights previously developed. A substantial loss in predictive power was 
found in the additional cross-validation. Changes in labor market conditions, 

cific changes in personnel policies appear to be responsible 


manpower needs, and spe 
for the loss in efficiency. 
samples of employ 
continuing validity. 


The utility of the weighted application 
blank for the identification of probable early 
terminators among job applicants has been 


‘sh Street, Columbus, Ohio 43216. 


The results suggest cross-validation of such blanks with 
ees hired during a later time period and periodic checks on their 


generally developed on a sample of employees 
hired 2-3 yr. prior to the actual use of the 
instrument in the selection process, the ques- 


\ 
fA, demonstrated across a variety of employee tion may be raised as to whether such instru- 
Populations, Mosel and Wade (1951) used ments may be time sensitive and not be as 
biographical data to reduce turnover among valid for the present applicant population. 
department store clerks and Kriedt and Gadel A study by Buel (1964) in a 3-5-yr. follow-up 
(1953) used the technique to predict clerical found that, in spite of a move of the office, a 
terminators. Dunnette and Maitzold (1955) different applicant population, and a tight 
and Kirchner and Dunnette (1957) reported labor market, 13 of the original 16 items con- 
successful prediction of early termination tinued to discriminate and the validity was 
among seasonal workers in a cannery and fe- reduced from .49 to .33. Dunnette, Kirchner, 
ale office employees. Fleishman and Berniger Erickson, and Banas (1960) report continued 
(1961) report successful identification of early discrimination for the blank developed in an 
terminators among clerical employees in a earlier study. oes a supthet follow-up of 
University setting. — that study by W r a 1202) fonnd ‘thar 
The characteristic model for such studies is after 5 yr. the vali a nac he pes to near 
(> to obtain application blank data for employees zero and only 3 of a feat (hee 6 items held 
, {red 2-3 yr, prior to the research. The sample up- Thus, it appeal* a ie research By the 
“is then clas ied i Jy terminators, long- stability of the predictive power of the weighted 
ina ainan ia vidde group who 4 A fay roms conclusive agg 
erminated emple jor to the study but could be situ? ‘ s 
wêre a Sa «> Lag inal The ee as meen n was to 
the early terminators. Application blank items conduct a secon } GLOSS Veh 1 7 ion o a weighted 
are then emmed i find those which dif- application a np) ay oes —_ 
€rentiate betwee long- and short-term @ later perio j an the sample use in the 
JA groups. * etween the long lly dropped original validation and cross-validation in 
itis Tec a ee elie ae ‘calculated order to determine the stability of its predic- 
or the discriminating items and these weights tive power prior to incorporating it into the 
and items are cross-validated either by a hold- company S pees peers J Soe 
Out sample or by selecting another sample of it was desired to identify specific factors which 
“mployees hired during a different time period. appear EY La responsible for any changes in 
ten this is the ye following the initial validity by renewing personnel policy changes, 
Validation sample external factors, etc. 
~2!ce the scoring keys for such blanks are METHOD 
; a 
Reses ests for reprints should be sent to ape’ The development and cross-validation of the weighted 
+) High at, Service, Nationwide Insurance, application blank used in the present study is reported 


157 


158 DARRELL 
by Minor (1958). The sample in this study barn 
of 440 female clerical employees in the lower y m 
grades (2-5) hired during 1952-1953. Only z ee 
employees were included in order to Simna sh ia 
porary summer workers who are emp poe Ps fex 
hourly basis. One-half of the sample was use i at if 
development of the prediction key and the ot EP a 
for cross-validation. The samples were selected so that 
each consisted of a long-tenure group (tenure ranging 
from 42-54 mo.) and a short-tenure group ( terminated 
prior to 9 mo). The termination group was limited to 
voluntary terminations thus excluding discharges and 
resignations due to pregnancy. The composite correla- 
tion between the tenure criterion and the weighted 
blank was .53 for the initial sample and .46 for the 
cross-validation sample. f 
The sample in the present study consisted of 199 
female clerical employees hired during late 1954 and 
early 1955. Only full-time salaried employees on the 
lower salary grades were included. The application 
blanks of these employees were scored using a slight 
modification of the weights developed in the study by 
Minor (1958). These weights were z scores for the item 
‘alternatives and the modification consisted of adding 
the largest negative z to the weights for a given item, 
dividing by the smallest positive value after the addi- 
tion of the constant, and rounding to the nearest 
integer value. This greatly simplified the scoring and 
since the z scores are relative, the procedure would have 
a negligible effect on the relationship to the criterion. 
During the period covered by the Minor (1958) 
study, the company was using the Life Office Manage- 
ment Association test (No. 2 Form A) which was 
published by the firm of the same name for use by its 
member companies. The test consisted of six subtests 
involving number checking, following directions, 
vocabulary, proverbs and sayings, simple arithmetic, 
and spelling. The scores were the errors in each subtest 
and the time required to complete the total test. These 
Scores were then combined into a weighted total score. 
This total weighted score was the only test score utilized 
in the weighted blank developed in the prior study. 


TABLE 1 


DISTRIBUTION OF WEIGHTED APPLICATION 
BLANK Scores 


Long tenure Short tenure 
Score 
F % T Yo 
27-29 1 11 = j = 
24-26 4 44 1 1.6 
21-23 15 16.5 2 3.2 
18-20 20 22.0 13 20.6 
15-17 24 26.4 23 36.5 
12-14 20 22.0 15 23.8 
9-11 7 7.7 7 11.1 
6-8 = — 1 1.6 
3-5 Sa mn S 1.6 
n 91 100.1 63 100.0 
M 17.10 14.81 i 
o 3.97 3.54 
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Employees hired during the period sampled in a 
present study were tested with the Wonderlic personn : 
test. Since the item content of these tests appeared ie 
similar, it was assumed, perhaps naively, that = 
tests would be fairly equivalent and the Wonder! $ 
scores could be used for the purposes of the pen 
study. The weight for the Wonderlic scores was deriv po 
from the Life Office Management Association score, 
assigning comparable percentile ranks. f b- 

The present sample was classified into three ur 
samples. There were 63 employees who terminate 
prior to 9 mo, 45 who terminated between 10-23 aa 
and 91 who were still employed at the time of a 
study, thus having 23-29 mo. of service. Only ve 
long- and short-tenure groups were used in the analy a 
and discharges and resignations due to pregnancy wet 
excluded from the short-tenure group in order that the 
data would be comparable to the previous study. l 

The means and standard deviations of the tota! 
weighted score and each of the weighted item scores 
were computed. Point biserial correlations were do 
calculated for the total weighted score and each 
weighted item with the criterion. 


ResuLrS 


Table 1 shows the distribution of ba 
weighted application scores for the long- 4% 
short-tenure groups. The scores for the long: 
tenure group range from 9-27 with a mean 
17.10; for the short-tenure group they range 
from 5-24 with a mean of 14.81. A / test of the 
difference between the means is 3.75 which 1° 
significant at the .001 level, The point biseri 
correlation between the scores and the criterion 
is .29. This is substantially below the 46 fou" 
in the initial cross-validation. 

Examination of the distributions of i 
Scores indicates considerable overlap, we 
almost complete overlap in the low 8° 


range and less overlap in the higher range 
Scores. A total of 22% of the long-tenure group 
scored above 20 co 


e 
mpared to 4.80% for i 
short-tenure group. At the other end of 
distribution, howeve 


re 
r, 7.7% of the long-ten zo, 

group had scores below 12 compared to 14 

for the short-tenure 

Table 2 shows th 


group. 


he m each 
weighted items and the tests for the eignifice ns 
of the difference between means, In addi’ 
a chi-square test was calculated to tes te 
differences in the responses to the item © of 


of th? 


the | ant discrimination be o of 
a al and short-tenure groups, but only 

e Items show a reversal of direction’ nce 
two of the 


i 
ql 


JALIDATION OF A WEIGHTED APPLICATION BLANK 


TABLE 2 
Item MEANS, DIFFERENCES, f, AND CHi- SQUARE TESTS OF SIGNIFICANCE 


Item = y X i- | MA-MT 
eer Item X actives | X temi | MAM t £ |a 
1 Age at time of application 1.12 -78 „34 3 
2 | Distance of home address from company 2.10 1.79 31 4 
3 | Length of time married at time of application 1.76 1.46 30 3 
4 | Difference between salary expected & salary received 1.68 1.70 —.02 4 
5 | Source of reference to company 3.35 2.05 -30 8 
6 Average tenure on last three jobs 1.18 1.48 —.30 4 
7 | Reason for leaving job 1.95 151 Ad 5 
8 | Major field in high school 2.80 2.53 327 5 
9 | Father's occupation (classified by North-Hatt scale) 1.42 1.42 00 1 
10 Related to or knows company employees 39 32 07 3 
11 | Wonderlic test score 58 81 —-30 3 


Ot. 


č terminated difference. 


in response categories. Apparently, a weighting 
procedure which assigns the highest weights to 
the most discriminating categories achieves 
Significance in some cases where the raw 
Category frequencies do not. 

The point biserial correlations of the items 
with the tenure criterion for the initial valida- 
tion, and first and second cross-validation 
samples are shown in Table 3. Almost all of 
the item-criterion correlations show further 
shrinkage in the present sample. Items 1, 2, 
6, 8, 9, and 10 which held up fairly well on 
cross-validation showed substantial shrinkage. 
Only Item 3 appeared to hold up under this 
second cross-validation. Also, it should be 
noted that Variables 6, 10, and 11 show a 
reversal of sign. 

Discussion 

_ The weighted application blank developed 
in the previous study by Minor (1958) dis- 
Criminates between long- and short-tenure 
employees in the second cross-validation. The 
Predictive efficiency of the instrument, how- 
ever, is sharply reduced in the present study. 
Such shrinkage is to be expected between the 
validation and cross-validation samples since 
the initial validation capitalizes on all chance 

ifferences, Similar loss in efficiency would not 

© expected between samples not involved in 
the determination of the scoring weights. 
. The overlap of the distributions renders the 
'nstrument relatively ineffective for selection 
Purposes. Such instruments are primarily used 
as rejection devices in that low scoring appli- 
fants are rejected since they have a high 
Probability of terminating within @ short 
Period after the initial employment. With the 


overlap at the lower end of the distribution, 
the probability of early termination of an 
applicant who scores below 12 is .56 compared 
to .41 for the total score range of the two 
samples. At the other end of the distribution, 
we note that applicants who score above 20 
have a probability of .87 of being in the long- 
tenure group. Thus, it appears that the validity 
of the instrument in this second cross-valida- 
tion is primarily associated with this smaller 
overlap in the high score range. Consequently, 
it does not appear useful as a rejection device 


for screening out poor tenure risks. 
The examination of the item validities yields 


some clues as to possible reasons for the loss 
of predictive efficiency in this second cross- 
validation. The items concerning age at time 


TABLE 3 


CoMPARISON OF IreM-CRITERION CORRELATIONS OF 
VALIDATION AND First Cross-VALIDATION 
SAMPLE WITH SECOND CRoss-VALIDATION 


SAMPLE 
Sample 
Item Item title 
ER 1 | 2 | 3 
1 Age at time of application 24 | .28 13 
2 | Distance of home address from si | tp i 


company i . 
3 | Length of time married at time 
of application 

4 Difference between salary ex- 
pected and salary received 
5 | Source of reference to the com- 


pany 34 | .15 
6 | Average tenure on last three jobs | .25 | .27 
7 Reason for leaving last job .25 | .10 
8 Major field in high school -28 | .20 
9 Father's occupation (classified by 

North-Hatt prestige scale) AZ | Ad .02 
10 | Related to or knows company 

employees 16 | .28 | —.08 
11 Life Office Management Associa 

tion test score or Wonderlic a5 | 08 | —.19 


a Wonderlic used in Sample 3 on Item 11. 
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of application, length of time married at time 
of application, source of reference to company, 
and reason for leaving last job, held up fairly 
well in the second cross-validation. The items 
dealing with distance of home address from 
Columbus and major field in high school 
showed some shrinkage but were still in the 
same direction. The remaining items either 
failed to discriminate at all or reversed 
direction. 

In the time period between the two studies 

some personnel policies were changed which 
could affect the validity of these items. In 
1953 a major change was made in the salary 
schedule in which the beginning salary levels 
were raised. This could have affected the dis- 
criminability of the item dealing with the 
differential between salary expected and salary 
received. In 1954 a change was made in the 
company’s relative rule. Prior to that time the 
company policy prevented the employment of 
relatives of present employees. This 
modified so that female relatives of fe 
employees could be employed. This, of c 
has a direct bearing on the item referr: 
the applicants being related to or kno 
company employee, 

The change in the selection test also affected 
the results. In the previous study it was found 
that high scorers on the Life Office Manage- 
ment Association test tended to be better 


tenure risks. With the Wonderlic this tend- 
ency was reversed. It was 


was 
male 
ourse, 
ing to 
wing a 


test performance and tenure, 
Another factor which could be c 
to the loss of predictive efficiency o 
ment is that during the 1952-1953 
the company was under: 


ontributing 
f the instru- 


l yees and it was faced with a 
tight labor m in the area. 


ely unsele 
the 


1953 sample 
cted popula- 
: usual employee 
population. j 
It appears that the 
blank when used to pred 


sensitive to changes in 


weighted application 
lct Probable tenure js 
Personnel policies and 
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employment conditions. This suggests thal 
it would be worthwhile to cross-validate such 
blanks with a sample of employees hired during 
a different time period than the original 
validation sample. In addition, the need for 
periodic monitoring of such instruments to 
determine loss in predictive efficiency is 
reaffirmed. 
SUMMARY 

This study reports a second cross-validation 
of a weighted application blank using a sample 
of employees hired during a later time period 
than the initial and cross-validation samples. 
There was a large decrement in the predictive 
efficiency of the instrument between the 
initial and second cross-validation samples. 
Examination of specific items which lost 
predictive power and review of company 
history suggests that changes in labor marke 
conditions, manpower needs, and personne 
policies may be contributing factors to the 
loss in efficiency. The results suggest the 
desirability of cross-validating weighted aD 
plication blanks with a sample of employees 
hired during a later time period than = 
initial validation and reaffirms the need {0 
Periodic monitoring of such instruments. 
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SCREENING EMPLOYMENT APPLICANTS FOR 
ATTITUDES TOWARD THEFT? 


PHILIP ASH? 


University of Illinois at Chicago 


The Reid Report is a three-part instrument (scoreable attitudes-toward-theft in- 


ventory, biographical data blank, 
predict the likelihood that an em 
For a sample of 254 employment app 
related with evaluations base 
device the evaluation alternatives w 
mendation, not recommended. Chi-squ 
A Pearson product-moment correl 
and Reid Report evaluati 


Employee theft of money and merchandise 
constitutes a heavy tax on business, industry, 
and ultimately the consumer. Jaspan (1968) 
estimated that white-collar workers alone steal 
More than $5 million a day in cash and 
Merchandise, It is estimated that the loss of 
Stock certificates and securities from the New 
York Stock Exchange through theft was 
$12.4 billion in 1968 (Anonymous, 1969). 
Police-reported losses in 1968 for shoplifting 
Were $504 million, and $381 million for em- 
ployee theft (Statistical Abstract of the United 
States, 1969). These are undoubtedly gross 
underestimates. Martin (1970), Chairman 
of the Board of Carson, Pirie, & Scott (a 
Chicago department store chain) reported 
Mat a survey of retail department stores 
Mm that city showed shrinkage (unexplained 
disappearance of money and merchandise— 
Praarily a euphemism for theft by employees 
and shoppers), which ranged from 1.6% to 

` /o of gross sales, an amount typically 
Sreater than before-tax profits. 

a sess to control and reduce | 
thri., 4 Particularly employee theft 

principal techniques: apprehen 1 
Prosecution of the thief, institution of security 
Procedures and devices to make theft difficult 


or its : £ i 
Z ts apprehension easier, and screening of 


osses due 
, include 
sion and 


1 * 
Grateful acknowledgment is made to John E. Reid 


deny S0ciates, and particularly to John Reid, Presi- 

of Se and to J. Kirk Barefoot, formerly Dire 

eid Rew McKesson and Robbins, for making W 

® Report and polygraph data available for analysis. 

€ quests for reprints should be sent to the author, 

Dat aie of Psychology, University of Illinois, 
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admissions list of past delinquencies) designed to 
ployment applicant will steal from his employer. 
licants, Reid Report evaluations were cor- 
d on polygraph (lie detector) interviews. For each 
ere: recommended for hire, qualified recom- 
are for the 3 X 3 table was 45.15 (p < .001). 
ation coefficient between polygraph evaluations 
ions was 43 (p<.001). 


employment applicants in an attempt to 
identify those who might steal if employed. 

As seems to be true in the case of most other 
crimes, apprehension and prosecution has very 
little deterrent effect. If fact, an offender is 
apprehended in less than 20% of the crimes 
that are committed (Faltermayer, 1970). 

Security devices also proliferate, but ap- 
parently only with minor effect. What the 
mind of one man can invent, the mind of an- 
other can evade. 

The screening of employment applicants is 
the third line of defense. Several screening 
devices have been utilized. 

In high security and high trust positions 
(e.g; bank tellers and bank guards, armored 
car money transfer personnel, police depart- 
ments) the polygraph (or lie detector) is 
used for screening applicants. The 


frequently 
device has been demonstrated to possess a 
high degree of reliability (Bersh, 1969; 


Horvath & Reid, 1971; Kubis, 1962; Reid, 
1971) in establishing an applicant’s innocence 
or guilt with respect to specific | criminal 
activities. The procedure 1s relatively ex- 
pensive, however, and unions have strongly 
resisted its routine use in employment selec- 
tion. Eleven states now have antipolygraph 
s: Alaska, California, Connecticut, Dela- 
Hawaii, Massachusetts, New Jersey, 
and 


law: 


ware, j 
Oregon, Pennsylvania, Rhode Island, 


Washington (Reid & Hunter, 1970). 

A second source of information is the refer- 
ence, credit, and police check. At best, how- 
ever, this check is incomplete: it never detects 
the previously unapprehended. At worst, it 
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is deceptive: ex-employers are too frequently 
unwilling to commit themselves in a written 
reference to an allegation that a previous em- 
ployee stole from them. The possible costs in 
libel suits far outweigh the limited return 
earned by candor. . 

A third source of information is the test or 
inventory that might predict delinquency. 
Significant predictions have been claimed for 
a wide variety of instruments. Among them are 
Activity Vector Analysis (Clark & Hasler, 
1967), the Minnesota Multiphasic Personality 
Inventory (Hathaway & Monachesi, 1953), 
the Porteus Mazes (Porteus, 1954), the Id- 
Ego-Superego Test (Mangold, 1966), the 
Rorschach (Ichimura, 1966), knowledge of 
criminal argot or vocabulary (Gibson, 1966), 
the Kvaraceus Delinquency Scale and Check- 
list (Bechtold, 1964; Kvaraceus, 1954), the 
California Personality Inventory (Gough & 
Peterson, 1952), the Reid Report (Ash, 1970; 
Reid, 1967), and a variety of other devices 
too numerous to list. 

The present study is concerned with an ap- 
plication of the Reid Report to the identifica- 
tion of potential thiefs. The Report differs from 
most previous devices in two respects: first, 
it has been designed to predict only one kind 
of delinquent behavior, theft; and second, its 
Purpose should be transparent to the examinee, 


METHOD 

The Reid Re 
The first sec 
including two sets of items. The first set 


sure of punitiveness, as reflected in attitudes toward 
punishment for crimes of theft (e.g., 


“Do you believe 
there are some special cases where a person has a right 
to steal from an employer?” ; “An employer discovers 
that a long-service, trusted employee has been taking 
a few dollars out of the cash register every week. 
Should the employer have him arrested?”), The second 
set of questions is intended to measure the individuals 
own attitudes and behaviors relating to theft (eg., 


“Did you ever think about committing a burglary?” 
“Are you too honest to steal?”), á 


with admissions of previous 
during a polygraph examinat: 
males—both black and white—a v 


20, 21-25, 26-35, 36 and over) for both 
white applicants for positions of trust, fı 
volving the unsupervised 
bank tellers and messengers, 


i requently in- 
handling of money (e.g. 


» police applicants, security 
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guards, dairy route salesmen). In an unpublished s ay 
by the author, a cross-validation based upon PE 
selected in the earlier study yielded a validity of E 
with the results of polygraph-determined theft moy 

The second section of the Report consists of a detaile 
biographical data blank covering previous employments 
education; personal history (e.g., marital status, oer 
dresses over past 10 yr.); financial history and a 
debtedness; medical and social history (use of alcohol 
and drugs, psychotherapy, experience with pone jon 
cluding questioning about arrest for and conviguon 
of a variety of theft-related crimes, and gambling 
practices). i r ‘hich 

The third section includes a list of questions to whic 
a “yes” response constitutes an admission of a gop 
mitted theft or other defalcation (e.g., “Did you write 
a check knowing there was not enough money in the 
bank more than three times?” ; “Did you make a false 
insurance claim for personal gain?”), 

In the typical employment use of the Report, an oe 
plicant completes the form in the employer’s persons 
office. Part 1 is scored and all three sections are analyzed 
by a staff member of J. E. Reid and Associates. On oy 
basis of the item Score, plus evaluation of responses ag 
the bio-data and the admissions questions, an evalua- 
tion of “recommended for employment” or “not rec 
mended” is made. Norms exist for the score itself, 
the bio-data and the admissions questions are also ui 
into account in the final recommendation. Thus, a 
example, a middling to high score, which might z 
itself lead to a “recommended” evaluation, may be ie 
versed by a bad debt history, work instability 3e 
numerous admissions, Incredible as it may secm, pr 
plicants in significant numbers do admit to practica a 
every crime on the books. Furthermore, on the ie 
part, on both the punitiveness and self-reflective aa 
“faking good” seems to be rare. In the absence of boc 
it may be speculated that response to the question 
Is strongly determined by the individual's own ey 
tices: someone who steals approves of punishment oe 
for persons who steal more than he does (punit ut 
items), and sees as admissible more “thoughts @b° 
stealing” than more rigidly honest respondents. 

These speculations about why people admit to f 
are well supported by interview experiences with on 
ir implicit norms © 


a] n 
persons’ theft patterns and their Fial 


heft 
em” 


5 o 
) Iter question, “As far as yout oint 
onesty is concerned do you think you are [a T-P" or- 
scale ranging fro; 


m far above averagi far below A 
a elow 
P age to 


k „tern® 

se who do choose these rating a elf. 
7 

i ver scores on the inventory *™ gu 

tn ston “What Percentage of asa cher $ mean 

=$ steal something from thei any?” the. te 

estimate is 1065, EA eir company stim 5 


5-5007 m low scorers t: ypically €$" ney 
he The question, «The igs Paaa nes 
(t at did not belong to me) that I have taken fron’, 000 
would be about [can’t remember, none, $1 t0 $S for 


in eight increments] 
the response given, 
the case of o 


ries srsti ion 
has elicited, in justificat p ae 
the most interesting ins!s ia ad 
ne applicant, for example, an in! 
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SCREENING FOR ATTITUDES TOWARD THEFT 


mission of $600 was erased and changed to $1,000. 
During the interview, the applicant was asked why he 
made the change, and whether he still had doubts 
about the right answer. He responded that (a) he knew 
everyone took some money and the examiner would 
know he was lying if he claimed never to have taken 
any himself; and (b) upon reflection, figuring a 5-yr. 
Period, $1000 was only $4 per week, and “Doc, isn’t 
ve Sor Isn't that par for the course? I want to level 
À 5 you, and you would think I was some kind of a 
reak if I said I never took anything!” (This was an 
applicant for a job as bank teller, who was then em- 
ployed in another bank and looking for a change of 
Scenery.) In short, the applicant who has committed 
thefts from his employers generally seems to believe 
that “everyone does it” and that denial of any thefts 
would be viewed by the examiner as transparent lying. 
ie Tespolises tend to reflect what he considers to be 

€ norms for theft behaviors. 
a The foregoing attitudinal sets probably militate 
against “faking good,” even though the purpose of the 
inventory should be obvious to most examinees. 
ih Oredures and sample. McKesson and Robbins, a 

ce-reorganized drug manufacturing company, used 
both the polygraph and the Reid Report to screen 
employment applicants. Applicants were interviewed 
While polygraph tracings were made, at plant locations 
in 34 cities distributed across the United States. On the 
basis of the polygraph examination, a security depart- 
ment _ recommendation—“recommended for hire,” 
qualified recommendation,” or “not recommended” 
“Was made. In addition, all applicants completed a 
Reid Report, which was sent to the office of J. E. Reid 
and Associates in Chicago for scoring and evaluation. 
Chis evaluation was made completely independently of 
the polygraph determination, which was, at the time of 
evaluation, not communicated to Chicago. 

In all, Reid Reports were evaluated for 1,022 employ- 
Ment applicants. Polygraph recommendations, however, 
Were submitted for 254 of them. 


RESULTS 


wae comparison of the distribut 
eport recommendations for case 


ion of Reid 
s for which 


TABLE 1 
FOR ALL CASES EVALUATED 


REID Report EVALUATIONS 
LYGRAPH 


AND ror Cases ror WaicH Po 
EVALUATIONS WERE AVAILABLE 


Evaluation 
Cases Qualified Not 
Recom- | recom- recom- Total 
mended | menda- | mended 
tion 
Reid Report + 
Rolygraph findings D 254 
y 156 18 ge 100 
All Rei 61.4 7.1 31.5 
I Reid Reports 314 1022 
o: 3 
VA oe 3 9.0 30.7 100 
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TABLE 2 


ASSOCIATION BETWEEN REID REPORT EVALUATIONS 
AND POLYGRAPH EVALUATIONS FOR 
254 EMPLOYMENT APPLICANTS 


Polygraph evaluation 


Reid Report | Recom- | Quali- Not 
evaluation mended | fied | Fecia Total 
N| NÊN |n| %|N| % 
Recommended |129| 50.8] 12| 4.7 15| 5.9|156| 61.4 
Qualified 13] 5.1) 4 1.6] 1 A] 18 dal 
Not recommended | 35| 13.8] 13| 5.1 | 32 | 12.6 80| 31.5 
Total 177 | 69.7| 29 | 11.4] 48 | 18.9 | 254 | 100 


Note.—x? = 45.15, df = 4, p < .001;r = 43,2 < 001, 


polygraph examinations were available with 
all 1,022 Reid Report evaluations failed to 
reveal any significant differences. 

For the sample of 254 cases for which both 
polygraph-based recommendations and Reid 
Report evaluations were available, a high 
degree of relationship was found between 
them (see Table 2). 

Chi-square was 45.15 (df = 4), significant 
beyond the .001 level of confidence (Peters & 
Van Voorhis, 1940, p. 498). Since the recom- 
mended-not recommended judgment may be 
considered a continuous variable, a Pearson 
product-moment correlation, corrected for 
broad categories (Peters & Van Voorhis, 1940, 
p. 395-399), was computed at r = .43. 

The hit ratio (same recommendation for 
both polygraph and Reid Report was 65%; 
the miss ratio (recommended vs. not recom- 
mended) was 19.7%. Qualified recommenda- 
tions, either from the polygraph data or from 
the Reid Report evaluations, accounted for 
the balance of the cases, 15.3%. 

Furthermore, analysis of the misses in- 
dicates that de facto agreement was greater 
than the statistical tally revealed. 

For example, among the 15 misses not recom- 
mended (polygraph), recommended (Reid 
Report), six applicants earned passing scores on 
the Reid Report and no theft evaluations on 
the polygraph, but the polygraph examiner re- 
jected them for other reasons, such as medical, 
heart condition, heavy drinker, wanted sales 
job—none open, chronic absenteeism. 

On the other hand, among those applicants 
who passed the polygraph examination but 
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were not recommended on the basis of their 
Reid Report responses, the Reid Report almost 
always elicited both (a) a low score on the 
first part, and (b) admission on the last part 
of the report of past thefts of various sorts, 
or (c) poor histories including many changes of 
jobs and home addresses, and credit indebted- 
ness which the applicant could probably never 
liquidate. , . ; 
In summary, the Reid Report gives promise 
of being a device which can identify, at a rea- 
sonable level of confidence, applicants who 
probably have a previous history of theft be- 
havior, and who might be presumed to have 
attitudes conducive to stealing from subse- 
quent employers, given the opportunity. 
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SELECTION AND EVALUATION PROCEDURE FOR 
OPERATING ROOM PERSONNEL? 


URY GLUSKINOS? ann THOMAS F. BRENNAN 


Temple University 


Sixteen surgeons and supervisors of operati 
behavioral statements pertaining to OR performance on a 1-7 im 


wo statements were retained and incorpora 
agreement among raters as measured by 


d phase of the study, 163 OR personnel 
he 42-item criterion list and of these, 40 were 
ting of the Wonderlic, O'Connor Finger Dex- 
les of the California Psychological Inventory. 
s revealed that 54% of the variance could 
l technical knowledge, attention to detail, 


Forty: 


the basis of mean rated importance and 
their standard deviations. In the secon 


were rated by their supervisors on t 
administered a series of tests consis! 
terity Test, Press Test, and three sca! 


Factor analysis of the criterion variable: 
be accounted for by four factors: genera 
patient awareness, and social interaction. T! 


ng room (OR) personnel rated 100 
portance scale. 


ted into a final criterion list on 


he first three factors were significantly 


related to one or more of the predictor variables. 


In recent years attention has been focused on 
tüs inadequacy of the criterion in most pre- 
diction studies. Nevertheless, even today, most 
prediction studies approach the criterion pro- 
blem in a most superficial way, leading to 
criterion contamination and limiting its reli- 
ability. Exceptions are such studies by Hemp- 
hill (1963) on engineers and Smith and 
Kendall’s (1963) study on nurses. The study 
described here outlines a systematic approach 
ts the criterion problem in a little explored 
Vocational specialty, that of the operating 
room (OR) personnel. 

For many years the demands of hospitals for 
qualified OR personnel has greatly exceede! 
the supply of such personnel available. Until ap- 
Proximately 10 yr. ago, ORs have been staffed 
Primarily by graduate and registered nurses 
who had undergone intensive training in OR 
procedures, Since nursing schools screen appli- 
cants prior to admission and subject their 
nursing students to rigorous examinations in 
theoretical knowledge and clinical performance 
op to graduation, no great need has existed 
or a specific OR selection procedure. ; 
However, in recent years, hospital nursing 
schools have reduced or omitted OR training 
ear curriculum and many hospitals se 

aati’ an . 
Professional OR. training PREM i surgical 
= m personnel (i.e. ame 
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roductivity, 
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row is now at the Israel Institute for P 
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technicians). It might be noted that the armed 
forces have employed corpsmen in their ORs 
to function as scrub and circulating surgical 
technicians. Upon discharge from the service, 
these men and women have demonstrated to 
hospitals the value of the nonprofessional, 
functioning as members of the surgical team 
(Peers et al., 1966). Hospitals are now recogniz- 
ing that the need to staff their ORs can now be 
partially satisfied by employing technicians to 
assist the registered nurse (RN) in the OR. 

Individual hospitals construct their own 
training programs which vary in length from 6 
wk. to 18 mo. and selection of applicants is 
hospital. Applicants to these 
programs are primarily premedical students, 
licensed practical nurses (LPN), women who 
have been unable to complete their nursing 
training, auxiliary hospital personnel (orderlies, 
dietary personnel, etc.) who have expressed 
an interest in such a program, and in some 
cases, individuals with no prior medical experi- 
ence, who have a sincere interest in medicine 
and were attracted to the program (Boyer, 
1966). Selection procedures have not seemed 
effective according to the high attrition rate as 
reported by some hospitals (Turkenkopf, 1967). 
Tt was with this thought that this study was 
undertaken. 

The objectives of this st 
a testing procedure for the selection of OR 
personnel and to construct an adequate 
standardized criterion for the evaluation of OR 
necessity of which is recognized 


unique for each 


udy were to develop 


personnel, the 
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in the field of hospital management (Metzger, 
re construction of a standardized evalua- 
tion form could be undertaken with the know- 
ledge that OR tasks are essentially the = 
among the different types of hospitals in the 
United States, much the same as surgical pro- 
cedures are performed in the same manner, 
regardless of the location or type of hospital. 


METHOD 


Criterion Development 


A list of 100 descriptive behavioral statements applic- 
able specifically to performance in the OR was collected. 
The statements were submitted by OR nurses, OR 
supervisors, and surgeons and based on actual observa- 
tion of OR personnel performing these tasks during 
surgical procedures. These 100 statements were then 
submitted to 16 surgeons and OR supervisors from 
various hospitals, and were rated by them on a scale 
of 1-7 (least important-most important, respectively). 
Mean importance ratings and standard deviations were 
computed for each of the statements. Only those 
statements with means above 3.80 and standard devia- 
tions below 1.75 were used as criterion variables. The 
final evaluation form consisted of 42 statements with a 
rating scale of the form: 


1 2 3 4 5 
Statement Rarely Seldom Frequently Often Always 
The order of the rating scale was randomly alternated to 
reduce response sets. 

Predictor variables were chosen on the basis of a 
limited testing time availability and their appropriate- 
ness to the types of statements contained in the final 
evaluation form, The tests selected for use in this study 
were the: (a) O’Connor Finger Dexterity Test; (b) 
Wonderlic Personnel Test; (c) three scales from the 
California Psychological Inventory (CPI; 


bility, Self-Control, and Socialization); and (d) Press 
Test which yields five 


of Part 3, that is, Ss’ ability 
in the presence of distracting 
Scores was obtained, 
Procedure. The evaluation form was mailed to 10 
hospital OR supervisors in Seven states and the Tue 
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of Columbia. Supervisors from 17 hospitals responded by 
completing evaluations on 163 OR personnel. I a 
sample consisted of 76 RNs and 87 OR tenner 
(ORT). Of the 17 hospitals responding, the Po 
hospitals from the Philadelphia area agreed to SA 
their personnel tested with the predictors chosen for a 
study. The four tests were administered by the seco! A 
author to 40 OR nurses and ORTs drawn from ee 
seven hospitals. The sample consisted of 38 females sn 
2 males whose ages ranged from 19 to 54 yr. (A oes 
= 28.4 yr.) with an average of 4.2 yr. experience 1 
the OR. 


RESULTS 


Analysis of the Criterion Variables 


A principal component factor analysis 
performed on the evaluation scores of the 163 
RNs and ORTS and the resulting factors wer" 
rotated to simple structure. Four factors which 
accounted for 54% of the total variance were 
retained. These factors lend themselves t° 
fairly unambiguous interpretation; they de- 
fined specific areas of OR task performance 
Below follows a description of these factors an! i 
the major defining statements, each statemen 
being followed by its factor loading and mean 
importance rating. For each factor, interne 
consistency reliability also 
computed. i 


coefficients were 


Factor 1—General Technical Knowledge 


This factor contains statements which mea- 
sure an individual’s performance of routine oS 
Procedures, such as, application of principle 
of sterile technique, preparation for surgic® 
procedures as to type of instruments and equipe 
ment required, and knowledge of the sequen 


Vel E re 
of events for a Particular operating procedu! ’ 
etc. 


Sees that su 
(.74, 6.06) 


Anticipates the needs of the 
Is able to threa 


* 7 erly 
ction machine is functioning proP 


surgeon (.73, 6.37) 
d needles quickly (.72, 4.25) 
The Spearman-Brown 
coefficient for this factor 
r= 96. 


iabilitY 
corrected reliab? e 
was computed t 


Factor 2—Allention to Detail 


d- 
- Joa 
The statements with significant facto! Jo! 1's 


ings on this factor measure the individ nti 
Performance of Specific tasks which are esse aD 
to effective performance in the OR. The a are 
™Portance ratings of all of the statement? 


di 
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high and have low standard deviations, from 
which we can infer that surgeons and OR 
supervisors consider these statements to be of 
extreme importance in the OR. It is also 
interesting to note that the mean score of the 
sample (V = 163) on this factor was 28.01 with 
a standard deviation of 2.89 and a range of 
18-30, 30 being the highest score attainable 
on this factor. 


Maintains aseptic technique throughout the procedure 

(78, 6.93) 

oe track of needles on the surgical field (.76, 6.62) 
ollows proper surgical scrub technique (.72, 6.56) 


The Spearman-Brown corrected reliability 
coefficient for this factor was computed to be 
r= 89. 


J $ 
Factor 3— Patient Awareness 


The statements which make up Factor 3 are 
Primarily patient oriented with an underlying 
Safety factor involved in most of the state- 
ments. For example, the first statement listed 
below for this factor measures the frequency 
with which a nurse checks her shoes for con- 
ductivity. With the use of explosive anesthetic 
gases in the OR there is a hazard which arises 
from the production of static electricity on the 
clothes of OR personnel. OR personnel thus 
wear shoes which will conduct this static 
electricity from their bodies through the shoes 
to a special flooring which is grounded and 
thereby eliminate the hazard of possible arcing 
of static which can set off the explosive gases. 
It is essential that these shoes be checked and 
cleaned frequently to insure the efficient con- 
duction of this static electricity. This particular 
statement is particularly patient oriented since 
the greatest concentration of these explosive 
gases is centered in and around the patient. 


S her shoes each day for conductivity (.67, 5.18) 
D ows sympathy to her patient (58, 4.87) 
oes not move patient after surgery without the con- 


Sent of the anesthesiologist (54, 6.06) 


he Spearman -Brown 
oefficient for this factor was com 
F = BD 


reliability 


corrected 
puted to be 


F ; P 
actor 4— Social Interaction 

Tae aaae SOE! 
I Factor 4 measures an individual’s social 
behavior in relation to superiors, co-workers, 
Patients, and self. The number of social behavior 
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statements collected and observation of per- 
formance demonstrated the close personal 
contact which exists in ORs. The importance 
of accounting for a social behavior factor inOR 
performance is also reflected by the relatively 
high mean-importance ratings given to these 
statements by surgeons and OR supervisors. 


Shows courtesy to her co-workers (.77, 5.30) 

Accepts constructive criticism (.77, 6.18) 

Will readily accept new members to the staff (.62, 5.24) 
The Spearman-Brown corrected reliability co- 
efficient for this factor was computed to be 
r= 81. 


Prediction of Performance 


Scores for the four factors plus a total 
criterion score were computed for each S and 
these scores were then correlated with the 10 
predictor scores (see Table 1). The greatest 
number of significant relationships was found 
for Factor 2, attention to detail. The highest 
correlation on this factor was obtained with 
the Wonderlic (r = 55, $ < .01). It is some- 
what surprising that such a significant relation- 
ship was found for this factor when one con- 
siders the tight distribution of scores as men- 
tioned above. Similar correlations were also 
found with the Wonderlic and the total 
criterion score and scores on Factor 1, general 
technical knowledge (r = 53 and .56, respec- 
tively). The Socialization scale of the CPI 


TABLE 1 
Test SCORES 


CORRELATION MATRIX OF TE: 


with Five EVALUATION SCORES 
Factor 
Test 
Total 1 2 3 4 
O'Conni 
De: 


EN 
Wonderlic 
Personnel Test 
Press Test 
Part 1 
Part 2 
Part 3 
Part 1-2 
Part 2-3 
California 
Psychological 
Inventory 
Responsibility 
Self-Control 
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predicted Factors 1 and 2 but failed to pre- 
dict the social interaction factor (Factor 4) 
with significance. Factor 1, general technical 
knowledge, and Factor 2, attention to detail, 
were predicted by the Press Test which isa 
measure of an individual’s ability to function 
effectively under stress conditions. Since effec- 
tive OR performance depends upon the person 
reacting to events occurring in rapid succession 
which often do not follow in the usual or 
expected sequence, Factor 1 and Factor 2 are 
validations of the Press Test (see Table 1). 
The Press Test also predicted Factor 3, patient 
awareness. 

Factor 4, social interaction, could not be 


predicted from any of the test variables used 
in this study. 


Criterion Analysis 


An analysis of variance between RNs’ and 
ORTs’ five criterion scores was performed to 
detect any differences in the rated performance 
of these two groups. 

The results of this analysis indicate that the 
RNs were rated by their supervisors signifi- 
cantly higher (p < .01) than the ORTs on their 
total score as well as their scores for general 


TABLE 2 


Means, STANDARD DEVIATIONS, AND ANALYSIS oF 
VARIANCE RATIOS FOR CRITERION FACTORS 
oF RNs anp ORTs 


Factor Group| X SD | N |F ratio 
Total score RN 182.49 | 15.36 | 76 | 20.73* 
ORT | 168.58 | 22.43 87 
Total | 175.06 | 20.61 163 
General technical] RN 90.20 | 9.78| 76| 22,55 
D k $ 2.55* 
knowledge ORT | 80.82 14.60 | 87 g 
Total | 85.19 13.39 | 163 
Attention to RN 28.93 | 1.78| 76|15 
i .93 X 15.86* 
detail ORT 27.21 | 3.40| 87 a 
Total | 28.01 2.89 | 163 
Patient aware- RN 19.82 | 3.12| 76 50* 
ness ORT 18.26 | 3.61 | 87 iý 
Total | 18.99 | 3.47 |163 
Social inter- RN 43.53 | 5.13 | 76 hi 
action ORT | 4229| 538| g) 22 
Total | 42.87 5.28 | 163 | 
| 
*p < 0l 
a 
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technical knowledge, attention to detail, and 
patient awareness (see Table 2). Since the RNs 
undergo longer and more intensified training in 
OR technique as well as other areas of patient 
care than do the ORTs, one would expect that 
they would perform at a higher level technically 
in the OR situation, but not necessarily higher 
socially. This point is verified by the statisti- 
cally nonsignificant difference between the two 
groups on the social interaction factor. 


DISCUSSION AND CONCLUSION 


In the interpretation of the above analysis 
of variance, a question could possibly be raised 
that the OR supervisors (also RNs) may have 
rated their nurses more favorably than their 
technicians as the result of a bias or halo effect. 
Also, the fact that RNs are considered pro- 
fessional whereas the ORTs are considered non- 
professional may have contributed to the 
differences in the ratings. If this were the case, 
one would logically expect that the social 
interaction factor would be particularly prone 
to the halo effect, but this notion was not 
Supported by the analysis of variance. These 
results, then, lend support to the criterion 
dimensions as measures of OR performance 
Since the statements comprising the criterion 
dimensions were considered the most important 
task performed in the OR by OR nurses, supe"? 
visors, and and discriminate 
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5. For any psychological or physical mea- 
surement, an increase in the number of items 
(provided those items are similar in content as 
indicated by large internal consistency coeflic- 
ients) will increase the reliability of the 
measure. i 

The usefulness of behavioral statements as a 
starting point for criterion development was 
shown in this study which attempts to measure 
and predict OR performance. Factor analysis 
revealed four basic dimensions accounting for 
54% of the variance of OR performance. For 
three of these dimensions, sizable relationships 
were obtained from a combination of four 
rediro, As a concurrent validity study, 
: e predictors probably underestimate the 
magnitude of the correlations. Since the sample 
involved in the prediction phase of the study 
Was rather small (V = 40), replications should 
be made to cross-validate the predictors used. 
Additional tests may be used to predict the 
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social interaction factor which could not be 
predicted with the current battery of tests. 
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Work groups with unstable membership 
stable membership. Contrary predictions 
ship were derived from the I 
ture. Employees in unstable 
but more favorable organiza: 
Likert’s theory, there were n 
variables and end-result variables, su, 
structure described by Bennis, Incre 
unstable membership is sugg 
for the decreased peer leader: 


work groups rej 
tional context, 


g: 
8: 


ased 


When an organization abandons traditional 
structural forms that are readily depicted by 
simple organization charts, and moves toward 
a more dynamic and fluid structure, one result 
may be that work groups with unstable mem- 
bership become a permanent feature of the new 
structure. Current theories of organizations dis- 
agree about the effects of such unstable struc- 
ture on organizational functioning. This article 
contrasts two such theoretical positions and 
presents empirical evidence relevant to the 
issue. 

Likert (1961) has summarized 


a considerable 
body of evidence 


supporting the conclusion 
of interaction process in 
roups is an important de- 
izational performance, One 
rom Likert’s theory of or- 
the superior sho 
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TABLE 1 


MEAN SCORES FOR PLANNABLES AND ASSIGNEDS ON MEASURES OF 
Four PEER LEADERSHIP FACTORS 


Work group structure 
Peer leadership factor Plannable Assigned t 
SD x SD 
Support 
Work group is friendly 3.86 96 3.95 1.02 
Work group pays attention to you 3.56 92 3375 .96 2.04* 
Work group willing to listen to you 3.63 1.01 3.70 99 
Goal emphasis 
Work group encourages effort 3.17 1.12 3.25 1.05 
, Work group has high standards 3.66 91 3.72 97 
Work facilitation 
Work group shows you how to improve 3.32 1.07 3.59 1.06 2:65% 
Work group helps plan ahead 3.25 1.14 3.28 1.26 
I Work group gives you new ideas 3.49 1.03 3.50 1.12 
nteraction facilitation 
Work group encourages teamwork 3.22 1.18 3.46 1.13 2.19* 
Work group emphasizes team goal 2.94 1.47 3.19 1.10 2.12* 
Work group exchanges opinions 3.19 1.06 3.41 1.08 2.12* 


Note.—The N is rou s 
Note, ghly 140 plannables 
and will not be reported. Two-tailed tests were 


> <0. 


x METHOD 
Subjects 
The data were collected as part of a longitudinal study 
of organizational change in a Southern oil refinery? The 
maintenance division of this refinery includes a depart- 
ment that differs from the others in that its employees, 
labeled “plannables,” constitute a manpower pool with 
no permanent work group structure. ‘The work done by 
this department may be in any part of the large plant, 
and is scheduled in advance for brief periods only. Men 
are assigned to jobs on an ad hoc basis, with frequent job 
changes according to refinery needs. They may spen 
only: a few days or as much as several months on any one 
Job assignment, A plannable does not work as a member 
of a stable work group; he works with whoever else hap- 
bens to be assigned with him toa particular job. Agiven 
plannable works under many different supervisors, de- 
bending on what job he is given. In contrast to the fluid 
Situation of the plannables, the assigned” wage-earners 
have permanently assigned posts in their departments, 
work in fairly stable groups, and have relatively endur- 
he relationships with their supervisors. Data collected 
ë om 151 plannables in one department and 582 assigne 
mployees in two departments were used in the present 


analysis, 
——— 


3 The larger study is a multicompany research pro- 
Sram undertaken by the Business and Industry Group 
x the Center for Research on the Utilization of Scienti- 
lic Knowledge, Institute for Social Research, University 
of Michigan. t 


and 525 assigneds. The 
used throughout. 


N's for particular measures vary only slightly from these, 


Measures 


Machine scorable paper-and-pencil questionnaires 
were administered on company time in the Spring of 
1968. The questionnaire included 112 5-point Likert 
scales. It measured the following classes of variables: 

Causal variables. These include independent variables 
that can be altered by management and that determine 
the course of developments within an organization and 
the results it achieves. The causal variables measured 
included leadership exercised by both supervisors and 
peers, company concern for employee welfare, general 
administration, and technology. The dimensions of 
leadership measured were support, goal emphasis, work 
facilitation, and interaction facilitation (Bowers & Sea- 
shore, 1967). 

Group-intervening variables. These reflect the quality 
of behavior within work groups. The items measuring 
these variables have been derived from a conceptual 
framework found useful in previous research and utilize 
the reference phrase “your work group.” Aspects of 
work group functioning measured included coordina- 
tion, decision making, communication, control, cohe- 
siveness, trust, and perceived work group effectiveness. 

Contextual variables. These differ from group-inter- 
vening variables in that they refer to behavior that ex- 
tends beyond the immediate work group and involves 
such aspects of organizational functioning as relation- 
ships between departments and events cutting across 
more than one hierarchical level within the organi: 
tion. These variables can be thought of as the accumu- 
lated effects of the ways in which other groups in the 
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organization function, and as a context for the behavior 
of the focal work group. 3 

End-result variables. These are dependent variables 
which reflect the achievements of the organization. 
Productivity and other performance data were not 
available for the present analysis, but questionnaire 
measures of employee satisfaction and mental health 
provide some indication of results being achieved by the 
organization in terms of psychological return to its 
members, 

Likert (1967) has elaborated the nature of causal, 
intervening, and end-result variables, and their inter- 
relationships across time. The contextual variables de- 
scribed above were categorized separately for the pres- 
ent analysis after examination of the data indicated that 
this would be useful. Thus, Likert’s “intervening vari- 
ables” have been divided into two categories labeled 
contextual and group-intervening. 


REsuLts 
Causal Variables 


Thirteen items measure the four supervisory 
leadership factors. The only two differences be- 
tween plannables and assigneds that are 
nificant indicate that assigned supery 
their men how to improve their perfo: 
a greater ‘extent (p < .01), 
supervisors hold group meeti 
(p < .01). Each employee wa: 


sig- 
isors show 
rmance to 
and plannable 
ngs more often 
s asked what he 
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thought his immediate supervisor needed in 
order to be a better manager; the only one of 
seven such items that yields a significant differ- 
ence indicates that the plannables more than 
the assigneds felt that their supervisors have a 
situation that lets them do what they already 
know how to do and want to do. None of the 
nine additional questions regarding adminis- 
tration, company technology, technical skill of 
supervisors and peers, or company concern for 
employee welfare yields a significant difference. 
Table 1 shows that the assigneds are higher 
on all 11 items measuring peer leadership and 
significantly higher on 5 of them. 
Contextual Variables 


Table 2 shows that the plannables score 
higher on 11 of the 12 measures of organiza- 
tional context. Furthermore, one measure of 
each contextual variable reveals a statistically 
significant difference, 

Figure 1 contrasts the distribution of control 
reported by the two groups. The plannables at- 
tribute significantly less influence to the first 
level of supervision, and significantly greate" 
influence to top management. 


TABLE 2 
MEAN SCORES FOR PLANNABLES AND ASSIGNEDS ON MEASURES or CONTEXTUAL VARIABLES = 
Work group structure 
Contextual variable Plannable Assigned t 
x SD x SD 
Coordination | 
Departments plan coordinate w 
ell 
Lanes department problems wel] on L TA mt 
ow department problems h. i p adt nog 
Communication P andled 3.90 1.14 3.51 131 31s" | 
You get information fr i l 
d om ot 
uperiors open to ideas net Units 278 1.08 2.50 113 | 2.59" 
Motivation 3.06 1.16 2.92 1:12 
Conditions encoura; , , 
O ge hard work 
Participation in decision S 3.34 1.03 3.10 1.16 2.19" 
Objectives set jointly ëy l 
Decision levels optimum 2.57 1.35 2.71 1:44 
Decision makers seek ideas 3:20 -96 3.05 1.06 
Decision makers get information a 1-07 2.45 1.10 
Lower level problems known 2.90 1.00 2.75 115 s 
Conflict handling 2.83 1.16 2.58 118 2.28" 
Disagreements worked through Š l m 
3.53 97 | 3.31 1.08 2.19 ' 
08 | = 
aS IA, TS 
Mtp <0 
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5.0 
E © PLANNABLES 


A GREA k 
To a N oY ASSIGNEDS 


DEAL 


Quite A 3.0 | 


BIT 


AMOUNT OF CONTROL 


2.5 


SOME 2.0 


1.0 


FIRST LEVEL EMPLOYEES 


MIDDLE 
SUPERVISORS 


T MANAGEMENT 


TOP 
MANAGEMEN 


LEVELS 


Fic. 1. Control curves for plannables and assigneds. (Significance levels of differences between p. 
nables and assigneds in amount of control attributed to different hierarchical level 
P < .05; first-level supervisors, $ < 025; differences at other levels are not significant.) 


HIERARCHICAL 


lan- 
s: top management, 


company, and pay, nor on two items measuring 


Gr , 
oup-Intervening and End-Result Variables 5 
Si ; mental health. 
E a measures of seven aspects of —, 
group functioning detect no differences 3 RA > 
Similarly, The plannables and assigneds do not differ in 
in supervisory leadership. 


etw : 
ie een plannables and assigneds. 
of aS es no significant differences on measures 
Satisfaction with job, supervisor, work group, 


any consistent way 1 


Hence, this causal variable could not have ex- 
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erted differential influence on other variables 
measured for the two groups. However, stabil- 
ity of group membership is a causal variable 
that can explain all of the differences obtained. 
Thus, the higher levels of assigned peer leader- 
ship would be expected, since assigned work 
group membership is sufficiently stable to en- 
able peer leadership to emerge. The plannables 
unstable group structure handicaps the de- 
velopment of peer leadership. The greater fre- 
quency of meetings held by plannable super- 
visors might be a response to this situation. 
Since the temporary and changing nature of 
plannable groupings does not allow for the de- 
velopment of a group tradition, or automatic 
performance of role behavior, frequent meet- 
ings would seem much less dispensable there 
than in the assigned situation where changes 
occur less frequently and past ways of doing 
things more readily transfer from one time 
period to the next. On the other hand, con- 
textual variables, being aspects of relations 
among work groups throughout the organiza- 
tion, would be strengthened by constant move- 
ment about the refinery and more frequent 
contact with members of othe 
and departments, 
plannables. The fluid 
ation can also explai 
less need for a freer 
visors, since 
subordinates, 
clear- 
the lower | 


ship, they do 
group-intervening variables, A. 
advantages accruing to the assi 
higher peer leadership are o 
vantages enjoyed by the pla; 
of a more favorable context. The net result is 
similarity between assigneds 


: and plannables 
m measures of work group functioning and 


Pparently, the 
gneds from their 
fiset by the ad- 
Nnables jn terms 


FINE 


end-result variables. This interpretation as- 
sumes that context and peer leadership operate 
in combination to influence group-intervening 
variables. The manner in which leadership and 
context interact is not clear from the data 
available. One could argue that the plannables, 
due to their more favorable context, have 
managed to score as high as the assigneds on 
group-intervening and end-result variables 
despite the lower level of plannable peer lega 
ership. This interpretation could be defende 
on grounds of a linkage mechanism. Likert 
calls individuals who are members of more im 
one work group “linking-pins.” Conceiving © 
nonsupervisory personnel as potential ea 
makes possible a great expansion of the tota 
amount of linkage within the organization. 
In an extreme case such as the plannable de- 
partment, every member could be a link-pi”- 
The higher context scores for plannables sug- 
gest the hypothesis that unstable work group 
structure induces lateral link-pin behavior an 
the part of nonsupervisory employees, €” 
hancing their total system perspective a 
raising their context scores, thereby offsetting 
the costs of the loss in peer leadership tba 
results from work group instability. Jes 
The higher context scores of the plannab = 
and the Similarity of the plannables and @ 
Signeds in measures of group-intervening a 
end-result variables are evidence that unsta? 
work group membership does not necessari y 
impair a depart nei 
temporary groupings the plannables have a 
dently bear the absence of M° 
ationships [Bennis, iza- 
ajor disruption of organiz% 
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EFFECTS OF JOB ENLARGEMENT ND JOB CHANGE ON 
CONTIGUOUS BUT NONMANIPULATED JOBS AS'A ~~ 
FUNCTION OF WORKER$',STATUS 4? ` 


I 


RONALD C. BISHOP? anp JAMES W. HILL è 


Southern Illinois University 


The jobs of low-status workers were either enl: 
nce of high-status co-workers whose jo 
found generally to be of no greater influence than job 


in the presei 
versa, Job enlargement was 
change without enl: 
cerned. Low 
responded unf 
directions were 
effect of job enlargement in an 
combination of factors than has 
population for field studies of worl 


of = Soons questions concerning the effects 
revi nlargement have been raised by recent 
ews of the literature on the subject (Hulin 
le 1968; Lawler, 1969). Two such 
Spean a ee the impetus for this study. 
Blood Er it has been suggested (Hulin & 
Eeee ie that a lack of controls in a 
results a job enlargement studies leave the 
Sar e Ri to alternative interpretations. For 
jobs pe ice (1953) reports a study involving 
Bean an Indian weaving mill. Initially, job 
tionall SERS were highly specific and func- 
sikune assigned. Enlargement involved as- 
Performis of jobs on an area basis with workers 
their ter more of the functions necessary in 
jobs ie e locations, instead of simply those 
hirere ved in a specific function as before. 
Were ve S i efficiency and decreases In damage 
were rt The implications of the study 
enlargem the changes in the direction of job 
ment pene encouraged small group develop- 
Itis hich resulted in increased motivation. 
1N this Possible, of course, that any job change 
results situation might have produced similar 
studies as occurred in the classic Hawthorne 
Mes (Roethlisberger & Dickson, 1939). A 


atus workers tended to be 
‘avorably when their jobs were not 
attributed to a double Hawthorne e 
organizational context involv 
been previously considered. The utility of the S 
ker behavior was discussed. 


argement so far as work 
favorably 


arged or changed without enlargement 
bs remained the same, and vice 


ers’ job satisfaction and tension were con- 


affected by job manipulation but 

manipulated. These opposing 
fiect. It was concluded that the 
es a more complex 


ation might be that the job method 
Rice (1953) recognized 
some of these problems but was unable to 
control or measure the relative effects. It is 
that field studies of this kind, for 
practical reasons, are unable to control many 
important variables, but this realization simply 
places an added burden on the applied psy- 
chologist to be constantly alert for situations 
where more controlled studies can be done 
within real work situations. 

Another aspect of job enlargement which 
has received little systematic attention con- 
cerns its influence on workers who have con- 
tiguous but unenlarged jobs. Job enlargement 
or change may have an overall positive effect 
on those receiving the manipulation, but it 
may turn out to be dysfunctional for workers 
observing but not receiving the treatment. At 
least two studies imply that these dysfunc- 
tional effects may be a function of status 
levels of those whose jobs are not directly 
affected (Alderfer, 1967). The importance of in- 
dividual status in a work group and its effect 
with respect to job satisfaction has been re- 
ported elsewhere (e.g., Homans, 1953; Roeth- 
lisberger & Dickson, 1939; and Vroom, 1964). 
Because of the tendencey of work group mem- 


third explan 
was more efficient. 


recognized 


iq 

Thi s , 
dissent Study was carried out in connection with a 

al : . . . 
epartment meme by the, sen author to i bers to differentiate among themselves on the 
* o i G y ois y . : m 
Yerity. ‘The authors chology, Southern Tllino® op- basis of status, this factor would be expected 
e authors gratefully acknowledge the help: : ee ` 
š to have an important mediating effect on the 


P eiia ans 
Micar co Stions of James G. Hunt and William M. 
Re 
Nishop Dae for reprints should be se 
veni epartment of Management, 
any tsity, Carbondale, Illinois 62901. 


Now 7 
W at Western Michigan University. 


nt to Ronald Cc. 
Southern Illinois 


`~ 


an 


results of any change which was introduced in 
an organization. The effect on the organization 
may in fact be to decrease rather than increase 
total worker performance and satisfaction. The 
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seriousness of such effects was pointed out 
in a study by Bavelas, reported by Strauss 
(1955) where the positive effects of job en- 
largement were nullified by increased Pe 
plaints of workers on unenlarged jobs. ne 
final result was that the job enlargement was 
cancelled, and within a month, six of eight 
girls who had been on the enlarged task quit. 
Strauss, referring to the Bavelas results, con- 
cludes that “The factory is a social system 
made up of mutually dependent parts of a 
system. It may be dangerous for management 
to try a new approach in one small part unless 
it is prepared to extend this approach to the 
whole organization [p. 96].” 

The purposes of the present study were (a) 
to compare the effects of job enlargement with 
those of a simple job change in a controlled 
field setting, and (b) to compare the effects of 
job enlargement and job change on contiguous 
but nonmanipulated jobs as a function of 
workers’ status in the work group. 

Job enlargement as it is used here follows 
the definition provided by Kilbridge (1960) : 
“The expansion of job content to include a 
wider variety of tasks and to increase the 
worker’s freedom of pace, responsibility for 
checking quality, and discretion for method 
[p. 357].” This definition is similar to what 
Lawler (1969) described as vertical job en- 
largement and what Herzberg (1968) termed 
vertical job loading. These definitions each 
emphasize the personal involvement of the 
worker in the successful completion of his 
task rather than simply giving him more tasks 
to complete. 

Job change is here defined as a change in 
job content which requires a different t 
but no increase in responsibility 
number of tasks performed. 

Dependent variables were job satisfaction 
as measured by the work subscale of the Job 
Descriptive Index (Smith, 1967), job perfor- 
mance as measured by quantity and quality 
of output, and tension as measured by the 
Anxiety Differential (Alexander & Husek, 
1962). Measures were taken before and after 
manipulation for workers whose jobs were 
altered as well as for workers whose jobs re- 
mained the same. A measure of tension was in- 
cluded among the dependent variables as a 
means of identifying a possible underlying 


ask 
or in the 
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correlate of changes in job satisfaction and pre- 
formance. In addition to these three variables, 
measures of individual status in the work group 
were taken sociometrically before and after ag 
job manipulation. Status was also m ida 
pendent variable in that the intraanen a 5 
job change or job enlargement was depender 
upon the workers’ status in the group. 


METHOD 
Research Setting 


This study was conducted in the Employment ee 
ing Center at Southern Illinois University, which lly 
sheltered workshop for the rehabilitation of mene 
and physically handicapped persons. In keeping with 
this purpose, clients work on jobs contracted a 
local industry. It is emphasized that these are ra 
jobs and that productivity and quality standards m ith 
be maintained. Clients work from 8:00-3:30 daily hi $ 
15-min. breaks at 10:00 a.t. and 2:00 p.st. and a tens 
hour lunch period. Some clients work under an iT but 
tive pay system and others on an hourly base by 
during the experiment all participants were Přiiness, 
the hour. Records are kept of attendance, h at o! 
and grooming, as well as quality and ga 
performance. The clients have regular supervis om reas: 
are expected to clean up and maintain their wor ament 
One major difference between this work CAVI a . 
and others is that the clients often partie are 
university training and research projects. ae a 
thus used to being questioned about themselves kind 
their attitudes and do not regard research of thi 
as being unusual. 


Subjects 


Forty-eight client-wo 
to one of eight work 
performed by works} 


signet 
rker Ss were randomly aio 
groups. The task was one ie arch 
hop clients for a local MANU" ilar 
and each S had Previously been assigned tO S erie! 
operation one or more times during his work eSP yom 
in this setting. Thi 
ranging in age from 
ing the workshop ranged from 2 mo. to 9 Y ‘pelo 
average experience about 2 yr, No S with an py 
75 was included in the group. Eight Ss ha Pen 
impairments, but none were considered serious ask 
to influence th 


. igne 
0 u elir performance on the assig" 
Twenty-six Ss were illiterate, 


Procedure 


Fach group worked 
mental task which co; 
bolts. Each § indepen, 
or bolt, but the whole 
The number of Corre 
of quantity of produci 


he 

a total of 5 days 00 i 
nsisted of sorting a 
dently sorted a A or ewe 
group sorted either C i me? ty 
ct items sorted was 5 it 

: tion, while those item’ ad 5 
Sorted or omitted Were recorded as errors * 
as the measure of quality of production. 


t si 


EFFECTS OF JoB ENLARGEMENT 


On the afternoon of the second work day, each S 
was interviewed privately and asked to rate the members 
of his work group including himself on the dimensions 
of ability, intelligence, and influence. The total rating 
on each of these factors by each member of the work 
oe constituted a worker’s status rating within the 
gh Se work group. On the morning of the third 
ad > aY an attitude and a tension measure were 
D ministered to each S: the work subscale of the Job 

escriptive Index (JDI; Smith, 1967) was used to 
measure job satisfaction, and the Anxiety Differential 
GD; Alexander & Husek, 1962) was employed as 
ie: measure of tension. Administration to Ss whose 
ieracy level made it necessary was conducted orally 
ol individually. Test-retest reliability (7-10 day 
tion sia as established ona sample from this popula- 
ee pete to the formation of the experimental groups 

as .95 for the JDI and .88 for the AD. 
coe through the third work day, either a job 
orbs or job enlargement was introduced to three 
their on of the work group selected according to 
ther as coug status (high or low) in the group while 
air maining three members continued on their 
riginally assigned tasks. The job change consisted 
merely of assigning a particular size nut to Ss who 
had been sorting bolts or vice versa. The job enlarge- 
raa the additional assignment of sorting a 
Dikalan nut or bolt, assembling the nut and bolt, and 
tine ng the assembly in a packing box with the admoni- 
shin! Be sure to get these right because they will be 

ipped out at the end of the day.” 
eae the enlarged task involved a more co 
forged eae methodological sequence ihan s unen- 
eei e ep 
factor tò l is Org er it. eterm 8 a sid spa 
made apply to this difference, a time study was 
has of both the enlarged and unenlarged versions of 
e “perimental task and a standard time computed 
differs, bemance on each. Deviations from this expected 
of eine: in standard times was used as the measure 
way ange in quantitative performance. Also, in this 
wee Tipes differences in performance due to method 

re controlled. 
o as well as job satisfaction and anxiety mea- 
EN ere readministered on the afternoon of the 

A and the morning of the fifth work days. 

i ii time during the procedure were Ss informed 

Was th ne experimental nature of the work groups, nor 

as Bei ere any indication that they regarded this jo 

bet ing different from other work they had done 
ore, 


mplex 


Desi gn 
workers 


Eig P : 
ight experimental groups composed of six wol 
assigned 


ca 7 
om n in the study, with four groups 
alend to enlargement condition and four groups 
ition Mi the job-change-without-enlargement coe 
enlar is Inder each job manipulation condition = 
hehe and job change), the jobs of the oa 
o! the T e members were manipulated while the jobs 
n eact hree low-status members remained the oa 
aa h of the remaining four work groups, the ui $ 
Status members’ jobs were manipulated while the 
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jobs of the three high-status members remained un- 
changed. This design made possible a test of the 
effects of job enlargement compared to job change 
without enlargement as a function of the workers’ 
relative status in their work group. This was possible 
not only for workers whose jobs were manipulated, but 
for contiguous nonmanipulated workers as well. 
Analyses were done on difference scores for each 
group. Differences between premanipulation and post- 
measures of quality and quantity of 
relative status, job satisfaction, and 
X 2X 2 fac- 


manipulation 
performance, 
tension were analyzed by means of a 2 
torial analysis of variance. 


RESULTS 
Performance 


While changes in quantity of output were 
not found to be significant under any of the 
experimental conditions, several significant 
changes in quality did occur. Table 1 shows a 
significant (p < -05) main affect for manipu- 
lated versus nonmanipulated workers. It can 
be seen in Table 2 that workers whose jobs 
were manipulated accounted for significantly 
more errors than their co-workers who did not 
undergo either job enlargement or job change. 
Two significant (p < .05) two-way interac- 
tions were obtained. The first was between 
the job enlargement and change conditions 
and whether or not the workers had their 
jobs manipulated. It can be seen in Table 2 
that workers whose jobs were enlarged suffered 
a decrement in quality of performance (t = 
3.76, df = 6, p < 01) while those whose jobs 
were merely changed without being enlarged 
showed less change. Since this difference could 
have been due to the greater complexity of 
the enlarged task, a correlation between error 
frequency and the levels of task complexity 
was computed (rbis = 37, P < .05). This 
would suggest that error frequency was at 
least a partial function of a greater Com- 
plexity of the enlarged task over the unen- 
larged task. Nonmanipulated workers, on the 
other hand, showed an improvement in quality 
of performance under enlargement and essen- 
tially no change in performance under job 


change. 


The apparently facilitative effect on quality 


of performance for nonmanipulated workers 
when job enlargement was introduced for 
their higher status co-workers was not sig- 
nificant. However, in all four experimental 
conditions, low-status workers were more re- 
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TABLE 1 


ANALYSIS OF VARIANCE OF CHANGES IN QUALITY: 
ERRORS IN PERFORMANCE 


Source 


Task (job enlargement-job 
change) (A) 

Level of involvement | 
(manipulated-non- | 
manipulated) (B) | 

Worker status level | 


| 1 |1773.50| 4.14 
| 


| 4463.17 | 10.41* 


pä 


i r | 1 |1844.73| 4.30 
A xB ie E [3577.50 | 8.34* 
AXC 1| 367.76) 86 
BXC | 1 [4069.55 | 9.49* 
AXBXC 1 | 2600.69 6.07* 
Error | 8 | 2875 | 
*p = .05. 


sponsive (see Table 2) than were high-status 
workers. Also, it should be noted that under 
three out of four conditions, responses of non- 
manipulated workers at one st 
in the opposite direction to th 
lated workers at the other st 

The analysis of varian 
scale of the JDI (see T 


significant main effects 
indicates th 


faction wit 


atus level were 
ose of manipu- 
atus level. 

ce of the work sub- 
able 3) yielded two 
(p < .05). The first 
at nonmanipulated workers? satis- 
h their work declined while m 

lated workers? satisfaction increased (except 
for one condition). The other shows that 
higher status workers displayed significantly 
greater declines in satisfaction with their work 


as measured by the JDI than their low-status 
co-workers, 


The third Significant effect (p < .05) for 
the analysis of variance of changes in JDI 
work subscale responses was a two-way inter- 
action between the job enlargement-job change 
Conditions and { ù 


and the manipulated-nonmanipu- 
lated conditions, It can be seen jn Table 4 


TABLE 2 


anipu- 


TABLE OF MEAN Di 
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TABLE 3 

ANALYSIS OF VARIANCE OF CHAN 
Scores: Jos Descripti 

Work SUBSCAL 


s IN SATISFACTION 
NDEX 


Source df | MS I 
Task (job enlargement-job -| 4.00 
change) (A) 6.25 


Level of involvement 
(manipulated-non- 


| 0.24" 

manipulated) (B) | 1 | 64.00 1 
Worker status level | E 76 

(high-low) (C) ipl 42.25 er 
AXB | 1 | 42.25 H 
A XE | 1) 9,00 ne 
BXC 1 | 30.25 | 484 
AXBXC 1l 4.00 i 
Error 8| 6.25 

1 
*p =.05. 


that workers whose jobs were changed showed 
an increase in satisfaction while those T 
jobs were enlarged showed a small a 
decrease. Workers in contiguous but a 
manipulated jobs showed a greater det 
in satisfaction under the job change condi 
than under the job enlargement condition. 
Changes in AD responses were foun : 
reflect less tension for those whose jobs ý 
manipulated while reflecting more tensto! (si 
workers whose jobs remained the same uble 
Table 5). This finding may reflect a $ in- 
Hawthorne effect (ie, facilitating aro of 
hibiting effects) due to the recognition ip- 
those who were singled out for the job M? 
ulation and the disappointment of those 
were excluded. + status 
The analysis of variance of changes in § cant 
ratings (see Table 6) yielded two signi kers 
(p < .001) main effects. Manipulated ye to 
experienced an increase in status comp?! 


ere 
for 


TABLE 4 
T 


N 
, ACTIO. 
Eca ABLE OF MEAN DIFFERENCES IN SATIS! 
FFERENCES IN GROUP A J A pa 
OF PERFORMANCE: FREQUENCY EA a og sean te a4 
© SUBSCALE 
| 
Job enlargement | Job change Job enlargement | Job ehate? A 
Stats Manipu- | Nonma- | Manipu- | Nona. Status Mads, Hass, pef- 
| lated | nipulated | lated nipulated Manipu- Nonma- | Manipu- nipt!? 
| | lated nipulated| lated 0 
High | 10.98 TOS | —480 | co High -45 | Ie 250 
Low | 9945 | =21,35 | 1340 | 3'5q Low 24 E 0 pe 
| e = Là | 0 | =$ 0. | 
a: 


| 


; 


. 


y 


— 
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TABLE 5 TABLE 6 
ANALysis OF VARIANCE OF CHANGES IN TENSION: ANALYSIS OF VARIANCE OF CHANGES 
ANXIETY DIFFERENTIAL SCORES Is STATUS IN GROUP 
Source df| MS F Source df | MS F 
a (job enlargement-job | Task (job enlargement-job 
A c: ange) (A) 1 | 225.00 | .68 change) (A) 1 ae elt! 
evel of involvement | Level of involvement 
(manipulated-non- fll (manipulated-non- 
manipulated) (B) 4 [2756.25 8.29* manipulated) (B) 1 | 606.39 | 84.92* 
W ne status level | | Worker status level 
i (high-low) (C) (high-low) (C) 1 | 319.52 | 44.75* 
A % 2 AXB 1| 19.14 | 2.68 
Tr | AXC 1 | 23.97 | 3.33 
nee S ; | BXC 1 1.89 .26 
ABRE | AXBXC 1| 189| 26 
tror 8 | 332.625 Error 8 7.14 
*b = 05, — _ *p = 001, 


Nonmanipulated workers whose status de- 
clined. When status changes occurred follow- 
Ing the introduction of the job manipulation, 
ower status workers were more likely to show 
PAS changes while higher status workers 
to aya This result can be largely attributed 
Seng pies a a technique employed which 
impro ; y rifol for higher status workers to 
for lo ve k heir positions in the group as well as 
wer status workers to decline in status. 

ae summary, results showed quality rather 
aig quantity of performance to be affected 
“ Job enlargement. This finding is similar to 
tose of Kilbridge (1960), Kuriloff (1966), and 
eg and Stewart (1963). However, in the 
found i ina, job enlargement generally was 
dent y OBE of no greater influence on depen- 
eran than the simple job change. It 
nonm. e pointed out too that contiguous but 
in th anipulated workers often showed changes 
Worle 9 opposite direction from those whose 
s was redesigned. In addition, group status 
work “te important factor with low-status 
than fr found to be more responsive to a 
Statu: high-status workers in most cases. THE 
mani level of the workers whose jobs wale 
chan Pulated wasa significant factor ea 
ae w quality, satisfaction with work, 

y, and rank in the work group. 


DISCUSSION 
hange and job 


The similarity between job cH f 
aria- 


enlar 
largement effects on most dependent va 
es in this study raises an important question : 


How often might the effects attributed to job 
enlargement in other studies be due instead 
simply to a job change? Perhaps merely 
selecting a group for a new task accounts for 
changes in satisfaction, tension, and status 
like those found in this study. This might be 
another example of the Hawthorne effect 
where special attention is seen as a source of 
recognition for those selected, and not being 
selected is cause for disappointment (negative 
Hawthorne effect). Of course, activation theory 
(Scott, 1966) would predict similar results. 
However, some difficulty would be encountered 
in explaining why a more complex change 
(i.e. job enlargement) did not have a greater 
effect than a simple job change. 

The results concerning manipulated versus 
contiguous but nonmanipulated work groups 
are a function of the status level of the workers. 
In general, however, nonmanipulated but con- 


tiguous workers showed decreases in status and 
satisfaction with work and increases in anxiety. 
The results support those of Strauss (1955) 
and Alderfer (1967) concerning the negative 
effects of changes in job design on contiguous 
but nonmanipulated groups of workers. One 


might speculate that if these results were 
widespread in the organization, the total 
effect could be dysfunctional. 

The data on quality seem confounded with 
task complexity and make interpretation ques- 
tionable. This is illustrated by the dramatic 
decrement in quality that occurred under the 
job enlargement condition and the fact that 
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the two levels of task complexity (job enlarge- 
ment and job change) were found to show a 
relationship to error frequency. 

Why ees workers generally ae 
show a decline in satisfaction 1s noe: can 
Perhaps they were only too well aware ii ; 
their own high status and were coppice by 
any job change in which they oa directly in- 
volved or merely watching. Both V TOOAL (1964), 
Homans (1953), and others have indicated 
that high-status workers are more satisfied 
with their jobs than low-status workers, and 
this might make them more resistant to any 
change than low-status workers. Conceivably, 
low-status workers may perceive a direct 
change as an opportunity to improve their 
position in the group. . 

Anxiety decreased as status increased for 
those whose jobs were manipulated and the 
opposite was true for those whose jobs were not 
manipulated. In addition, nonmanipulated 
workers displayed declines in satisfaction while 
manipulated workers showed an increase over- 
all. This suggests that tension and status may 
be moderators of the relationship between job 
satisfaction and job design. It further suggests 
that the so-called Hawthorne effect may be in 
fact a function of a perceived increase in status 
accompanied by a decrease in tension. These 

results add strong support to the findings of 
Vroom (1964) and Homans (1953) that status 
and satisfaction are positively related. In the 
present study, perceived increases in status 
were associated with increased satisfaction. 

Recognizing the interaction in the present 
study, it was found that low-status workers 
were more responsive on most dependent varia- 
bles to changes in their work environment than 
high-status workers, Fenchel, Monderer, and 

Hartley (1951) have reported a similar finding 

that individuals exhibited greater striving in 
groups where their status was low than in 
groups where their status was high. The re- 
sults appear to indicate that, at least under 
these conditions, status must be considered 
when the relationship between job design and 
job satisfaction is examined. 


The generalizability of experimental results 
from an § population such as the one used in 
the present study remains open to question. 
Certainly, this population is free of many of 
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the constraints surrounding more normal eer, 
groups, such as high production quotas I 
union restrictions. Further examination of 1 
usefulness for controlled studies of worker be- 
havior seems warranted. Industrial research ii 
a sheltered workshop environment can have 
several advantages. (a) The population 15 
worth considerable interest in its own right. 
(b) The situation can be controlled by * 
investigator more than is usual in tie: 
work situations. (c) It is a more realistic wor i 
environment than we find in experiment? 
studies using college students and artificia 
work tasks. Further, as the present results 
indicate, these workers seem to respond to 
many of the same kinds of factors that other 
work groups do. 2 

All of these results suggest that the affec i 
of job enlargement on organizational pet 
formance involve a much more complex oa 
bination of factors than has been previous A 
examined. While the number of replication” 
under each of the experimental conditions F 
the present study is a bare minimum, a 
results would, nevertheless, suggest that k ‘ 
factors examined are worthy of considerate 
in future investigations of job enlargement 
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IPLOY N IN THE 
7 M IMPACT OF EMPLOYEE PARTICIPATIO; 
LONG- TE DEVELOPMENT OF PAY INCENTIVE PLANS: 


A FIELD EXPERIMENT REVISITED ! 


KENNETH C. SCHEFLEN,* EDWARD E. LAWLER III,? axo J. RICHARD HACKMAN 


Yale University 


In an earlier study Lawler and Hackman examined the effects of worker participa- 
tion in the development of pay incentive plans. In the original study, three work 
groups developed their own incentive plans to reward high attendance, and identical 
plans were then imposed by company management in two other work groups. A 
significant increase in attendance was found during the first 16 wk. following imple- 
mentation of the plans only in the groups where the plans were participatively de- 

present study cover a 12-wk. period beginning 1 YT, 
after the original plans had been installed. After the data reported in the carlier 
study had been collected, the incentive plans were discontinued by company 


veloped. Data reported in the 


agement in two of the three 


company management. 


In a previous study, Lawler and Hackman 
(1969) examined the effects of employee par- 
ticipation in the development of pay incentive 
plans. The Ss were part-time unskilled workers 
who cleaned buildings in the evening. In the 
experiment, three autonomous work groups 
developed their own incentive plans to reward 
good job attendance (participative condition). 
These plans were then imposed by the company 
on two other work groups (imposed condition). 
There were two control groups: one group 
talked with the researchers about job attend- 
ance problems but received no treatment, and 
the other received no attention from the re- 
searchers. Attendance was monitored for 16 
wk. following the introduction of the new pay 
plans. A significant increase in attendance 
appeared only in the participative condition. 
Before the change, the average employee in the 
participative condition worked 88% of his 
Sagan hours; after the plan went into effect, 
scheduled hoes tees eet 98% oF i 

€ imposed condition, the 
comparable pretreatment figure was 83% and 

1 This article w. icats 
outgoing Editor, Dr, ioe p chblication ay thie 

* Now at the Base Cascade 


, 


participative groups. The present results show that 
attendance dropped below pretreatment levels in the: 


ance continued high in the third participative gr 
was found after 1 yr. in those groups where incentive 


man- 


se two groups, and that attend- 
oup. An increase in attendance 
plans had been imposed by 


the posttreatment figure also was 83%: 
changes were observed in the control pee 

It is possible that the increase in attendant 
in the participative condition was due solely o 
an initial burst of enthusiasm on the pei 
the workers in the participative groups a 
that the effects observed during the w 
postexperimental period were only tempore. 
Alternatively, it is possible that the impo ye 
plans were, in the long run, equally as effec y 
as the participative plans—but that it wor 
takes longer for an imposed plan to have e a G 
To examine these questions, follow-up aoe 
ance data were gathered from the orig! ay 
work groups approximately 1 yr, after the a 
plans were initially implemented. of 

In addition, the study examines the elech aI 
a naturally occurring reversal of the ge: 
experimental treatment, After the pay F ot 
had been in effect for 6 mo, in the case ° up? 
group and 11 mo. in the case of another arco 
company management unexpectedly a 
tinued the incentive plans in two of the 
participative groups. Workers in both 
groups had shown a significant inc in 
attendance when the new plans had peen nahe 
tuted. The present data show the effects Be 
attendance of the employees in these E ay 
when they reverted to the preexperimen™ 
arrangements. 
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58 59 60 61 62 63 64 


WEEKS SINCE BONUS STARTED 


Fic. 1. Attendance of one participat 
plan started. (Attendance is expresse 
scheduled to be worked that were a 


METHOD 


The reseg i 
a site was a small company that provides 
S were ack Bae services on a contract basis. The 
during fae = ime employees who cleaned buildings 
ranged in seta The Ss worked in groups which 
for doing esa 2 to 25 and which were responsible 
Work groups ald work in one building. Although the 
mous and there amila work, they were highly autono- 
employees in di was very little communication among 
The in different work groups. 
evels Pe i tended to have very low educational 
ranged in age f were members of minority groups. They 
Ployees fae ee 16 to over 70 yr. Although the em- 
Study are ve m hom data were collected in the present 
ry similar to those in the original study, 


about 6007 
60% of the original participants had left the com- 
his study had been 


Dany by i 
NR late data collection for tl d b 
all maintenance, high turnover rate is characteristic of 
higher in Ae employees in the company and was not 
this resear k experimental and control groups used r 
in greater Ne The research site and Ss are discusses 
Attends etail by Lawler and Hackman (1969). 
rom the aa data were collected for the present may 
Carlier chile p groups which had participated ne e 
Imposed on th wo of the groups had incentive P hos 
and the pla them by management in the earlier study, 
Were catia ans were still in effect when the present data 
an ee ee One group had participatively developed 
Still in Gn plan in the earlier study, and the plan was 
incenti ect. Two groups had participatively developed 
ive plans in the earlier study, but the plans 


ive group for a 12 
d in terms of the percentage of the hours 


ctually worked.) 


-wk. period 1 yr. after the 


subsequently had been discontinued by company 


management. 

A weekly percentage attendance figure was obtained 
for each group by dividing the number of hours 
scheduled to be worked each week into the number of 
hours actually worked. In addition, employees in three 
of the groups were given semistructured on-the-job 
interviews about their reactions to the incentive plans, 

interviewed regarding 


and company managers were 
their views about the way the plans had been introduced 


and how they were functioning. 


RESULTS 

ipative groups are pre- 
2, In the single partici- 
e pay plan was still in 
resent data collection, 


Results for the partic 
sented in Figures 1 and 
pative group in which th 
effect at the time of the p 
the average employee worked 93% of his 


scheduled hours (see Figure 1). This compares 
to 94% during the 16-wk. observation period 
immediately following the introduction of the 


lan, and 88% prior to the introduction of the 
plan. Thus, the plan appears still to be effective 


for this group. 

Data for the two participative groups for 
which the plans had been discontinued by 
management are shown in Figure 2. (Although 


the plans were discontinued at different times, 
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3 2 1 1 2 3 4 5 
BONUS IN EFFECT BONUS DISCONTINUED 
WEEKS 
Fic. 2. Mean attendance of two parti 
plans were ended and for the first 5 wk. 
pressed in terms of th 
ally worked.) 


cipative groups for the 9 wk. before the 
after they were ended. (Attendence is ex- 
€ percentage of hours scheduled to be worked that were actu- 


h 
5 » plans 
the data for the two groups have been com- groups was 92%; in the weeks after the Pi to | 
bined.) In the weeks immediately prior to dis- had been discontinued, attendance ous see is 
continuation, average attendance in these two an average of 82%. The 92% attendance 
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after the plans st 


hours to be work 


two imposed groups for a 1 


is expressed in terms of the 
y worked.) 


2-wk. period 1 yr. 
Percentage of the 
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comparable to the 94% average obtained for 
the participative groups in the original study. 
The drop from 92% to 82% is statistically sig- 
nificant (by median test, X? = 7.78, p < 01) 
Suggesting that discontinuation of the plan did 
decrease average employee attendance. 
e dogg the two imposed groups are pre- 
ented in Figure 3. During the 12-wk. obser- 
vation period, attendance in these groups 
averaged 87% compared to 83% for the 12- 
7 i period immediately preceding the instal- 
ation of the plan (by median test, x? = 6.00, 
Be oi Thus, attendance in the imposed 
sp ls somewhat higher 1 yr. after the 
fa W imposed than it was immediately 
owing their implementation. 


DISCUSSION 


It is significant that the attendance data 
ore still high in the participative groups 1 
a mr the plans were installed, despite 
alla that a majority of the workers in 
in ik groups had not themselves been involved 
a he development of the plans. It would 
ote that the norms for good attendance 
a seemed to have developed in the partici- 
S ae groups when the plans were introduced 
a effectively passed on to new workers as 
=a joined the work groups. In one group, for 
eea le, employees expressed feelings of own- 
of Ee of the plan despite the fact that most 

` te group members had not actually partici- 
a in the design of the original plan. These 

ata suggest that the initial success of the 
pave plans was not simply a function 
n Initial and temporary enthusiasm about 
ii opportunity to participate in the develop- 

Saa of a pay plan, and that groups seem to 
E norms that do not depend on & stable 

€mbership for their maintenance. 
i small increase in attendance observ 
Dose | groups in which the pay plans were = 
em i was not entirely unexpected. Although 
e oyees did not respond to these plans 

mediately after they were introduced, the 


rved 


C ‘ 4 saie i 

ey did administer the plan fairly and 

“nsistently throughout the year preceding the 
y the 


TN data collection. Experience with 
aN over this longer time period may finally 
ig convinced the workers that the plan was 
n the level, and that if they came to work 


kE 
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more regularly they would receive more money. 
Further, the workers in the imposed groups 
simply may have taken longer than their 
colleagues in the participative groups to learn 
the mechanics of how the plans worked. The 
net result, it appears, was that both the 
workers’ trust of management and their under- 
standing of the plan and its implications for the 
amount of pay they could earn increased 
slowly over time as did their attendance. It 
should be noted, however, that even after a 
year of slow improvement, the imposed groups 
had not reached the level of attendance 
attained and maintained by the participative 
groups. 

Probably the most compelling finding of the 
present study is the large and immediate drop 
in attendance when the pay plans were dis- 
continued in two of the three participative 
groups. The decision of company management 
to drop the two plans created a second manipu- 
lation in the study and, in effect, produced a 
natural field experiment. This second manipu- 
lation provides dramatic evidence for the 
effectiveness of the participative plans: drop- 
ping them reduced attendance just as substan- 
tially as installing them had increased it. 

The decision by company management to 
discontinue two of the plans came as a com- 
plete surprise to the researchers. Financial con- 
siderations probably did not play a role in the 
decision, since hourly rates were raised when 
the plan was discontinued. Indeed, discontinu- 
ance of the plans appeared to cost the company 
more money in increased hourly wages than it 
saved in unpaid bonuses. 

Further, the decision did not seem to hinge 
on misperceptions of company managers about 
the effectiveness of the plans. The researchers 
had shown the relevant managers the article 
written about the earlier study and most of 
these individuals accepted the fact that the 
icipative plans were working as intended. 
s with several managers responsible 
ans did suggest that 


part 
Interview: 
for administering the pl 
these managers felt little or no ownership of 
the plans or commitment to them. They had 
been told by top management to work with the 
researchers in instituting the experimental pro- 
gram, but they themselves had little to do with 
developing it. It is not surprising, therefore, 
that these managers felt little commitment to 
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the plans and that they were ready to discon- 
tinue them when the first opportunity pre- 
sented itself. 

Viewing the long-term impact of the experi- 
mental program on organizational effective- 
ness, it appears that the researchers’ failure to 
gain the commitment of the managers was a 
very significant one indeed. The two imposed 
pay plans persisted and even showed a small 
increase in effectiveness 1 yr. later. But two of 
the three plans in which employees participated 
to a significant degree were dropped. Thus, the 
net result is that the three participative plans 
were not as effective over the long term as were 
the imposed plans. The reason, it should be 
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emphasized, is not that employees respond 
better to imposed plans. The data suggest just 
the reverse. In summary, it appears that the 
present data support the view that long-lasting 
change can be introduced effectively in organi- 
zations only when the changes are accepted and 
owned by all levels—employees and managers 
alike—who are or will be involved in the new 
program. 
REFERENCE 
LAWLER, E. E., III, & Hackatan, J. R. Impact of em- 
ployee participation in the development of pay incen- 
tive plans: A field experiment. Journal of Applied 
Psychology, 1969, 53, 467-471. 


(Received December 22, 1969) 


j 


TG 


Journal of Applied 
vii P P: 
1971, Vol. 55, No. ACA 


TESTING PORTIONS OF 


THE PORTER AND LAWLER MODEL 


REGARDING THE MOTIVATIONAL ROLE OF PAY? 


JAY R. SCHUSTER, * BARBARA 


CLARK, axp MILES ROGERS 


System Development Corporation, Santa Monica, California 


Hypotheses regarding the motivational role of pay 
and Edward Lawler in their book, Manag 


tested. Two key hypotheses were confirme! 


performance factors influence pay, 
formance; (b) individuals who see pa: 
factorily. These hypotheses were con 
formance relationship is probably stro 
and Lawler in testing their 
our finding that effort functions 
model. 


Me poi Managerial Altitudes and Per- 
(1968) ies H man Porter and Edward Lawler 
Rade established a unique precedent for 
So ai performing psychological Te- 
indust pay. They have contributed to the 
E ea ie i capacity for designing 
total able reward system by testing, 1n a 
tal or a oe diverse private and governmen- 
upon te E 3 various hypotheses based 
reward ne theoretical model ofa performance- 
of E, pen This article describes our tests 
Motivati = hae hypotheses dealing with the 
oné eeivede f role of pay. They were made in 
of diver ate company rather than in a number 
Th se organizations. 

study rae was selected for the present 
Payor fae ae it seemed to be attempting to 
thee its employees on the basis of per- 
the larg nw ith the better performers receiving 
tive om increases (see Table 1). The objec- 
sis thre, test the Porter and Lawler hypothe- 
ment te h research conducted in an environ- 
exte at paid for performance to a greater 
nt than did the environments researched 


by Por 3 
a orter and Lawler. In regard to their en- 
—tonments, Porter and Lawler said: “for 

III of 
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the harder they 
y as a satisfier w 
firmed in an environment where the pay-per- 
nger than in the organizations used by Porter 


hypotheses. However, 
differently than in the Porter 


as set forth by Lyman Porter 
erial Attitudes and Performance, were 
: (a) the more employees believe that 
will work to improve their per- 
ill try to perform more satis- 


d 


this environment may account for 
and Lawler theoretical 


companies in our sample, pay is being dis- 
pensed at least as much for trying hard (i.e, 
effort) as for accomplishment (i.e., perform- 
ance) [p. 158].” 

We used Porter 
variables and the inte 
that they derived from t 
test the hypotheses (see 
ables, definitions, and interrel 
follows: 

Value of rew 
bility to the individual 
come of his behavior in the work situation. 

Perceived effort-reward probability: an indi- 
vidual’s expectations concerning the likelihood 
that given amounts of rewards depend upon 
given amounts of effort on his part. 

Effort: how hard the individual is trying. 

Abilities and traits: the relatively stable, 
long-term individual characteristics that repre- 
sent the individual’s currently developed power 


to perform. 


Role perceptions: t 
an individual feels he should expend 


in order to perform satisfactorily. 
Performance: the result of effort modified by 
individual abilities and traits and role per- 


ceptions. 
Rewards: 
trinsic outcomes resu 
performance. 
Perceived equitable rew 
of rewards the individua 


as a result of a given le 
feels s 


and Lawler’s terminology of 
rrelationships of variables 
he theoretical model to 

Figure 1). These vari- 
ationships are as 


„ard: attractiveness Or desira- 
of the potential out- 


he direction toward which 
his effort 


positively valued intrinsic or ex- 
Iting from satisfactory 


ards: level or amount 
] feels he should re- 
ceive vel of performance, 
or which an individual hould be attached 
to a particular job. 
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TABLE 1 


AcTUAL MONTHLY SALARY INcREASES GRANTED 
IN THE Survey Firm DURING CALENDAR 
YEAR 1969, By SUPERVISORY 
PERFORMANCE RATING 


% respondents receiving monthly 
i salary increases? 
Supervisory 
perform- fa 51 
ance rating _ $51- $101- | ne 
an = 
below | $100 | $159 | above 
High 4.0 35.1 47.0 13.9 
Middle 12.0 64.3 18.6 5.1 
Low 48.5 43.6 5.9 2.0 


a Performance ratings were determined by first having super- 
visors rank-order the employees reporting to them on the basis 
of performance or contribution to supervisory goals and ob- 
jectives. Supervisors then met to integrate the ranki 
upon cross-organizational employee contributions. 
assigned on the following basis: high, top 30% of fc 
population contacted (n = 202); middle, next 50 Yo of per- 
formers in population contacted (n = 393): low, bottom 20% 
of performers in population contacted (x = 101). 

bx? = 197.45, df = 6, p <.001, 


Satisfaction: the extent to which the rewards 
meet or exceed the perceived equitable reward 
level. 

Inlerrelationshi ps of variables, Testing their 
hypotheses regarding the nine variables in the 
model led Porter and Lawler to conclude that 
the more an individual values a reward and 
the more he sees his efforts as leading to this 


VALUE OF 
REWARD 


ABILITIES AND 
TRAITS 


PERCEIVED 
EFFORT— REWARD 
PROBABILITY 


ROLE 
PERCEPTIONS 


Fic. 1. The Porter and Lawl 
and E. E. Lawler II, Manager 


Richard D, Irwin, Incorporated.) 
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reward, the more effort he will expend. If Ha 
performer has accurate role percepnons, E 
expended effort will be cony erted into menni 
ful performance which will in turn obte d 
valued rewards for the individual. Performance 
also leads to satisfaction if it results in the 
amount of the valued rewards matching ae 
level of rewards considered by the me 
as equitable for his performance level. t A 
reward granted is less than the pare 
equitable reward expectation level, the P d 
former will be dissatisfied. Rewarding perforn 

ance commensurately reinforces perceptions n 
the probability that efforts are rewarded A 
are thus worthwhile, However, the value 0 A 
reward to the giver and a reward’s value as pin 
ceived by the recipient may differ. If there a 
difference, the probability in the recipien, Fs 
mind that his increased efforts result in ie 
creased rewards will lose credibility sgt ‘a 

recipient will not see an individually va sere 
reward-performance connection. When a get 
and recipient of a reward do view it he his 
Same way, the recipient’s satisfaction with ds. 
reward will affect the value of future EENE 
For example, such higher order needs nee 
teem, autonomy, etc., become more ee a 
the more an individual feels satisfied wae) er 
given reward level, The Porter and ee 
(1968) hypotheses retested in this study ‘ 


listed below, 


PERCEIVED 
EQUITABLE 
REWARDS 


INTRINSIC 
REWARDS 


er theoretical model (Re rinted f ) 
ial Altitudes and Perfo > t 


vmance, 1968. Copyrighted by 


] 
i 
| 
i 
\ 


“~~ 2 
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TABLE 2 
oa ae Survey SAMPLE BY PREDICTOR VARIABLE 
= Renae Questionnaires returned 
Predictor variable No, questionnaires 
distributed , 
No. % 
Base monthl 
y salary 
$1000 and below 245 208 84.8 
at to $1200 175 118 67.4 
1201 to $1500 250 166 66.4 
pea and above 130 78 60.0 
ties . . 800 570 m3 
3 classification | 
Faroe ie and systems analysts 414 305 73.7 
atten classifications 386 270 69.9 
Q Tanagerial, administrative, 
Sone ea: human factors, etc.) 
ate : 800 575 71.9 
a nance ratings as judged by supervisor 
Mas 237 178 75.1 
X iddle 400 291 72.8 
aa 163 105 64.4 
otal 800 574 71.8 


K i, . 
prove olhesis 4-A 3 “The higher the perceived 
bility that pay depends upon job per- 
will vind hacen the more effort an individual 
evote t Easa ae oat 
Cp. 627.” o performing his job effectively 
E r 
t S ypothesis 4-C. “The relationship between 
— aed probability that pay depends 
actu: pe performance factors and measures of 
for T performance and effort will be stronger 
Port: hose individuals who say their pay is im- 
Say p to them than it will be for those who 
aoe ee relatively unimportant to 
rs [p. 637” à 
ae ee 4-D. “The perceived probability 
highl bay depends upon effort will be more 
ance Y related to measured actual job perform- 
Prob; and effort than will be the perceived 
job ability that pay depends upon quality of 
Performance [p. 647.” 
h jetminology 
Y the terms base salary and base m 


onthly salary, b 


ceive r} 
from your position with the compan 


Predictor Variables 
b. grit is your current base monthly 
1 aoe your current job classification 
2 Gar Department Manager, Dep 
ther Supervisory 


3 
~ 7 Computer Programming 


Bae: 
Ypotheses numbers are the one: 


salary? $. 


artment M 


Naire 
» We mean your pay gx ompensation, 
z alone. The term compe”, n 
fon y—including base salary and fringe 


— 
? (Circle the numbe! 


s used by Porter and L 


Hypothesis 4-E. “The more an individual 
sees his pay as a satisfier, the more effort he 
will put forth to perform his job effectively 


[p. 66].” 


METHOD 


Subjects 
All 800 professional employees in a major division of 
the firm were mailed a survey questionnaire and asked 
to participate. The final sample consisted of 575 people, 
or 71.9% of those contacted. Table 2 shows this sample 
according to three predictor variables used in the 
hows that the sample is representative 


survey. It also show amp 
of the total population from which it was drawn. 


Questionnaire 

Some of the questions included were from the Porter 
and Lawler study, and some were of the present authors’ 
own design. An abstract of the questionnaire as appli- 


cable to this study appears below. 


of which are frequently used throughout the question- 
on the other hand, refers to the total income you re- 
benefits. 


oth 


r at the left of your classification). 


Tanager, Division Vice President 


awler (1968). 
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4 Engineering 
5 Human Factors 


6 Systems Analysis 
7 Other 


š N 
c. Performance rating as judged by supervision. 


3. Perceived Self-Ratings 


m ers in the 
. ; i ire i ine how you rate yourself relative to other: ch 
this portion of the questionnaire is to determine =e you a f ale whic 
peo be an abi A duties. You will be asked to rate yourself for characteristics on a seven-point sca 
comp: 
looks like this: 


(low) 1 2 3 4 5 6 7 (High) 


Please circle the number on the scale 
think you are low on the characteristic, 
than average as compared with others 
scale, circle only one number, 


that represents where you feel you 
» You would circle the numeral 1. If 
with similar duties, you w 


s you 
stand compared to others. = = 
you think that you are a P a 
ould circle the numeral 3, and so on. Foi 


a. Quality of your work on the job. j 
(ow) 1234567 (High) 
b. Your amount of productivity on the job. 
@ow) 12345 6 7 (High) 
c. Amount of effort you expend on the job, 
(ow) 1234567 (High) 


4. Perceived Pay Determiners 


= tails 
š - k an individus! 
Ics or qualities often used to determine the amount of an indi 

you feel thi 


A A Ean presen 
S company considers these for determining your p Á 
using the seven-point scale below each characteristic. bers 
a + ee A y num 
Circle the number on the scale that represents the importance of the characteristic being rated. Low nv 
represent less important or unimportant characteristics, If 


cter- 
you think this com ink 
rmining the base s 

o on. 
a. Quality of your work on the job. 


pany considers a given gan hin 
alary for your Position, you would circle numeral 1, If you 
important you would circle numeral 2, and si 

(unimportant) 12345 6 7 
b. Your amount of productivity on the job. 

(unimportant) 1 2 34567 (important) 


c. Amount of effort you expend on the job, 
(unimportant) 1 2 


4 5 


Strongly Agree Undecided Disagree Strongly 
Agree 


34567 (important) g 
. . . ivin 
5. In this section are statements designed to measure your attitude about the base monthly salary you are Ta 
for your Present job, Please indicate your agreement or disagreement on the scale below each statement. 
a. Perceived Importance of Pay 
My base sala: 
1 
| 

i Disagree à 
Perception of Pay as a Satisfier j 
(1) Raises have me 


ry level is important to me. 
2 
b. 
ant that I w 
1 2 3 


as Progressing in my work, 
4 5 


Strongly Agree Undecided Disagree Strongly 
Agree Disagree 
(2) The effort I expend determines the le 

1 2 4 


vel of my performance, 
5 


a 
Strongly Agree Undecided Disagree Strongly 
Agree Y 


Disagree 


, tioni pmf i 
; apts distribution of the questie fo% f 
rms a way that ratings could be associated wi shia, j 
were distributed, but not after they w 


" 
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3 i ii A 
(3) Raises I have received were rewards for good performance. 


1 2 3 4 5 


“ronigly Agree Undecided Disagree Strongly 
Agree Disagree 


(4) a my job, raises are a form of recognition for a job well done. 


2 3 4 5 


p Agree Undecided Disagree Strongly 
HETER Disagree 


Allitude Measures 


cet 
ae ore indicated ona 7-point scale (No. 1 on 
Portant”) thee unimportant” and No, 7 being “im- 
tors in det af perceived importance of each of nine fac- 
Measures Se ihe pay. Three factors were used as 
Although ee the importance of job performance. 
Lawler theg 3s high as those found by Porter and 
high and th intercorrelations among these items were 
ypotheses e questions were thus usable in testing the 
attributabl (see Table 3). Some differences may be 
included o i a the fact that while Porter and Lawler 
study aa ly managerial personnel in their survey, our 
Suggest = Lies ad nonmanagers. Still, the findings 
tions used na erable homogeneity among factors. Ques- 
Sure the q Bs Porter and Lawler were included to mea- 
-point ee to which pay was seen as a satisfier. A 
agree” aN was employed, No. 1 being “strongly 
among fee 5 ‘strongly disagree.” Intercorrelations 
item ho items were high, indicating considerable 
mogeneity (see Table 4). 


Hypotheses 


Portes performance factors play a major role in the 
tional role of awler hypotheses regarding the motiva- 
€ predicto pay, hypotheses were tested by computing 
as a distribu variable of supervisory performance rating 
e aene against the remaining variables from 
Square sj nnaire. The technique, along with the chi- 
i ignificance test, was used to determine both 


than would have been possible through Student’s ¢ 
scores as used by Porter and Lawler in testing their 


hypotheses. 
RESULTS 

Hypothesis 4-A 

Table 5 shows that the perceived probability 
of pay depending on two of the three job per- 
formance factors—work quality and produc- 
tivity—becomes greater at higher performance 
levels. A similar trend was obtained for effort, 
but it was not as strong and was not statisti- 
cally significant. Even though effort is not seen 
as leading directly to pay, Table 5 shows that 
the higher performers see themselves as ex- 
pending more effort than lower performers. In 
addition, Table 6 shows that nearly 70% of the 
population felt that effort determines perform- 
ance. (The slight difference that occurred on 
the basis of supervisory performance was not 
statistically significant.) Hypothesis 4-A is 
thus supported : As the perceived probability 
of pay depending on performance becomes 
greater, the perceived amount of effort ex- 
pended to perform effectively also becomes 


lrecti r 
copie strength of responses. It provided an 
inity to more closely study response patterns greater. 
TABLE 3 


INTERCORRELATIONS AMONG ITEMS DESI 


GNED TO MEASURE PERCEPTIONS OF THE 


IMPORTANCE OF JOB PERFORMANCE FACTORS IN DETERMINING Pay 
Productivity Effort expended 
Porter/Lawler Porter/Lawler 
Item A 
urvey i 
ouiyey Private firm eee 
Government industry Government i y 
sample sample sample sample 
pete 7 47 .66 .50 
of ara 80 a ‘ 
fectivity aa ‘65 ‘67 ‘61 
fiort expended 
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TABLE 4 
INTERCORRELATIONS AMONG ITEMS DESIGNE 


D TO MEASURE THE DEGREE TO 1 
Watch Pay Is SEEN as A SATISFIER 
Item 2 Í Item 3 
| Porter/Lawler | | Porter/Lawler 
Item | i 
Survey | | Survey | Private 
firm” | Govern- | Private firm | Govern- | P sty 
ment | industry ment ‘neu 
| sample | sample sample | sam 
| patent | 
1. Raises have meant that I was Progressing in my work, -62 -64 61 50 Ao i . 
2. Raises I have received were rewards for good performance, -64 -ól m 
3. In my job, raises are a form of recognition for a job well done, , 
TABLE 5 Hypothesis 4-C 
PERCENTAGE OF RESPONDENTS WHo RATED PER- P oe am- 
: a es , ne Sé 
FORMANCE Factors as Pay DETERMINERS To test the hy pothesis, va divided ti rtant 
VERSUS PERCENTAGE OF RESPONDENTS ple into those who rated their pay as impo nt 
Wuo RATED THEMSELVES on Errorr and those who rated their pay as sin cone 
Exrenpep, By SupErvisony Within these two groups, we then compar 
PERFORMANCE RATING Sie z : ions 
pay and performance-effort percepti l 
Supervisory Supervisory performance ratings. Tab : a 
performance | (1 & 2) (3, 4, 5) (6 & 7) shows that for those rating pay as ere, 
. . 4 ae a 1 
rating the higher the supervisory rating the med 4 
r the perceived level of performance and el b 
Work quality! F a A eant, W 
or those rating pay as unimportant, the 
Highs 39 32.6 63.5 higher the supervisory rating the en 
Middle 3.8 46.2 50.0 Perceived level of performance and €l ally 
zd 5 atieti Cay 
Low 5.8 47.6 46.6 How ever, the results are not only statist! p 
Insignifica n i ay as 
Brodien ; g ae nt for the group rating pé ? necte 
mportant, but the large number of exp that | 
High 4.5 38.8 56.7 cell frequencies of less than five indicates chi- 
ue a 52.4 43.1 Caution should be exercised in interpreting 
f 54.4 41.7 Square for this Population (see Table 8). ithe! 
Vi A e 
Effort With our data, we were unable to h we 
accept or reject Hypothesis 4-C, Althoug 
Highs 11.8 55 
h E 5.6 32.6 
Midales 12.5 58.8 28.7 TABLE 6 id 
e . 6 -r TE 
OW 14.4 57.7 27.9 PERCENTAGE op Resronpexrs Wro FeeL T 
Self-rati Errorr EXPENDED Determines PER- 
ell-ratings of effort expendedi FORMANCE, BY SUPERVISORY 
A PERFOR AN vi 
Higi ð MANCE RATING 
. 29.9 70.1 
Middle? 1.0 48.3 7 
Lowa 10 4 50.7 Effort expended ae 
. 51.9 47.1 Supervisory determines performan 
Note.—Scores in Parentheses performance 
rating scale used į aa i tomitmhersomtha 7001 ing" ee 
classified “mimon, ne forthe piire, Scores oft and 2 are Paling Disagree ae 2 
and “low" for f-rating ar three Performance factors, (4&5 (3) a 
aS portant” for thence tinea” Scores of ») 
aN" for the “self-rating aSk three performance. . 69.7 
ort expended,” High 16.9 13.5 68.0 
Middle 18.9 13.1 62.9 
Low 20.0 17.1 


e 5 
y, i the 
Note.—Scores in Parent} bers 0n 
À z hes umb 
b >85. rating scale used in the duester et ton 
4, b <.001. ay? = a naire. ai 
EK thee te? > 75, resp? 
Ns for the three Tatings were 178, 201, and 105 


PETATA > Fy 


m t 
PORTER AND LAWLER MODEL 193 
é TABLE 7 
eee RESPONDENTS Wuo RATED THEIR PERFORMANCE AND EFFORT VERSUS 
UPERVISORY PERFORMANCE RATINGS, By IMPORTANCE OF Pay 
Supervisory | Pay is important? Pay is unimportant® 
performance ‘is ma ] 
rating® Low Hi 
3, 4, 5) igh Low 5 High 
9 (3, 4, a s 
| aaa (6&7) (1 &2) 643 6&7) 
Work quality 
Higl 
Middle 0 20.2 79.8 0 50.0 50.0 
sane 0 30.4 69.6 0 43.3 56.7 
0 52.6 47.4 0 28.6 71.4 
a Productivity* 
Hi —_ 
Mie 0 27.7 72.3 0 75.0 25.0 
Dot e 0 42.3 57.7 3.4 63.3 33.3 
E 55.7 43.3 0 SRE 42.9 
es Effort’ 
Ha | x =] 
| ie 0 28.9 711 0 75.0 25.0 
mi mas R 0.8 46.5 52.7 3.4 63.3 33.3 
_——_—— 1.0 53.6 45.4 0 28.6 71.4 
Note, — EEE ii 
portant, Pa s important = those who indicated on the questionnaire that they ree” or “strongly agree" that pay is im- 
Sagree" th mportant = those who indicated on the questionnaire that they are "undecided," “disagree,” or "strongly 
Toras Ns fi important. Scores in parentheses refer to numbers on the 7-point rating scale used in the questionnaire. 
mportant” = 173, 260, and 97 for high, middle, and low, respectively ; for "Pay is unimportant,” 4, 30, and 


Dan 
„pay is important 
ay is important"; 


foun d 


Add rat strong relationship between perceived 


aie joy ae among those who said 

US to detern nportant, our data did not enable 

those who ae a trend or lack of trend for 
rated their pay as unimportant. 


Ly pothesis 4D 


ny “ ? shows that attitudes toward the per- 
a mine that pay depends on work 
job "has aze more closely to measured actual 
at s than the perceived probability 
tenden spends on effort. There was more of 
Quality on for people to see a pay-work 
came hehe ee as performance level be- 
apparent “tiny This tendency was much less 
he small ve the pay-effort relationship, ame 
Cally Sis rend that did occur was not statisti- 
Port nme We were thus unable to sup- 
Ypothesis 4-D with our data. 


nimportant" : x? = 75, =4 
nimportant" : x? = 4.08, df =4, 


Hypothesis 4-E 

We were able to support the hypothesis with 
one of three items designed to measure pay as 
a satisfier (p = -09). Table 10 shows that the 
more pay is seen as a satisfier, the more the 
amount of effort that is perceived as being 
expended. 

DISCUSSION 


The results of our tests of the Porter and 
Lawler hypotheses regarding the motivational 
role of pay are summarized below : 
supported by our research, 
inconclusive results, 
supported by 


Hypothesis 4-A : 
Hypothesis 4-C: 
Hypothesis 4-D : not 
research, 

Hypothesis 4-E: 


our 


supported by our research, 


We found that people will expend more effort 
the more they see pay as a satisher and that 
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TABLE 8 


EXPECTED CELL FREQUENCIES OBTAINED IN CosreuT- 
ING CHI-SQUARE—SUPERVISORY PERFORMANCE 
RATING VERSUS PERCEIVED PERFORMANCE 
LEVEL—RESPONDENTS WHO RATED 
Pay As UNIMPORTANT 


Expected cell frequencies 


Supervisory 
performance m m High 
: Low g 
rating REA (3, 4, 5) (6&7) 
Work quality 
] 
High — 1.66 | 2.34 
Middle — 12.44 | 17.56 
Low = 2.90 | 4.10 
Productivity 
High .10 2.54 1.37 
Middle 73 19.02 10.24 
Low 17 4.44 2.39 
Effort 
High -10 2.34 1.56 
Middle J3 17.56 11.70 
Low AT 410 | 2.73 
\ 
Note.—Scores in parentheses refer to numbers on the 7-point 
rating scale used in the questionnaire, 
a The abse: 


nce of data under th 
indicates that no cell frequenci 
these cells did not contribute to 


his column in “work quality” 


Therefore, 
are, 


es were observed, 
the value of chi-squ: 
performance (work quality 
is seen as a more important pay determiner 
than effort. We also found that nearly 70% of 
our survey participants felt that 


and Productivity) 


el. 
par at the amount indeed it could be dropped from the mo 
of effort expended is important in determining However, at this point, we still feel that it! 
) 
TABLE 9 


PERCENTAGE oF REsponpENts W 


HO RATED 
EFFORT ag P 
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THE PERCEIVED 


l 


their level of performance. It thus appears E 
although we confirmed two key hypotheses, p 
function of effort in our survey population 5 
somewhat different than in the theoretical 
model. In their research, Porter and Lawler 
found that the more an individual sees = 
efforts as leading to the attainment of a Ba 
(pay), the more effort he will expend. Beie 
sees his effort as leading to the desired rewal “4 
and then proceeds to exert effort which, if a i 
perceptions are accurate, will result in me 
ingful performance that will in turn lead to t 
desired reward of pay. her 
For our survey population, on the ot ` i 
hand, our results led us to conclude ha! 
effort is important primarily because it ji- 
believed to result in performance. The ret 
vidual first sees that his performance will eo 
him to the desired reward. Since he feels a 
effort expended leads to performance, he W a 
then exert effort which, if role perceptions ri 
accurate, will result in the performance jas 
will attain for him the desired reward of P 
Porter and Lawler suggested that in a ye is 
lation, such as the one we used, where enor le 
seen to lead to performance, it may be a 
to drop effort from the theoretical model. A 
said: “Unfortunately we did not directly 
the managers to estimate the probability 
effort on their part would result in good 


t 
x jes thê 
formance. If it were found in future studies hen 
this Probability was 


t 
always close to 1.0, 


D 
IMPORTANCE or Worx Quauity AN 
AY DETERMINERS, By Supervisoy PERFORMANCE RATING 
Jork ityb 
Supervisory Work quality Efforte r 
performance ii rant 
rating® important 8,45 Important | Uni Impor" 
5 5 nimportant ) 
(1 & 2) r59) (6 & 7) iga | G45 (is 
High 3.9 32.6 635 ~ 32.6 
Middle 3.8 16.2 sv oe 55.6 28.1 | 
y 5. | 4 z i 58.8 27. 
Low | 5.8 | 47.6 | 46.6 | 144 aa 4 f e 
— ag ee | - ely 
eses refer to numbers on the 7-poj = TA cn ————— nad vé 
Sunder high, middle, anq low, Tespecti sE for "work me : and Pan es and 108. respe“ 
bx? = 11.39, df = 4, p = 02, biad 
Pehl, df —4, p ES. 


a ae 
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TABLE 10 


p. i R 
PERCENTAGE OF RESPONDENTS WHO RATED PAY AS A SATISFIER VERSUS 
SELF-RATINGS OF EFFORT EXPENDED 


Measures of pay as a satisfier 


Self-ratings 1. Raises have meant that 2. Raises I have received 3. In my job raises are a 
of effort I was progressing in were rewards for good form of recognition for 
expended* my work.» performance.® a job well done.? 
Disagree | Agree | Disagree Agi Di A 
MS 3 ag 5 Agree isagree 2 gree 
aes | © |aen|@ey| © | aan] ees © jazz 
High | 
S 7 7) 15.7 15.7 68.5 14.9 19.4 65.7 19.2 | 20.5 | 603 
tage 5) 19.3 15.7 65.1 16.9 22.0 60.2 22.9 26.5 50.6 
(1 & 2) 29.2 29.2 41.7 25.0 29.2 45.8 33.3 25.0 41.7 
“pay asa 


be relevant i 


-point rating scale for “self-rating” and 5-point scale for 


es refer to numbers on the 7 
2; and 468, 83, and 24 for Item 3. 


463, 83, and 24 for Item 


p> 
p> As 


PERCEIVED 
EQUITABLE 
Pd REWAROS 


INTRINSIC 
REWARDS 


SATISFACTION 


PERCEIVED 
PERFORMANC 
REWARD PROB. (A) 


ROLE 
PERCEPTIONS 


NAL MODEL. 


vised. (Reprinted from L. W. 
nd Performance, 1968. Copy- 


REWARD PROBABILITY IN THE ORIGI! 


(A) PERCEIVED EFFORT-! 
Fic. 2. The Porter and Lawler theoretical model, re 

Porter and E. E. Lawler II, Managerial Altitudes a 

righted by Richard D. Irwin, Incorporated. 

n many situations [P 88].” We applies to our survey’ population is shown in 

F Figure 2. 


agri : d 
Bree with Porter and Lawler that effort 1S 
REFERENCE 


releva ‘ 

that = Aan our research does not in A 
sho 5 el. 

nstead uld be dropped from the mo! porrer, L. W., & LAWL 

ent inc as stated above, its funct 
our population than it is i 


retical mode 


dicate 

å +. differ- gR, E. E., II. Managerial atti- 
ion i ar = tudes and performance. Homewood, Ill.: Irwin, 1968. 
n the neo- , 

l The function of effort as it (Received December 22, 1969) 
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CONTINGENCY MODEL OF LEADERSHIP EFFECTIVENESS: 
: SOME EXPERIMENTAL RESULTS! 


GEORGE GRAEN? JAMES BURDEANE ORRIS, ano KENNETH M. ALVARES 
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Two parallel experimental studies were performed to replicate and further ceed 
the contingency model of leadership effectiveness. The two studies yielded nel t 
that were inconsistent with those predicted by the contingency model. These data 
along with other evidential studies cast doubt on the plausibility of the contingency 


model of leadership effectiveness. 


The contingency model of leadership effec- 
tiveness is a theory of leadership effectiveness 
proposed by Fiedler (1967) in his recent book 
that summarizes 15 yr. of research on the con- 
tribution of leadership style to group per- 
formance. This model assumes that the leader’s 
contribution to group performance depends on 
both the style of leadership (task oriented 
versus person oriented) and the favorableness 
of the situation for the leader. The empirical 
base from which this model was inferred in- 
cludes a rather broad sampling of actors, be- 
haviors, and contexts, ranging from infantry 
squads to high school basketball teams (see 
Fiedler, 1967). 

Table 1 presents an eight-cell classification 
of leadership situations. This classification was 
produced by dichotomizing on three aspects of 
the situation: group atmosphere, task struc- 
ture, and position power. The most important 

aspect of the situation is assumed to be group 
atmosphere, which is the leader’s evaluation of 
his members’ reactions toward him and his 
reaction toward the role of leader (Fiedler, 
1963). The second aspect, task structure, is the 
extent the leader is able to control the group’s 
activities based on the extent to which the 
task is programmable and the number of possi- 
ble solutions. The least important aspect, 
position power, is the amount of potential in- 
fluence given to the leader by virtue of his role 
occupancy (Fiedler, 1967). The resulting eight 
octants are hypothesized to represent a dimen- 
sion of decreasing situational favorableness, 

The leadership style dimension varies con- 
ceptually from a relationship or person orien- 

1 This article has been accepted for publication by 
the outgoing Editor, Dr. Kenneth E. Clark. 

? Requests for reprints should be sent 
Graen, Department of Psychology, 
Illinois, Urbana, Illinois 61801. 


to George 
University of 


tation to a task orientation. This dimension of 
leadership style has been operationalized by Ae 
instrument called the “esteem for the Leas 
Preferred Co-worker” (LPC). This instrument 
asks the respondent to think of the person wit? 
whom he can work least well and to rate a 
person on a number of bipolar adjective sc? me 
such as friendly-unfriendly, helpful-frustraln’y 
supportive-hostile, and quarrelsome-harmon e 
ous. The lower the leader’s esteem for his Lens, 
Preferred Co-worker, the more task orient" 
he is assumed to be, and the greater the L 
score, the greater the relationship orientas 
The dependent variable of the continga y 
model is the relationship between the ea 
style and his group’s performance. The pa k 
measure of covariation is the Spearman e 
order correlation, rho (McNemar, 1962). 
Twelve studies reported by Fiedler „ship 
yielded 63 relationships between leaders 
style and group performance that were yss ie 
inductively derive the predictions of the “ihe 
tingency model by a procedure of allowing ee 
median correlation for each octant salue 
Table 1) to represent the predicted «tions 
(Fiedler, 1967, p. 169), The point predict ond 
reveal that the more task-oriented leaders ip 
to be associated with more productive a tive 
in Octants 1, 2, 3, 8 and with less ProlUe ine 
groups in Octants 4 and 5, The opposit® ; 
case for the more person-oriented leaders: pol 
Point predictions from the model art ance 
very interpretable, however, since the va nou! 
of correlations within octants is often ¥? ppis 
an apparent central clustering of values | tbe 
variance within octants might be due 5 
fact that a wide variety of leadership al: i 
tions were used in developing the me h 
order to examine this hypothesis m g 08 
variance within octants is due to f° 


F 


specified j 
SE the model, two closely parallel 
follwwed a s peniaenits were designed which 
tions of os tigorously as possible the prescrip- 
in its Sorc model and the methdology used 
of the pi ies In a sense, this is a study 
two te Seren of the model in that the 
these sony should yield similar results. Also, 
igen ee should replicate the con- 
for i Na yield further data points 
model, ial support or nonsupport of the 
ca ite: toe mtr, day 
using the a n predicted curve generated by 
a rather a ts of the 12 antecedent studies, 
sis is also on ani less easily testable hypothe- 
tingency Ar by Fiedler from the con- 
states that eg This latter hypothesis merely 
be associat > s joa task-oriented leader will 
very ENOR with more productive groups in 
for the oe a and very unfavorable situations 
leader silkes and the relationship-oriented 
groups in i, associated with more productive 
favorability ig that are intermediate in 
169), y for the leader (Fiedler, 1967, P- 
M 
ae tens data of the two experiments 
oriented ee in a more performance- 
Configural pee Instead of looking at the 
cients, gro patterning of correlation | coef- 
as the de up ertaroan will be designated 
e musts ten ent variable and the factors of 
and | Meade co atmosphere, position power, 
Pendent yz style) will be treated as inde- 
determined . iables. In this way, it can be 
eon not only if the results replicate 
o; Citations: model in terms of the pattern 
actors of se but, more importantly, if the 
omance į e model significantly affect per- 
Finally n terms of mean differences. p 
ally, an attempt will be made to clarify 
bove As mentioned 
AT R with a low LPC score 1S 
nd a lea aan ina task-oriente 
o act A er with a high LPC score 15 assume 
amine this 1 person-oriented manner. To ex- 
of the ai hypothesis, a bipolar adjective scale 
eloped t La form as the LPC scale was de- 
Nat sho 7 SSS: the behaviors and attitudes 
aders m be exhibited by high and low LI £ 
te e amg this scale, the extent to which 
task ceri members perceive the leader as 
riented or person oriented can be deter- 


sh PD 


a 
fe] 
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TABLE 1 


MEDIAN CORRELATIONS FOR THE DEVELOPMENT 
STUDIES OF THE CONTINGENCY MODEL 
OF LEADERSHIP 


Situation classification EA 

| leader- 
be aa ship | Man. | N 

Group | Task | Po- style | rho 
atmo- | struc- | sition (orien- 
sphere | ture | power tation) 
| 

1 | Good | High | Strong | Task -52| 8 
2 Good | High | Weak | Task —.58| 3 
3 | Good | Low Strong | Task —.33 | 12 
4 Good | Low | Weak | People A7 | 10 
5 | Poor | High | Strong People 42| 6 
o Poor | High | Weak ar 0 
7 Poor | Low | Strong | People 05 | 12 
8 Poor | Low | Weak | Task -48| 12 


is the median correla- 
(LPC) measure of 
the value the 
ncluded in the 


Note.— 
tion between th 
leadership s 
model predic 
calculation of the median 


st Preferred C 
d group performance. Th 
is the number of relationship: 
correlation. 


In terms of convergent validity, the 
correlation between the results of this instru- 
ment with those of the leader’s LPC scale will 
tend to support or question the LPC score 


(Cronbach & Meehl, 1955). 


mined. 


METHOD 


for both studies were undergraduate 
students at the University of Illinois who were paid 
§1.35 for their participation. The first experiment 
(Exp. 1) used 78 Ss, and the second experiment (Exp. I1) 


used 96 Ss. 
Task. Fiedler operation 


Subjects. The Ss 


ally defines the degree of 
task structure by four dimensions developed by Shaw 
(Fiedler, 1963; Shaw, 1962). These four dimensions 
are (a) goal clarity, (b) decision verifiability, (c) solu- 
tion specificity, and (d) goal path specificity. Each of 
these four dimensions is rated on an 8-point scale. 
Moreover, Fiedler dichotomizes task structure by 
setting a cutting point of 5,0 on the mean rating over 


all four scales. 

Experiment 
and Task No. 
(1962) taxonomy 
ratings derived from the judgm 
(1962) for Task No. 23 is 5.86, and that for Task No. 
64 is 3.69. Task No. 23 (high structure) is an allocation 
lem involving the optimum assignment of five 
five machines, based upon the performance 
each machine. Task No. 64 (low 
n relations problem requiring the 
alternative recommendation from 


high task structure) 


64 (low task structure) from Shaw’s 
of experimental tasks. The mean 
nents reported in Shaw 


Lused Task No. 23 ( 


rob! 
operators to 
of the operator on 
structure) is a huma: 
selection of the best 
a list of five alternatives. 
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TABLE 2 


COMPARISON OF OBSERVED CORRELATION gas 
Exe. I anp Exp. IL wits THOSE PREDICTE! 
FROM THE CONTINGENCY MODEL 


Correlation (rho) 
Octant Observed Observed 
Predicted (Exp. I) (Exp. II) 
Ais =B 
? =A 18+ 
5 | AG es 
2 33 | =o% 
F 25 | =S% 
d 30 43 
7o 05 43 45 
8 | 8 —.33 ar 


® Sign opposite that predicted. 


Experiment II used Task No. 63 (high structure) 
and Task No. 65 (low structure) from Shaw’s (1962) 
taxonomy. The structure ratings were 3,45 and 3.60 
for the two tasks. Task No. 63 (high structure) was 
the Captain Watts detective problem involving the 
reconstruction of an incident that resulted in two 
deaths. Task No. 65 (low structure) was a human 
relations problem requiring the selection of the best al- 
ternative recommendation froma list of five alternatives, 


The criterion measure for each task in 
ment was a stand; 


bined the qualit: 
solution, 
Procedure. Both ex 
pendently, by differe 
the results w 


each experi- 
ardized performance index that com- 
y of the solution and the speed to 


periments were conducted inde- 

nt Es and at different times, and 

ere analyzed independently. However, the 

Procedure for both experiments was identical. 
At the beginning of each session, three 

duced and told that they would be working on problems 

as a three-man group and that one of them would be 


designated the leader for each problem. The Ss were 
told that it was a study to i 
perform on vario 


Ss were intro- 


and responsi- 
‘as then given the 
first problem, each 
the group atmos- 
l d of ten 8-point 
bipolar adjectives of the type used in the LPC. The 
S was asked to descri 


The group w; 
ompleting the 


he group 
adjectives, 
ader behavior 
of the following six 8- 
ssive-strict, requesting- 
i participating-managing, 
aring- task-controlling, The left- 
hand member of each pair refers to high LPC behavior 
and the right-hand member refers to low LPC behavior. 
The pairs were randomly positioned on the actual 


ng, using these bipolar 
Also, the members filled out the le 
questionnaire which consisted 
point bipolar adjectives: permi 
ordering, considerate-rude, 
passive-active, task-sh 
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5 at 
questionnaire.) The instrument was a ves pe 
a high score should be associated with high L 

nge Wels ; sroup. 
Then, ate second problem was given to wl jee 
After this problem was completed the por coins 
sphere and leader behavior uSSianneles 3 oa 
pleted again. Finally, Ss were debriefed anc Pad we 

Position power of the leader was aye strong 
cording to Fiedler’s (1963) checklis in ETE 
leader condition, the leader role was BIEDES A special 
formal status relative to the members, ee highest 
information about the task, and delegated t ue a the 
decision-making authority and responsibility. ot ai 
weak leader condition, the leader role was aon with 
cussion leader without special information PGs to 
decision-making authority and responsibility © poth 
that of the members. Each group worked oF ations 
tasks under only one of the two power ooh task 
(strong or weak position power of leader). ditions: 
sequence was randomized within power CO! made A 
The selection of leaders for each task was mä 
random. 


ResuLts 


r ed 
T etane 
The data for each experiment w a e the 
by assigning the groups to eight cells 


three 
; ind he 
contingency model by dichotomizing tł Ë 


tas! 
z p. r eee rer. 

Situational variables of position Pow sition 
structure, and group atmosphere. jichot- 


power was defined experimentally asa J 
omy (strong and weak power condition i 
task structure was defined by task m ally 
(structured and unstructured tasks)- . vid’ 
group atmosphere was dichotomized by scot 2S 
ing the leaders? total group atmosphere atmo 
at the median (good and poor group i the 
sphere as perceived by the leader a six 
experience). This procedure resulted a and 
Sroups in each of the first four octan ctant 
Seven groups in each of the last four ° 


tant 
for Exp. I, and eight groups in each oma 
for Exp. II, Within each octant, Sprulates 
rank-order correlations (rho) were CA oup 
between leaders 
performance, 


hip style (LPC) and 8 
The results of th 


i 
n 
rè show, 
ese analyses are $60 
Table 2. None of the observed co! 
for either 


o lev 
«ct study reach an acceptable 
statistical reliabili 


criterion of mean 
contingency mod: 
observed correlat 
dicted at least 

example, in discu 
leadership study, 
only one of the 


an 
ion 


of 
aren! 
ty. However, the app’ ib 
ingful relationships © ye 
el merely requires sais pf 
ions approximate f meet o 
in terms of direct! hirt 
ssing the results of a owe 
Fiedler (1967) states ae 
sixteen (sic) correla 


p 
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ame ly one of the eight correlations, 
direction Te. bbe: ‘not in the hypothesized 
iierprétatin, 119].” This is followed by an 
Hot Seis Ton of these correlations that were 
direction ae but were in the hypothesized 
Fiedler = a er in discussing the same results, 
ncludes: 
ine ajor significance of the church-leadership train- 
the first two pie oe cross-validation and extension of 
findings ABS HG Ra reported in this chapter. The 
but also for 7 IG d ba just for college students 
for mixed sex eann experienced leaders; they held 
tasks [p. 120). ips and over four different creative 
1 
aperar a of the correlations to 
sitet a ae direction is used as an 
the eee of meaningful results. 
need to per rationale for this is. that the 
overlappir a the sample into eight non- 
ee nS data sets to produce more homo- 
redüčes ae (Fiedler, 1967, pp- 84-85) 
sary be Be. Sample size to below that neces- 
even at peg acceptable statistical reliability 
This prot z correlation of moderate strength. 
experimental. of requiring large numbers of 
the unit oa hea is especially severe when 
individual, analysis is a group rather than an 
oy . 
Pan i the criterion of the correspon- 
tions to i direction of the observed correla- 
Median rh ee predicted (the sign of the 
that in Ja from Table 1), we find in Exp. I 
conditions. y ; two of the seven hypothesized 
Posite in oe the observed correlations Op- 
Was mad “7 W those predicted (no prediction 
criterion e for Cell 6). Based upon the sign 
model mi the advocates of the contingency 
of the oe claim support from the results 
e ap St experiment ; however, based upon 
toc statistical criterion, the con- 
Ported aona hypotheses cannot be sop 
Samples he hypothesis for each of the eight 
is on that the correlation in the population 
obsery raner be rejected based upon the 
Sis is pip correlations. Thus, the null hypothe- 
in T cepted for each of the eight samples. 
tions raba i not only are all of the correla- 
appear eh eeigar st but sizable discrepancies 
ations bane sae observed and predicted > el 
ing th . ae all seven predicted octants. Empo 
A agin of the sign of the predicted 
are ; ations, only two of the seven correlations 
in the hypothesized direction. Thus, the 


k 
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results of Exp. II are damaging both to the 
hypothesized predicted curve and to the 
hypothesized inverted-U relationship between 
the favorability of the situation for the leader 
and the correlation between leadership style 
(LPC) and group performance. 

To determine the contribution of the factors 
of the model to group performance, group 
performance was designated as the dependent 
variable and position power, group atmo- 
sphere, and leadership style (LPC) were 
treated as independent variables. Moreover, 
group performance was not assumed to be the 
same variable over the structured and un- 
structured tasks. Thus, group performance on 
each task was considered a different dependent 
variable. Employing an analysis of variance 
procedure, leadership style (LPC) was nested 
within both position power and group atmo- 
both position power and group atmo- 
sphere had two levels as defined above, and 
LPC was dichotomized by cutting at the 
median within each of the four position power 
group atmosphere combinations. This design 
was used to analyze each of the four task 
performances. The results of these four analy- 
ses of variance show that none of the effects 
approach an acceptable level of statistical 
reliability. Following the traditional criterion, 
the null hypothesis cannot be rejected based 
test results. The means of group 
performance are shown in Table 3. Even the 

ans between high and low 


differences in me 
LPC between the two studies disagree in six 


of the eight cells. 
Finally, the correlation between the leader’s 


LPC score and the mean members’ perception 
of the leader’s behavior was —.01 for Exp. I 
and .20 for Exp. II, neither of which was 


statistically significant. 


sphere; 


upon these 


DISCUSSION 


The two studies presented followed as 
closely as possible the prescriptions of the 
contingency model and research methodology 
used in its development. However, neither of 
the studies supported the model in terms of 
replicating the predicted curve. Experiment I 
showed typical contingency model results in 
terms of the sign of the predicted correlation 
between leadership style and group perfor- 


mance; however, Exp. II yielded five of seven 
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TABLE 3 
SCE AND C 3 JENCIES FOR 
bi NS ON Group PERFORMANCE AND CELL FREQUENCIES FOR 
ee THE Two PARALLEL STUDIES m 
Octant 
Sell ] ] N) 7 
i 1 2 3 | 4 5 6 | 7 8 
| | 
x | | 
| y 11.15 
Bight | 1208 1032 | 1159 | 10.19 14.16 Sot | 1203 Ha 
bos LPG 10.28 11.54 | 8.42 8.97 9.74 7.71 p 33 
Difference 1.76 =1.22 | 3.17 | 122 4.42 3 2.79 
| | 
| | | 
Exp. IL 29 
eo: LPC | 10.80 12.39 10.93 | 10.65, 9.40 923 | 9.92 re 
Low LPC 11.35 11.58 13.61 9.69 11.46 9.28 10.59 267 
Difference | —.55 81 | —268 | 96 —206 | —05 | =c67 
Frequency ” . | - , 
Exp. I 6 6 6 6 7 | 7 7 8 
Exp. II 8 8 8 | 8 8 8 8 
I 


Note.—In six of the octants, the diffe 
each study, the sign of the correlation 


correlations with signs opposite to what the 
contingency model predicts. Perhaps, however, 
We are too strict in our use of even this most 
lenient criterion. For example, in describing a 
study by Hunt (1967), Fiedler (1967) states 
that “Hunt’s results closely fit the ex- 
however, Hunt’s 
of .60 and —.g9 
for Octant 3 and correlations of .30 and 
Although Hunt’s corre- 
gns, the average (median) 


had any 
results of 
negative: none 
d any influence, 
an interaction, 
analyze the other 


this analysis were completely 
of the factors of the model ha 
either as a main effect or as 
It would be interesting to 
contingency model studies in this manner. It 
is possible that even in the cases where 
support is claimed for the Contingency model, 
the effect on performance could stil] 

reliable, especially when rank-or 
tions are used. After all, the dependent 
variable of interest is group Performance, 


nce between means shows reversed si 
Sagrees with that of the differ 


ants of 
a vo octant 
gns for the two parallel studies. In two 

ence between means. 


r 
not a rank-order correlation between leade 
LPC and performance. tudies 

It was also found that these two 5 wv 
did not yield similar results, either in moet 
the attempted replication of the og 
model or the mean performance Scores aa 
Octants. This is a rather damaging entä 
considering the similarity of the experime' he 
conditions, tasks, and Ss. Consequently, 
studies not only lend evidential dispro? at 
the contingency model, but also indies S an! 
it may not be summarizing meaningft 
stable relationships, 


any 
š “bute 2 
These Studies also failed to contribut 


ow 
Construct validity to the LPC score. twee? 
Correlation was found in both studies be’ n- 


the leader's LPC score and the group : are 
ers’ perception of the behaviors tha yl 
assumed to define these leadership (LPC) 
Perhaps this Measure of leadership styl¢ ilts of 
is the key to the many ambiguous re ntin 
studies attempting to replicate the “ware 
gency model, A recent review (Graen, AY the 
Orris, & Martella, 1970) summariz igen 
studies that have been relevant to the Je 
tial validation of the contingency erate 
Points out several conceptual and mewe in 
cal problems, The two studies presente sever 
Present article, jn conjunction with | oi 
other studies reviewed by Graen et al 
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E doubt on the plausibility of the con- 
ngency model of leadership effectiveness. 
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Í AND ALVARES STUDIES TESTING THE 
CONTINGENCY MODEL! 


FRED E. FIEDLER? 


University of Washington 


This note discusses several methodological sh 
Alvares studies of the contingency model. 
position power manipulation was probably i 
differences between supposedly structured 
sarily small, especially since other tasks wer 
of variance of leader Least Preferred Co-worker (L 


Graen, Orris, and Alvares (1971) presented 
two studies of the contingency model which 
tested the “hypothesis that the variance within 
octants is due to factors not specified in the 
model.” This presumably means that the cor- 
relation between Least Preferred Co-worker 
(LPC) and performance is not systematically 
related to the favorableness dimension. Since 
the authors do not specify what these other 
factors might be, the study is simply designed 
to test a null hypothesis, The authors found 
no significant relations supporting the con- 
tingency model and they concluded that the 
results constitute “a rather damaging result 
considering the similarity of the experimental 
conditions, tasks, and Ss.” 

As most of us kn 
yield nonsignifican 
reasons, 


ow all too well, a study can 
t results for any number of 
these, certainly, 
manipulations of important 
as inadequate experimental 
ims to disconfirm a 


in- 


at their stu 
he Prescriptions 


* Published out of turn at the request of the Editor. 
? Requests for reprints should be sent to the author. 
Organizational Research, Uni t 


versity of Washi 
Seattle, Washington 98105. Sento; 
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ortcomings in the Graen, Orris, and 
The critique points out that (a) the 
nefiective and at best very weak; (b) 
and unstructured tasks were unneces- 
e readily available; and (c) an analysis 
PC) scores yielded a highly 
the second study contained 
primarily low LPC leaders. 


to point out that these studies have methodo; 
logical weaknesses which make them 
adequate tests of the model. E T. 

I should like to leave aside at this oo 
extended discussion of the need for modi sore 
tions in the theory. The theory is, in fact, oh it 
general than Graen et al, suggest ee 
Postulates primarily a contingent and ee 
vilinear relationship, that is, that the “i er 
direction of the relationship between ee by 
LPC and group performance is moderate the 
the degree to which the situation gives up: 
leader control and influence over his gf? ue 
The interested reader is referred to a giana 
of the contingency model by the writer © g 
his associates (Mitchell, Biglan, Oncken 4 
Fiedler, 1970), a review of the valida ter- 
evidence (Fiedler, 1971), as well as a new i?) 
pretation of LPC S 


=p) and # 
t cores (Fiedler, 1970) a0 
review of select 


Cae 
ed studies of the contine la 
) n, Alvares, Orris, and Ma 
(1970). 


The present “note” 
logical critique of the 
tions and procedure 
independent variable: 
experiments. pet 

Position power, Graen et al. (1971) oP the 
mentally varied Position power by givi on 
randomly leader “superior t the 
al information o igp 5; 
ating to him] the :pility 
uthority and respons! ish“ 
ned later, was acca’ ent 
y to the leader in the PF 


do 
is primarily a nim a 
experimental man = jor 
s involving the 3 al 
s in the Graen © 


a 


task, [and deleg; 
decision making a 
This, as ascertaj 
by talking direct] 


eo Wr 


i 


Í g 


the Angra presenting written task in- 
boly o to the leader only, “maintaining 
leer orientation and eye contact with the 
is vi = giving an official timing device 
tion ge ee In the weak position power condi- 
i rhea pee status of the leader was not 
to the ae verbal instructions were addressed 
given A A, and written instructions were 
devie kne one in particular, and the timing 

A was placed in the center of the table. 
Met terse of reasons suggest that this power 
digisutt, to E is inadequate. First of all, it is 
baon in al that the particular E be- 
giving feet ooking the leader in the eye and 
device and sailer of an ad hoc group a timing 
powerful ai ritten instructions are sufficiently 
iaun uecipulasions to do all the things 
Power is a expected to accomplish. Position 
with son onceptuglized as providing the leader 
sanctions T real power to give rewards and 
Some fat . n other words, the leader must have 
difficult al over his members. It is very 
leader in a Eve high position power to a 
was don any laboratory situation. Where this 
complished hiding it was usually ac- 
ormal ne 3y using individuals who had some 
hus penom power outside the laboratory. 
compared elgian Navy study (Fiedler, 1966) 
a study Aen officers with recruit leaders; 
1965) oo i OTC cadets (Meuwese & Fiedler, 
(Skrzypek a study of West Point cadets 
eld higher 1969) used cadet officers who 
Over Pe a rank that their members.’ More- 
eir arti hg ped in an earlier version of 
appointed PO et al. demoted the leader 
Status in ihe the first task session to member 
Member ee second session and made another 
Second ec. the same group the leader in the 
likely Heng This procedure, obviously, is 

iie ilute the formal leadership powe". 
erceived et al. have obtained data on the 
ormane influence of the leader on group per 
E i These were based on members 
8s of the leader, Graen et al. furnished us 


` Vera +. . . 
age ratings based on the groups within the 


Partic 

are octant. These ratings are based on 
9 3.14 Scales. They range in Exp. 1 from 6.17 
“‘4 in the high position power condition, 
yielded correla- 
he con- 
28, .13, 


a 

. The Sk 

tions °F Skrzypek study, incidentally, 

tingene most identical to the predictions of t 
y model, namely, —.43, —-32, -10, PC: 

—.33 for Octants 1 to 8, respectively. 
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and from 4.83 to 3.24 in the low condition; 
the means for these conditions are 5.57 andl 
4.32, that is, a difference between means over 
octants of 1.25. 

In Exp. 2, the ranges for the strong position 
power condition are 6.13-3.50 (sic) for the 
strong condition and 5.38-4.75 in the weak 
position. The condition means were 5.29 for 
the strong, and 5.13 for the weak condition; a 
difference between means of .16 scale points. 
It is difficult to believe that the manipulation 
was successful in the first experiment; it 
strains credulity to believe that a mean dif- 
ference of .16 points can be considered evidence 
showing that the manipulation was successful 
in the second experiment. Moreover, @ ł test 
between the eight mean_perceived-influence 
ratings by members for the high condition and 
the eight means for the low condition yields a ¢ 
of 1.746 which is not statistically significant. 
When we now consider that each mean per 
octant is based on a distribution of six to eight 
groups, it is obvious that the overlap between 
groups in the high and the low conditions must 
have been substantial. In light of these con- 
siderations, the present author would submit 
that the position power manipulation in the 


Graen et al. experiments was clearly 
inadequate. 
Manipulation of task structure. Task structure 


scores range from a maximum of 8.0 to a mini- 


mum of 1.0. The studies reported by Fiedler 
(1967) average 7.39 for the structured and 
3.15 for the unstructured tasks. In contrast, 
the scores in the Graen et al. studies were 5.86 
and 5.45 for the structured tasks and 3.69 and 
3.60 for the unstructured tasks. Thus, the 
structured tasks were less than 1 scale point 
above the cutting score of 5.0, and the dif- 
ferences between structured and unstructured 
tasks were only 2.17 and 1.85 scale points. 
This clearly represents a very weak manipula- 
tion as compared with a difference of 4.24 in 
the original tasks. While it may be argued that 
the differences between structured and un- 
structured tasks in some of our studies were 
no larger, a study which seeks to disconfirm a 
theory should not rely on marginal experi- 
mental manipulations to test a null hypothesis. 

Distribution of leader LPC scores. Graen et 
al. assigned leaders at random within each 
of the three-man groups, and their LPC scores 
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were determined after constituting the groups. 
One likely outcome of this procedure is that 
the leader LPC scores will be unevenly dis- 
tributed over cells or octants. A sample of 
groups in which most LPC scores are either 
extremely high or extremely low is not likely 
to give results which are predicted by the model 
since the model assumes a roughly equal dis- 
tribution of means and variances of LPC scores 
over the octants (see Fiedler, 1966, 1967). 

Data recently furnished by Graen show 
the LPC scores within the various octants 
means which ranged from a high of 77.3 
low of 52.9 in the second study and from 
to 42.7 in the first study. A one-way ana’ 
of variance was performed for each stud 
the eight octants as cells in 
ratio for the second stud 
that there were no 
between means. However, 
first study was 3.10 (p<.0 
means of leader scores diffe 


that 
had 
toa 
84.7 
lysis 
y with 
the design. The F 


within octants, it 
uce the Possibility 
relations, i 
possible to insure 
independent vari- 
xperimental design. How- 


he major indepen- 
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dent variables in the Graen et al. studies are 
inadequate or borderline. Hence, these studies 
do not constitute critical or very meaningful 
tests of the contingency model. 
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A aer UNN) raised several selected issues 
esigned r the research methodology of studies 
Berton hypotheses associated with his 
We Pet Fama of leadership effectiveness. 
as well ass ike to discuss these selected issues 
SF oan several others which emerged during 
p a ne to understand the empirical 
_ ena supposedly underpinning this con- 
cussion’ ie in the hope that such a dis- 
ests Will be useful to others performing 
on this model, 


Re 
Search Strategy and H ‘ypotheses 


In 


the Pt nl to Fiedler’s interpretation of 


ions ney of our two studies, these investiga- 
Ypothesie designed to test an empirical 
alternati 's and were not null designs. The 
| Previous oy explanation for the results of 
°Y our st Pci studies that was tested 
ess of T les was that the apparent unorder- 
€ diver te results of such studies was due to 
Sate Sity of behavioral settings investi- 
S we stated in our previous article 
bers & Alvares, 1971), “the variance 
aparen, within octants is often without 
variance ii central clustering of values. This 
fact th within octants might be due to the 
tions y at a wide variety of leadership situa- 
fact vere used in developing the model.” The 
by the Pit the rejection of the hypothesis, that 
as a ults of contingency model studies taken 
Siversite could be explained entirely by the 
ao A bic cad settings studied, one 
ence in the validity of tha 

YPoth €monstrates that this is not a nu 
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aen, ets for reprints should be sent to 
is, ‘partment of Psychology, University 
| Thana, Illinois 61801. 


(G raen 


George 
i of 


205 


CONTINGENCY MODEL OF LEADERSHIP EFFECTIVENESS: 
SOME METHODOLOGICAL ISSUES 


ORRIS, anp KENNETH M. ALVARES 


University of Illinois at Urbana 


This paper discusses a number of methodological issues concerning empirical 
studies designed to test several hypotheses derived from the contingency model of 
leadership effectiveness proposed by Fiedler. These issues are discussed and a 
number of recommendations are suggested to enhance the information-producing 


value of future studies designed to test these hypotheses. 


Our approach to testing this hypothesis was 
to perform two parallel studies within the 
same behavioral setting to hold the setting 
effect constant. In this manner, to` the extent 
that the results of the two studies*agreed, we 
could support the reliability of the empirical 
procedures within this behavioral setting and 
lend strong support to the validity of the 
contingency model by decreasing the plaus- 
ibility of the diversity of settings explanation. 

As integral components of our attempt to 
test the diversity of settings explanation, two 
versions of the contingency model’s central 
hypothesis were stated and tested in our two 


studies: 

In addition to the strong, clearly testable hypothesis 

of the predicted curve generated by using the results 

of the twelve antecedent studies, a rather weak and 
testable hypothesis is also derived by 
Fiedler from the contingency model. This latter 
hypothesis merely states that the more task oriented 
leader will be associated with more productive groups 
in very favorable and very unfavorable situations for 
the leader, and the relationship oriented leader will 
be associated with more productive groups in situa- 
tions that are intermediate in favorability for the 


leader (Fiedler, 1967, P- 169). 


This latter hypothesis re 
vilinear relationship mentione 
as the “more general” hypothesis. 

Thus, our studies were designed to test Ri 
empirical hypothesis and not simply a a 
hypothesis. We do not agree, therefore, tha 
studies such as ours be judged according to iis 
different sets of criteria depending upon their 
as Fiedler suggests. It does not seem 
ly more “stringent methodo- 
logical requirements” to a study if its results 
are damaging to a model than if its results 
support that model, Although such a pro- 
cedure may suit the preferences of the theorist, 


less easily 


presents the cur- 
ioned by Fiedler 


results 
reasonable to app 
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TABLE 1 
CELL MEANS IN PERCEIVED INFLUENCE OF LEADER ON GROUP PERFORMANCE 
Octants 
Group —— n e 
1 2 3 4 | 5 6 7 8 
| 
Exp. I 
Position power _ 
‘Seas 5.83 6.17 5.14 5.14 486 
Weak 4.33 4.83 3.24 j 
| | 
Exp. IL | | 
Position power | | | 
Strong 5.75 5.63 6.13 3.50 4.75 
Weak 5.38 5.00 5.38 . 


Note.—p < .04 that the differences between strong and weak position pe 


we feel that it would be dy: 
our goal of understanding 
phenomenon of interest, 
example of the consequence 
this double standard, see 
tween Rosenthal (1968) and 


sfunctional for 
the empirical 
For an excellent 
s involved in using 
the exchange be- 
Barber and Silver 


patterns of nonsign 


ations, we 
would have had some b 


asis for hope. How- 
dissimilarity of the 


between the two 


` on 
Leader’s Position Power 


der’s formal p 
sk Structure, O 


esignated 
content and the 


cone. m+ 
ower means within like quadrants are rando. 


administration of the instructions. Tor ian 
ample, the Æ in the high power con¢ e 
related to the designated leader in both of 
and manner as the person in the posia oe 
primary responsibility for directing the gro ade 
activities. These procedures included: (4) i 3 
dressing all verbal instructions directly eat 
leader (maintaining appropriate body ie in 
tion and eye contact during interactions? 

Presence of the members, and (b) giona i 
written task instructions and the timing 

directly to only the leader, In this manne? 
*PPropriate roles for both the leader oe DY 
members were legitimatized in this sen 
E. In the weak Power condition, Æ alt J 
the designated leader as merely another De 


. v 
er of the group; verbal instructions 


v 
t 
e 
itt 

addressed to all 
e 


t 
e 
wil 
e three members, and vere 
Instructions and 


s sW 
t timing device simply 
placed in the cent 1 
l 


i er of the table. i 
Fiedler also questioned the ability of 
students to assu 


. i io 

s me functional roles inp" Í 
solving Sroups in laboratory situations. up F 
were the case, much of the small gro nk 
e 


0. 


w 
t 

I 

ple’ 


d 
he 
he 
jce 
he 
he 
by 
to 
m- 
re 
ep 
ge 
m 
this 
to 

(i 
Search, past and Present, would be brought i 
question (Davis, 1969), Moreover, t? 

wards and sanctions which emerge 

laboratory to Produce role acceptance 

documented (Rosenthal, 1966). n our 
In contrast to Fiedler’s procedure, A e% 
study we had a Somewhat indirect mear taf 
the leader’s Position power, After al 
was completed, the leader filled out a a 
naire which Contained a scale asking ae 

much influence he thought he had 0” 
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group's performance. The alternatives were on 
N ae ig ranging from 1, no say, to 8, 
fag word. The results on this scale are 

in Table 1. As shown in Table 1, seven 


of the ei er z 
iene eight differences in means between 
§ and weak position power within like 


ae ae i the expected direction. The 
8. Aloua ri in Exp. II between Cells 7 and 
orily ‘the ini this measure probably reflects not 
but. eden Ecol pee power of the leader, 
leader rel ADN and persuasiveness of the 
eader’s to that of his members and the 
of the Pa ception of the process and outcomes 
weak ee uP activaty, these results, though 
Aba eee in the right direction. 
to eile ea profits from our attempt 
induction F Bits impact of our leader’s power 
his proced mogi the above indirect measure, 
establish dees makes no attempt to empirically 
manipulati effectiveness of his position power 
n the g so from the leader’s viewpoint. 
o. P Cae Fiedler’s procedure consists 
the expat ee by the Æ based upon 
example à ra and role of the leader. For 
Study 162: the Meuwese and Fiedler (1967) 
Programs college seniors enrolled in ROTC 
groups oral ha formed into 54 three-member 
ank in a 1e member with the highest ROTC 
The hy: at was designated the leader. 
igh Bi hesis that this procedure produced 
tested eae power for these leaders was not 
Most rese Bang accepted as obvious. As 
Vious ex so ers have learned, intuitively ob- 
Proven Er ai have a way of being dis- 
his, we yY empirical evidence. Based upon 
though ing: that our power induction 
Sentative pris weak, probably was repre- 
Vlous cont some of those employed in pre- 
ontingency studies. 


i 
ask 5 tructure 


T 
mental a questioned our selection of experi- 
Ta as not being different enough. 
€ clearly in his book (Fiedler, 1967, p. 149) 
l a Y shows that the cutting point between 
Ou me high task structure is 5.0 scale points. 
or ca with scale points of 5.86 and 5.45 
Struc ur structure and 3.69 and 3.60 for low 
Doin, -C Were above and below this cutting 
bog,’ 'SPectively. Again, referring to Fiedler’s 
nteceg P: 134-141) and considering the 
ent results, 16 correlations as well as 
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two studies by Hawkins (1962) are shown 
based on studies with task structure scale 
points between 3.60 and 5.86. Thus, rejecting 
the above mentioned double standard, our 
tasks were rather representative of those em- 
ployed in previous contingency model studies. 


Distribution of Leader LPC Scores 


Fiedler criticized our procedure of randomly 
assigning Ss to conditions on the grounds that 
this procedure allows for the possibility of 
finding differences among the conditions on 
leader’s Least Preferred Co-worker (LPC). 
This criticism apparently is based upon a 
misunderstanding of the functions of random- 
ization within experimental design. Although 
this procedure allows for variation of leader 
LPC scores among the conditions of the design, 
the expected value of these differences is zero, 
and the standard deviation of these differences 
is a function of the standard error of the vari- 
able. Thus, the differences that occur among 
conditions are due to random sampling varia- 
tion. Using this procedure, the rare event can 
happen, for example, the occurrence of dif- 
ferences among condition means on leader’s 
LPC that produce an F value with p < .01 
(Exp. I). However, the more probable events 
would show differences among the condition 
ader LPC of close to zero, for 
example, differences among the condition 
means on leader’s LPC of close to zero, that is, 
differences among condition means on leader’s 
LPC that produce an F value with p = .33 
(Exp. II). In addition, the difference between 
the overall means on leader’s LPC of the two 
studies produced an F value with p = .89. 

Fiedler apparently misinterpreted the 
plications of this significant F ratio for the 
differences in leader LPC scores among the 
eight cells of Exp. I. Fiedler states that this 
indicates that “the means of leader scores 
differ markedly from octant to octant.”” More- 
over, he states that this would reduce the 
variance of LPC scores within octants and thus 
reduce the possibility of obtaining significant 
correlations. 9 S 

The appropriate interpretation of this signi- 
ficant F ratio is that some linear comparison 


of the eight cell means on LPC shows statistical 
reliability. It does not indicate that the means 
differ markedly from octant to octant. This is 


means on le 


im- 


D 
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TABLE 2 


CELL MEANS AND STANDARD DEVIATIONS ON LEADER 
LPC AND LEADER Group ATMOSPHERE 


Leader LPC Leader GA 

Exp. & N - 

octant h” x | sp x SD 
| | 
es) as | ae | 1345 | 135.2 | 398 
2 6 76.7 15.12 | 129.8 | 3.85 
3 6 66.9 | 27.53 | 146.5 6.55 
4 6 42.7 19.91 137.8 5.37 
5 7 84.7 19.34 | 106.7 7.50 
6 7 48.4 19.92 107.7 14.26 
7 7 69.8 12.34 119.4 12.88 
8 7 69.7 17.41 123.0 5.81 
Exp. II 

t 8 61.8 19.23 146.4 6.74 
2 8 52.9 19.63 136.1 6.19 
3 8 66.5 29.59 142.3 9.16 
4 8 71.8 14.83 | 135.0 4.33 
5 8 56.9 | 17.29 | 1256 13.57 
6 8 77.3 16.42 114.6 9.00 
7 8 69.5 18.95 105.4 29.24 
8 8 62.3 15.77 115.5 12.27 


demonstrated by the finding that testing all 
pairwise differences between means in Exp. I 
using the Scheffé procedure (Winer, 1962) 
showed no significant com 
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Group Aimosphere 


In an earlier version of his note, were 
objected to our dividing group apa 
(GA) at the median on the grounds that ei 
differences between good and poor GA pro f 
ably were too small. Moreover, he Suo na 
that we take groups from the upper and low 3 
third of the GA distribution to assure a are 
difference between cells. Fortunately, igo 
performed an analysis on the results o an 
two studies, which clearly demonstrates ia 
uselessness of taking successively ae 
upper and lower fractions of the GA distri 


; A ively 
tion and in the same process success! 


reducing the sample sizes within célls. Tet 
shows the correlations within octants GA 
progressively smaller fractions of the to 
distribution for each study. Contrary ja. 
Fiedler’s prediction, the patterns of “ae 
tion do not become more supportive of tell 
contingency model. In fact, these results the 
us more about the increased variance in or- 
sampling distribution of the rank-order © y 
relation coefficient: based upon page 
smaller sample sizes than about the influen¢ 


. p] . . . rrela tions- 
parisons (p < .05), the GA distribution on these cor rela hors 
Moreover, as shown in Table 2, the variances Based upon these results, the present au size. 
a LPC within octants are not noticeably Suggest that the loss of power (sample e 
re Fi i i J 
uced as Fiedler suggests. Produced by this Procedure is not compe? 
TABLE 3 
Comparison or THE OBSERVED CORRELATIONS FROM E 
D Exp. I ay EXP 4 PHOSE 
PREDICTED BY THE CONTINGENCY MODEL Usinc ee S 
LARGER SPLITS oy Group ATMOSPHERE f 
ip: GA group Octant pad 
relations r; 7 
Highest] Lowest 1 2 3 4 al. =. 
6 7 
Predicted =e! fl mo 
Exp. I > = = te GH] o5 
6 6 47 | ~.44 
6 3 i 46 33 
: 5 Ai | =i 46 36 rs ae 2 
' 4 63 | —32 40 | —"39 f S 20 
Exp. II a ~40 o 
8 8 =i 18 0. 
8 x 02 | ~og | _ 
7 e | et) all aa | a =| = 
; i 19 -31 | — 093 29 | 7, B A 
; à -10 60 W e A l f 
S |. ae 95 | ~ 89 D io 
Note.—Italicized s si si 
* Based upon tha contingere 988 eppo his sign sioredicted, 


è Number of signs opposite that predicted per GA yreould 


be positive, 


a n E 
x ee 

= a 

TÁ a 
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by i R c 
y increased orderliness in the pattern of 
Correlations. 


Recom mendations 


in oe the following procedures be used 

oti es oe to test the contingency 

Tea predictions regarding the relationship 

mn the leaders LPC and his group’s 

ime within the boundary conditions 

az po by group atmosphere, task structure, 
sition power: 


ste Nisi power of the leader should be 
` proposed nee only by a priori judgment of the 
tion oft see but also by the percep- 
power pi eader regarding his influence and/or 
Dien, Ta group process and output. 
iat a this latter procedure normally has 
vious a og (at least not reported) in pre- 
dhe i aa ies, if followed, it would produce 
Poveete ap to the hypothesis that the leader s 
tended } f power corresponded to that in- 
Conger by the investigator. We should take a 
quirin oe attitude toward inductions by re- 
anise f nat treatment checks demonstrate 
2 or our intended inductions. 
Ta “a acceptable task taxonomy such as 
(1967) p by Hackman, Jones, and McGrath 
(1963) in substituted for the one by Shaw 
ratings : this way, a taxonomy based upon 
would k i the dimensions of group outputs 
ens ace one based upon ratings of the 
efficient te of task instructions. It is in- 
by the l; vs perform studies using tasks defined 
the oie procedure when tasks defined by 
is now pee ane available. Moreover, Hackman 
nonlab extending his excellent taxonomy to 
@Doratory tasks. 
act Leadership style should be monitored as 
me = indexed by LPC and various other 
asures, The lack of agreement between the 
i ership style perceived by group members 
a that indexed by LPC in the studies by 
oe and his associates demonstrates the need 
ise mhg the leader’s behavior. Before we 
ead nts variance in group performance to the 
Eler style, we should show data that the 
m a a behavior was as assumed. This 
onitoring of the leader’s behavior should help 
O dispel the m ystery surrounding the concept 
Of the LPC. J 3 
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4. Group atmosphere should be manipulated 
and monitored as part of the design and not 
merely measured after the fact. Measuring GA 
after the fact may confound it with other 
group outcomes such as the leader’s perceptions 
of the group task processes and outcomes 
(group atmosphere is as perceived by the 
leader). Thus, the manipulation of GA would 
allow unconfounded interpretation of the con- 
tributions of this factor. 

5. The dependent variable of a rank-order 
correlation between LPC and group perform- 
ance should not be accepted as meaningful 
unless an appropriate statistical test shows it 
to be statistically reliable at an acceptable 
level. The consequences in the case of the con- 
tingency model research of not employing this 
traditional convention is shown in the review 
of these studies (Graen, Alvares, Orris, & 


Martella, 1970). 
6. In addition to the usual analysis of 


correlations within octants, an appropriate 
analysis of variance design should be used to 
test the significance and show the strength of 
association of each source of variation on raw 
group performance. Moreover, a priori con- 
tracts on the means produced by this analysis 


should be tested (Winer, 1962). 
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REGRESSION ANALYSIS OF THREE VARIATIONS OF THE 
TWO-FACTOR THEORY OF JOB SATISFACTION ! 


CHARLES L. HULIN? 


University of Ilinois at Urbana 


AND 


L. K. WATERS 


Ohio University 


Three distinct variations of the two-factor theory were tested by reanalyzing data 
previously published by Waters and Waters. No support for any of these three fre- 
quently assumed versions was found. The results of this and several prior studies 
were explained by assuming the existence of two main effects: The first is that in- 
trinsic job factors are generally more potent for predicting overall job attitudes, and 
the second is that overall job satisfaction is more predictable than overall job dis- 
satisfaction. Neither effect requires the assumptions of the discontinuity of satis- 


fi 
intrinsic and extrinsic job factors. 


ae theory (Herzberg, 1966; 
ghneratad Mausner & Snyderman, 1959) has 
Since its ; an impressive number of studies 
Fie appearance over 10 yr. ago. In spite 
in the of studies which have appeared 
that ioe an yr., King (1970) has pointed out 
tWo-fact ower than five different versions of the 
aumea theory have either been implicitly 
designin e explicitly stated by researchers in 
argued R their studies. Further, King has 
factor th mo of the five versions of the two- 
Since a cony have not been adequately tested 
than ep Support may reflect little more 
Ive eos hod variance, The existence of these 
data in represents a serious problem since 

cori ich may support one variation of the 

Th. may not support a different variation. 
iden ro len: versions of the two-factor theory 
Under cor and discussed by King (1970) are 
Versions nsideration in this article. These three 

Ty are summarized briefly below. 

th oe . This version of the theory states 

ined aration in all intrinsic variables com- 
Satisfa, Contributes more to variation 1n job 

kana than it does to variation 1n job 
Variabl, action, and variation in all extrinsic 
ion ie combined contributes more to varia- 
Job Sg Job dissatisfaction than to variation In 
Satisfaction. This is the version assumed by 
"sett and Winslow (1967) for their review 

as Taber and 
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tion and dissatisfaction or the assumptions of dimensional differences between 


of the Dunnette, Campbell, and Hakel (1967) 
study. 

Theory 2. This theory predicts that variation 
in all intrinsic variables combined contributes 
more to variation in job satisfaction than does 

variation in all extrinsic variables combined, 
and variation in all extrinsic variables com- 
bined contributes more to variation in job 
dissatisfaction than does variation in all 
intrinsic variables combined. Herzberg (1966) 
implied that Theory 2 is the basic theory since 


he states: 
Once again, ... the divergent trends for motivators 


and hygiene factors serve to verify the basic theory as 


these factors are involved in positive and negative job 


attitude sequences [p. 101]. 

Theory 3. This theory states that each 
intrinsic variable contributes more to job 
satisfaction than it does to job dissatisfaction, 
and each extrinsic variable contributes more 
to job dissatisfaction than it does to job 


satisfaction. Theory 3 is a stronger version of 
Theory 1. Support for Theory 3 implies 

r ; On one occasion 
support for Theory I | eee 3 
Herzberg (1966) implied that Theory 3 was the 


basic two-factor theory since he stated: 


The chart shows that of the 51 significant differences 
reported for the six motivator factors, every one as in 
the predicted direction. For the 57 significant hygiene 
factors, 54 were in the predicted direction. In sum, then 
the predictions from the theory were wrong 1n less than 
3 percent of the cases [p. 125]. 


These three versions of the two-factor theory 
may be summarized as follows: Let ai, bi, Ci, 
and d; represent either correlations between 
job factors and overall attitudes or frequencies 
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of reporting critical incidents, gssoriated with 
either positive or negative job chit weenie 
That is, if a; represents the correlation between 
job factor i and overall satisfaction, tien b: 
represents the correlation between the same 
factor and overall dissatisfaction. Likew ise, if 
&; represents the frequency of reporting factor 
i being associated with overall satisfaction, 
then b; represents the frequency of reporting 
that the same factor Was associated with over- 
all dissatisfaction. 


Overall 


Overall 
satisfaction (y,) dissatisfaction ( Ya) 
a by 
Intrinsic ay ba 
factors i 3 
an bn 
C1 dı 
Extrinsic Ce dy 
factors i : 
Cn dn 


n n 
Theory 1 states that 2 ite È b; and 
i i= 


” ” 
E aS E gT the case of a correlational 
izi i1 


study, Theory 1 would predict th 

correlation based on the intrin 

would be greater than the corresponding 

multiple correlation based on b; (R 

byba++6bn yo) Theory 1 would 
sn SS R 


at the multiple 
sic factors, ti, 


PLR Cn ygs 
n 
2 predicts that SS 
fed) Got, n ri 
the case of a Critical incident Study, and 
Se ann aenn in the case of a 
corre ation study, Theory 2 also predicts that 
È di> È bi in the case of 
= iz 
study and didada yg 
Case of a Correlation 
heory 3 Predicts 


Theory 


a Critical incident 
> Rano. 
al study. 


that ai > b; for a 
ees di > c; for all di and c; Note agai 

Cory 3 isa stronger versio biiy PE 
e e re n of Theory land 
Theory 1, 


King (1970) conclud 
Pa 


dnu in the 


appropriate test of 


Results section for some Caveats, 
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itical inci studies and 
been supported by critical incident u 
subject-coded studies but has not been a 
adequately in studies where the determina 


| 
i 
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Pleat aie saya: ob 
of satisfaction and dissatisfaction were | 


tained by means other than aair a 
number of Studies apparently testing a 
tions of Theory 1 were discarded by or 
and earlier by Hulin and Smith (1967) ee 
these discarded studies measured overall er 
faction and dissatisfaction on a single eae 
rather than following the assumptions Q. tion 
two-factor theory and assuming satisfact 7 
and dissatisfaction were on two separate ith 
distinct continua. King, and Hulin and Smin 
contend that if a theory denies the eena 
a single dimension of overall satisfaction it it, 
Satisfaction and makes no predictions saver a 
it is not appropriate to test the theory m, 
single satisfaction/dissatisfaction pomt zin 
Theory 2 has not been tested adequate ies 
studies other than critical incident stu 
according to King (1970), sisca] inci 
Theory 3 is Supported only by critica ears 
dent studies, Support for this theory oT ues 
to reflect method variance. King (1970) 067) 
that only one study (Hulin & Smith, otho! 
adequately tests this theory using & me his 
ifferent from critical incident studies: 
study reports data which fail to suppe. a 
predictions of Theory 3, Recently, Wam e 
Waters (1969) reported data which also , and 
to support Theory 3, Unfortunately, ee test 
ith di gather data age a 
and 2 (according to nee e 
Waters and Waters analyzed their data 
Theory 3 but not Theories 1 and 2. PEPS, 
aS article reports a reanalysis 2% 
Interpretation of the Waters and Waters vg, tHe 
data to test Theories 1 and 2. Specifics’ t | ob 
measuring satisfaction a of 
Index ODI] scales and k J jo 
ere used to assess satis utip? 
b factors. Part and MË enr 
ween these factors one 
lisfaction and over alle 
n were obtained in order to X acl? 
the linear association between the job 
and overall job attitudes, 


ree 
969) 


METHOD er 
PNN uP” y 
0 non? di 


were 16 oflic® 


regiona. 
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eo ee company. The employees ranged in 
least high scl late teens to early 60s and all were at 
all Se aca graduates. Immediate supervisors of 
A job aa sampled were also females. 
small pees pe questionnaire was administered to 
a single aUa I employees by the second author during 
their individ e day. Respondents were assured that 
e com a responses would not be made known to 
were tens th Names were requested but employees 
they “felt he option of not responding to that item if 
17% did n pncomiortable” doing so. Approximately 
title, de ied give their names. Other information (job 
Probable aia ty etc.) obtained on the questionnaire 
respondent, uld have been sufficient to identify the 
ne attitude scales were presented in booklet 
issatisfaction oo of separate overall satisfaction and 
randomized) n scales (always the first two scales, order 
Scale, the ren an overall satisfaction/dissatisfaction 
actors (arra € scales of the JDI, and a list of 11 job 
a DE e in alphabetical order) to be rated on a 
tiomain ee eion scale. Ratings of satisfac- 
actors) pa - action (both overall and for specific job 
Separate ere made on a 12-point anchored scale and the 
“point s eon and dissatisfaction ratings on 
points of ti es which consisted of the appropriate 6 
Plus a Sace 12-point satisfaction/dissatisfaction scale 
Satisfied) A alternative (not satisfied or not dis- 
ata arë z Tieg data were gathered, but only the above 
Waters see in this article. See Waters and 
969) for details of these other data. 


T RESULTS 
ests of Theory 3 


Med explication of the five versions of the 
eee the distinctions between them 
test The 1S not clear, however, how one would 
and Woe ee of the theories. Waters 
(1967) Fca (1969) and Hulin and Smith 
deductio 3 example, attempted to test me 
Correlation of Theory 3 by comparing the 
With i between measures of satisfaction 
or one a Job factors and overall satisfaction 
teen dissatisfaction. Such a procedure 
betwee confounding caused by the aera 
i ig the measures of satisfaction with $ e 
tween at job factors. Partial correlations 3 
Actors measures of satisfaction with ae 
Partia] and overall satisfaction, or stand’ 
with; i cong weights relating omega 
DE! ay a factors to overall satisfaction miki 
acou priate methods which would h 
Ested j a the covariance terms. Theory 3 wa 
l in this article by means of the regression 
oh of the overall measures on the set 2 
tee variables. The standard pania 
sion weights indicating the relationstip 
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TABLE 1 


STANDARD PARTIAL REGRESSION WEIGHTS RELATING 
INTRINSIC AND EXTRINSIC JOB VARIABLES TO 
OVERALL JOB SATISFACTION AND OVERALL 
Jos DISSATISFACTION: LIKERT SCALES 


Overall | Overall 
Variables satisfac- | dissatis- 
tion faction 
Intrinsic variables 
Responsibility .08 .08 
Work itself Eeg .19 
Growth and advancement =11 04 
Recognition —.03 —.13 
Achievement .26 13 
Extrinsic variables : 
Competent supervision 18 23 
Considerate supervision 07 07 
Company policies and practices 18 09 
Co-workers —.03 —.10 
Working conditions .06 4 
Unclassified variables 
Salary 16 02 
R? 43 30 


Note.—N = 160. 


between each of the 11 job variables measured 
by Likert scales on the one hand and overall 
job satisfaction and overall job dissatisfac- 
tion on the other are presented in Table 1. 
The 11 job variables are grouped so that the 
intrinsic variables appear first, followed by 
the extrinsic variables, and finally by the 
variable “salary” which is unclassified in this 
table. According to Theory 3 each intrinsic 
variable should contribute more to overall 
satisfaction than to overall dissatisfaction. 
Each extrinsic variable should contribute more 
to overall dissatisfaction than to overall 
satisfaction. Thus, according to Theory 3, 
the first five weights in Column 1 of Table 1 
should each be larger than the corresponding 
entries in Column 2 of Table 1. Only the 
weights assigned to recognition and achieve 


mentare in line with the prediction of Theory 3.4 


eee 

Jt seems unlikely that the lack of predictability 
could be due to an inadequate sampling of job factors 
to use as predictor variables. T ‘he five JDI scales were 
chosen so as to be consistent with the important factors 
of job satisfaction which have been found in previous 
studies. The 11 job variables measured by the Likert 
scales in this study appear to do a more than ade- 


quate job of sampling. 
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TABLE 2 


STANDARD PARTIAL REGRESSION WEIGHTS RELATING 
Intrinsic AND Extrinsic JOB VARIABLES TO 
OVERALL JOB SATISFACTION AND OVERALL 
Jos DISSATISFACTION: JOB 
DESCRIPTION INDEX 


Overall Overall 
Jariables : dissatis- 
Variables satisfaction Bebon 
intrinsic variables P 
Work 
Promotions —.02 .03 
Extrinsic variables T 
Supervision -15 .19 
Co-workers .05 —.07 
Unclassified variables 
Pay 415 42 
R? Ad 25 
Note.—: 


Also, according to Theory 3 the 
each of the extrinsic variables shou 
in Column 2 than they are in Colu 
two of the five, competent supervision and 
working conditions, meet this criterion, 

A parallel analysis of the relationship 
between the job variables measured by the 
JDI and overall satisfaction and 
faction is presented in Table 2 
predict that the first two w 
should be larger than the co 
in Column 2, One, work sat 
criteria, Each of the weights for the extrinsic 
variables should be larger in Column 2 than 


it is in Column 1. One of the two, supervision, 
meets this criterion, 


No tests of significa 
bet 


weights for 
ld be larger 
mn 1. Again, 


dissatis- 
- Theory 3 would 
eights in Column 1 
rresponding weights 
isfaction, meets this 


uacance of the differences 
ween the regression weights were computed, 
redictions of Theory 3 


» and only 50% of the 
t in Table 2, A theory 


of differe a 
data support the results i ead 
Hulin and Smith (1967) 
Waters (1969) whi 


À ch were 
correlational analyses, 


and Waters and 
based on simple 


Tests of Theory 2 


„Theory 2 is somewhat more dificult to test 
directly than Theory 3, Ideally, we would like 
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to obtain the regression of each epena oy 
variable on the 11 job variables (as was € ved 
in Tables 1 and 2) and convert the eR, 
partial regression weights associated with al 
set of independent variables (intrinsic i of 
extrinsic) into estimates of the ee 
variance of the dependent variable accoun i 
for by each set. Unfortunately, this can E 
be done in the case where the paorur d 
between the independent variables are cae 
Such is not the case with these data. ‘he 
alternative strategies were followed. PS a 
regressions of overall satisfaction and te 
satisfaction on the intrinsic variables mwi 
the extrinsic variables were computed. a 
squared multiple correlations which ett 
from these four analyses, using the ee 
scales as the independent variable, are ae 
in Table 3. The results of the same ont 
using the JDI scales as the Sapo 
variables are given in Table 4. It shou ise 
noted that the Likert scale measuring vast 
faction with salary and the JDI scale i“ 
ing pay satisfaction were considered analyses: 
extrinsic variables in all subsequent ana’) 
This is in keeping with Herzberg’s on 
classification, Separate analyses ann 
dropping both pay (salary) scales he 
he results were not changed apprecia “ial 
According to Theory 2 the intrinsic eet 
should account for a greater proportion 
variance of Overall satisfaction ae i 
extrinsic variables do. The results Br cot} 
Tables 3 and 4 Support this hypothesis. jable 
2 also predicts that the extrinsic Va" 
should account for a greater proportion o 
variance of overall dissatisfaction than í ral 
Intrinsic variables. The results given 10 
3 and 4 do not Support this prediction 


ina 
gi li 


TABLE 3 aul 
PROPORTIONS op VARIANCE (R?) OF OVON 
SATISFACTION AND Overatt DIssATISFAC 
ACCOUNTED FOR BY INTRINSIC ANP 
Extrinsic VARIABLES CONSIDERED 
SEPARATELY: Likert SCALES 


Variable Overall 


satisfaction 


Intrinsic 


34 
Extrinsic 


.29 


Note.—N = 160, 


| 
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a TABLE 4 
OPO: h 
n n VARIANCE (R?) OF OVERALL SATIS- 
aR WI VERALL DISSATISFACTION ACCOUNTED 
NTRINSIC AND EXTRINSIC VARIABLES 
CONSIDERED SEPARATELY: JOB 
DESCRIPTION INDEX SCALES 


Variable Overall Overall 
satisfaction | dissatisfaction 
Tikini 39 21 
SSROSIG 35 AS 


Note.—v = 160, 
. two 
are ie predictions made by Theory 2 

fils not a r Tables 3 and 4. 
analyses sun a eee clear, however, that the 
Appropriate are in Tables 3 and 4 are 
and extrinsic n these analyses, the intrinsic 
compete Sr en were not allowed to 
the tos | same pool of variance since 
intrinsic an S g overall job attitudes on the 
tained sey ed extrinsic job factors were ob- 
cinsi e oe It is clear that there is 
or by the fe erlap in the variance accounted 
example, th sets of independent variables. 
of all of the squared multiple correlation 
Satisfaction ~~ variables predicting overall 
multiple pa i 3, yet the sum of the squared 
ables predi Ey ations of the two sets of vazi 
alternative ano overall satisfaction is 63. An 
Proportion eyes would be to obtain the 
action and a variance of the overall satis- 
uniquely pia eat measures which are 
Our disposal) en the independent variables at 
Pendent i y controlled by each set of inde- 
Values, th ariables. In order to obtain these 
Predictor i multiple correlations between all 
Measure de combined and each overall 
tions p ee obtained. The multiple correla- 
tate oe each subset and predictor 
rately = Riese and extrinsic) taken sepa- 
su es obtained next. These values are 
Of the d to compute the amount of variance 
or by oe variable uniquely accounted 

Y the intrinsic or extrinsic variables. For 


Exar 

mple E 2c i 

Vari Ple, the difference between R? using all 
action and R? 


=e pro 


n w predict overall satisf : 
Ortion the : trinsic variables is the P! 
Uniquely ; eS of overall satisfaction 
ables Th accounted for by the intrinsic vart- 
- These proportions of variance are given 
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TABLE 5 


PROPORTIONS OF VARIANCE OF OVERALL SATISFACTION 
AND DISSATISFACTION UNIQUELY CONTROLLED 
py INTRINSIC AND EXTRINSIC VARIABLES: 
Likert SCALES 


Variable Overall ; Overall 
satisfaction | dissatisfaction 

Intrinsic At 08 

Extrinsic .09 07 


Note.—N = 160. 


in Table 5 (for the Likert scales) and Table 6 
(for the JDI scales). 

Theory 2 would predict that the proportion 
of variance of overall satisfaction uniquely 


accounted for by the intrinsic variables would 


be greater than the variance of overall satis- 
ariables. 


faction accounted for by the ext rinsic v: 
This prediction is supported both in Table 5 
and Table 6. Theory 2 would also predict that 
the proportion of variance of overall dis- 
satisfaction uniquely accounted for by ex- 
trinsic variables would be greater than that 
accounted for by intrinsic variables. This pre- 
diction is supported in neither Table 5 nor 
Table 6. Two of the four predictions made by 
Theory 2 are supported in Tables 5 and 6. 
Thus in Tables 3, 4, 5, and 6 only one-half of 
the predictions made by Theory 2 are sup- 
ported. A random model would do as well. 


Tests of Theory 1 

Theory 1 predicts that intrinsic variables 
a greater proportion of the vari- 
Il satisfaction than they do of 
overall dissatisfaction. Likewise, extrinsic vari- 
ables should account for a greater proportion 
of the variance of overall dissatisfaction than 


account for 
ance of overa 


TABLE 6 
VARIANCE OF OVERALL SATISFACTION 
UNIQUELY CONTROLLED 
EXTRINSIC VARIABLES: 
DEX SCALES 


PROPORTIONS OF 
AND DISSATISFACTION 
py INTRINSIC AND 

Jos DESCRIPTION Ix 


MEE Overall Overall 
Vanwee satisfaction dissatisfaction 
Intrinsic -09 10 
Extrinsic 05 04 


Note.—N = 160. 
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they do of overall satisfaction. Tables 3, 4, 5, 
and 6 present the data necessary to test 
Theory 1. Three of the eight predictions made 
by Theory 1 are supported. Four of the eight 
would be chance. It did not seem necessary to 
compute significance levels. 


Discussion 


The data reported in this article would seem 
to meet all of the criteria discussed by King 
(1970) for testing three of the two-factor 
theories. That is, overall satisfaction and dis- 
satisfaction were assessed 
a wide range of individual 
sidered; the 


ory. Fewer 
e three ver- 
ere supported, 
made more than 


i analyzed in this 
article are noteworthy, aside from the specific 
tests of the theories, All of the data in Tables 
3-6, with the exception of one reversal (i.e 

Row 1, Table 6), can be accounted for by 
assuming the existence of two main effects 
The first main effect is that intrinsic variables 
are generally and account for 
in workers? 


ulin and 
aters and Waters (1969), and 


‘he second main effect is that 


ese two main 
€ Proportion of t 
overall satisfaction being 
intrinsic characteristics, a small Proportion gi 
the variance of overall] dissatisfaction ot 
counted for by extrinsic variables with the 
other two cells being intermediate in a 


(see Tables 3-6), We need assume nothing 


i 
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about the continuity or discontinuity of overall 

responses to the job. Neither do we need assume ` 
anything about the qualitative differences 
between intrinsic and extrinsic variables. Two 
simple main effects, one due to the relative 
potency of the intrinsic and extrinsic variables 
and one due to psychometric characteristics 
are sufficient to account for these data. It 
appears that a two-factor theory may account 
for much of the data generated by Herzberg $ 
theory. It does not appear to be the two-factor 
theory which was originally proposed. . 

It should be noted that only three versions 
of the two-factor theory were investigated K. 
this study, However, King (1970) has opni 
cluded that Theory 4 and Theory 5, being 
Stronger versions of the first three versioni l 
have received no support in the literatu! a 
Thus, regardless of which particular vers 
of the two-factor theories one assumed it 
“the theory,” little support can be found fo! 
in the world of data. 

Finally, we should end on a note of cau tes 
The problem of multicollinearity complic® 1 
the interpretation of Tables 1 and 2. Gor 7 
(1968) has demonstrated the theoretical en 
logical absurdity of using multiple maser 
of the Same construct in a regression ana r 
Similar Problems of interpretation are ces 
when a number of linearly related van ie 
are included in the same regression analy" 
oe ace B weights associated with aro) 

Sq cement, recognition, and co-W itive 
hese variables have p° the 
the criteria, indicates | 1! 
’ Which exists among t° nes? 
further indicates that rall 
© redundant for predicting ° fore 
Ln al responses to the job. Ther’, all 

ables 1 and 2 should be approached wr 5 
the problem of unstab insi? 
sic variable and one €X thi 
cluded in the analyse „pis 
removed, Unfortunately a 
would not adequately + 


ition 


Problem would be 
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EXPECTANCIES AS DETERMINANTS OF PERFORMANCE 


ABRAHAM K. KORMAN? 


New York University 


This research reports five studies which all su 
expectancies of competence by others are pi 
of the studies involved laboratory research 
peers while the other three involved field res 
ations where the expectancies were those 
subordinate performance. Intelligence an 
moderators of the obtained relationship. 


There is a growing body of research literature 
which is supportive of the general hypothesis 
that performance is, in part, a function of the 
expectancies which others have of one’s com- 
petency and ability. That is, to the extent that 
one is in a social, interpersonal environment in 
which others believe him to be competent and 
able and communicate such expectancies and 
beliefs by words and action, then, to that ex- 
tent, performance is facilitated. These findings 
are, then, generally consistent with a general 
theoretical model which argues that perform- 
ance is a function of one’s self-perceived compe- 
tence for the task at hand, with such self- 
perceived competence being, in turn, a function 
of one’s previous experience with the task and 
the social definitions and reality which others 
provide for use concerning our competence for 
the task (Korman, 1970). 
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pport the general proposition that high 
ositively related to performance. Two 
where the expectancies emanated from 
earch in different types of working situ- 
of organizational leadership vis-à-vis 
d environmental variables were not 


; ride degree 
proposed hypothesis may have a wide deg P 
of generality. 


Laboratory Studies 


‘or body 
One possible problem with the major e 
of previous research has been that it a ority 
primarily concerned with “leader waar 
figure” expectancies of subordinate Ph has 
ance, that is, for the most part, the eee 
concerned itself with the effects of high-s he 
individuals’ expectancies as influences a ate 
Performance of low-status subordinates- ; an 
sider, for example, the studies of eee In 
Hall (1966) and Stedry and Kay os that 
both of these field studies it was fount func 
performance increased as a direct positive their 
tion of the level of expectation which bo 
superiors in the organization had. Thus, f theni 
cases, the more their superiors thought 0t nanc? 
that is, the higher their level of perform" the 


expectation as this was communicate forn 
subordinates, the higher the level of pe! im 
ance. While such 


\ eð 
research findings en ope? 

A f themselves, it leav uld be 
the question of whether such results pa he 
obtained when the individuals hav! fined 


: aoe , define’ 
expectancies did not have a formally "ice, 
Superior role t 


Portance in and o 


highly signi 
tern in many 


test the gener; 


ig C 
ality of the hypothesis < 
tuted the purpose i ig 
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Pose of the second exp 
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EXPECTANCIES AS DETERMINANTS OF PERFORMANCE 29 
The parz ; Y 
if “> AE involved in this case was to see TABLE 2 
ropos f ; 
differe ee hy pothesis held up for those EFFECTS OF PEER EXPECTANCIES ON PERFORMANCE 
that hiel LAEV els of intelligence, the logic being FOR DIFFERING LEVELS OF 
Neis te 1 intelligence individuals might be less SUBJECT INTELLIGENCE 
in o accept such influence than those lower ] 
s variable. High performance|Low performance 
si expectancy* expectancy? 

Method Task t 

A EI , x sp | X | SD 
Uline ee Ss in both experiments consisted of 
ductory ps y fulfilling course requirements in intro- Total group 
various bss The actual sample sizes for the 

eatments are indicated in Table 1 

Pro F AA s 1 7.85 5 ee 
Ss a I. The procedure consisted of asking 2 ea an 7 a 
studied p i to various creative tasks which were being 3 6.93 34 5 14 22 3st 
project Bd university psychologists as part of a general 4 4.97 21 426 17 2.05* 
tests See AE to develop new types of intelligence a < i a 
Each grou would not have the traditional weaknesses. a 
pice also told that the same tasks had been High intelligence 
these ped taken by other college students and that 
they aoe had been asked afterward to indicate what 1 7.37 24 6.03 2i. |) 267m 
ance for e He to be an appropriate level of perform- 2 1035 | 43 | 938 | 25 1.66* 
then list th ege students on these tasks. The Æ would 3 6.78 3.3 5:23 25) 2.11* 
with thes eee performance expectancies for each task, 4 5.26 2.3 4.53 1.8 1.42 
the pea differing for the high-expectancy group and , 
chosen Rial acne group. The actual levels used were Medium intelligence 
with allege ace of previous research with these tasks 
man students. The four tasks were (a) “how z a 
many i can you think of for a brick?” (b) “How 1 gzs | 28 | S7 14 | 4.16% 
“How a can you describe by the word tgweet’?” (c) 3 mos z on 2.5 3.22%" 
tourists any different ways can you think of to attract 4 473 19 387 H fee 
diferent i the United States?” and (d) “How many ah 3 ` $ 
implications can you think of if a Federal law Note.—All tests are one-tailed tests. 

fote So, 27, and 33, respectively. for total group, high 


ducation for 
for total group, high 


Were passed providi 
tonne E for a free college e 
Pr oR 
wean Exp. II. The procedure foll 
wise. i same as in Exp. I, except tha 
Il out the Wonderlic Personnel Test before 


respondi 
ing to the creative tasks. 


intelligence, and medium intelligence 
bns = 58, 34, and 24, respectively, 
medium intelligence. 


yi z: intelligence, and 
owed here was e 108: 


t Ss were also tp <.01. 


Results 
The results are indicated in Table 1 and 2 
TABLE 1 and provide, in general, overall strong support 
Ereecrs or PEER EXPECTANCIES for the hypothesis. All results in Exp- I are in 
ON PERFORMANCE the direction predicted with two of four being 
and all overall results in Exp. II are 
he direction predicted. However, 
n to decrease 


significant, 
significant in t 


High Low 
performance erformance . . . d 
Task P! the higher intelligence does seem ~. a 
a Rea aperat i the effects of the expectancy manipulation 
slightly, only three of the four being significant 
effect, how- 


4s SD. x SD in this case (one-tailed tests). The 
even for 


; 7.14 2.9 6.28 2.2 1.33 ever, does seem to be a powerful one, 
3 9.86 3.4 9.05 3.3 .97 this group. 
à 6.77 2.4 4.83 2.0 3.04** 
5.50 2.0 | 4.57 1.5 | 18 Field Studies 
Note—All tests are one-tailed tests- In the research to be reported here the 
bas ro general intention was to ascertain whether the 
hypothesis that “the higher the level of compe- 
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tence expected of employees, the a ii 
resulting work motivation and work për 2 2 
ance” would be supported in different type: 
actual work contexts. 


Method 


i f relatively 
ubjects. Study A consisted of a group o à y 
lle ote shies lasdilice eters 
f vas of interes A t 
E a is that all of the people he 
had civil service tenure, an employment condition tha 
has been argued by some to lead to less responsiveness 
to job-related stimuli and less concern with job condi- 
tions. If this were so, one might predict that there 
should be a lessening of a relationship between leader- 
ship and other environmental variables and job motiva- 
tion-performance. j : 
Study B consisted of high-level technical employees 
in a data-processing center at a large university. All of 
the individuals involved here had at least one degree 
and a considerable number had completed either mas- 
ters or doctoral study. Of considerable interest here was 
to see whether individuals of such 


relatively high abili- 
ties were susceptible to social influence effects of this 
nature in the “real world,” as opposed to the laboratory 
findings of Study 2. 


p of clerical employees in 
a branch of a large bank. This office was composed 


of high school graduates, and it was 
the opinion of the branch head that most had little pro- 


a very real sense this g 


, items were 
general level of the 


atch you all the time 


are incapable of 
organization,” 
motivation included such a; 
would be more motivated to 
job is very small”; () « 
myself to go to work”; 
motivation, compared 
would place myself in t 

Identical forms of thi 
Studies A and C and a sl 
B. 


u are in the 

self-reported work 
the likelihood that I 
f I were on a different 
€ time I have to force 
n terms of actual work 


to the People around here, I 
heile» i 


is questionn; 
lightly diffe: 


s (a) “ 
work i 


aire were utilized in 
rent version in Study 
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gi ed in each 

2. The questionnaires were a aa Kaeh 
case by people who were part of i pa 4 ans 
project itself was identified asa university D ie 

3. Performance evaluations were inane Aini 
vant superiors for Studies A and B but D eae say 
able in Study C. Hence, the criterion for ae oe 
was self-reported work motivation as pens onan 
questionnaire items. Some evidence for ot correlated 
validity of this latter measure is that it SRI A 
-93 with performance evaluations for th 
sample. sual 
“tL. The analysis of the data in all cases gene ne 
dividing the criterion variable into those e ein 
“excellent” and “average” and then — ihe items 
differences for each of these groups on each feshin si 
measuring the degree to which the ls high expec- 
managerial system was perceived as having ets were 
tations of competency and performance. ae the cri- 
then computed in each study to test whet ihe Sere 
terion groups had consistently different perce} 
leadership and managerial expectations. 


Results 


ror 
The results were as predicted in all cases. Po 
Study A, the “high” group (» = 9) it 0 
higher than the low group (n = 22) oe study 
the 14 items with one tie (p < .05); for 1 
B the comparable figures were 9 at 
< .05) (n of “highs” = 6 and n OE as was 
= 22); while the “high” group (7 = all 14 
higher than the “low” group (z = 16) on 
items in Study E 


Discussion 


j hat 
The general findings of these studies per” 
expectations from others are related ne ex’ 
formance, regardless of the source of vive! 
pectancy or the characteristics of the resting 
Pectancy, has a number of inte! 
al and practical implications. , ation® 
he implicati is ality 
seem clear in that th at 


theoretic 


d 
Je 
j 1 eop 
authority figures are available, or just w ve 
of low intelligence and/or previous ae cam 
ment. Apparently, we use whatever ‘= , a 
authority figure Or not, intelligent oF a 


eee as well as through the medium of 
setor p minene directly. This opens the 
motivatio greater variety of ways by which the 
a to work may be increased by 
zauional leadership. 
ee question which arises here is that while 
ment _— mechanism for work improve- 
siete, not be disputed, our theoretical 
e aeled a might be. For example, it might 
Cies ak we pai the influence of peer expectan- 
uence aod have studied them here has an in- 
, the as, en self-conceptions but, rather, on 
hat is he it goals which one is striving for. 
the level yee the expectancies, the higher 
higher fin ee goal-setting and the 
ing, pitti eee This type of theoriz- 
Locke “seer a exemplified in the work of 
Model of y ), suggests a much more rational 
advanced TE than the interpretation we have 
Owever ere. Our preference is for the latter, 
Count be ih the ground that it is able to ac- 
rational a me the positive findings of the 
| Which ha pproach and for those research results 
o leoi negative implications for this type 
is by l T he way in which we can do this 
ernally oe between externally and in- 
Tate et goals. 
ee the latter case first, we would argue 
Boals wou ii nd to achieve high self-set 
Since oe be a linear function of self-esteem 
With high achievement would be consistent 
People of self-esteem. On the other hand, 
Soals shou i, self-esteem who have high self-set 
These pi P maa be motivated to achieve them. 
Vious am ictions have been supported in pre- 
Zander ar ni by the writer (Korman, 1970), 
tyan se reel (1967), and ina study by 
as fou ocke (1967). In the latter study it 
und that those who had a high opinion 


ace ‘ 
ed on them worked harder to achieve the 


Se 
a low 


a 
intent goals than those who had 
Set Placed on them but had the same Set 
P 
ternal other hand, when the goals 
the re Y set, both the consistency model and 
an’ tional model would predict that perform- 


ce pi ia 
is a function of the level of expectation, à 
the find- 


and Kay 
» and the research reported in this article. 


ndings con- 
In the 


are ex- 


Predict 
Predic fon: 
ction which is well supported by 


ings 
(1966) Berlew and Hall (1966), Stedry 
ù C a Er implication of these fi 
| Contemporary leadership theory- 
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continuingycontroversies which have marked 
the growth of modern or behavioral science 
organization theory as opposed to the more 
traditional frameworks, there has been a recent 
trend among some students of the field sug- 
gesting that they may both be right. That is, 
perhaps one mode of leadership is correct for 
one type of individual or work group and the 
leader should adjust his behavior to be appro- 
to the characteristics of that group and 
tuation and with a differ- 
ent group, perhaps another form of leadership 
might be most appropriate and most beneficial. 

While this point of view has a great degree of 
appeal, there are some serious problems which 
develop from its adoption. One of these is that 
such a theoretical stance seems to project a 
picture of a leader as a “reactor” to the group 
and a person who fits himself to the group, 
rather than as an “innovator” and as an instru- 
ment for “change” and “improvement.” 
Second, while a leader may adapt himself to a 
group and fit his style to their characteristics, 
given the group as it is, such adaptation ignores 
the simple observation that perhaps the best 
way to increase the group’s effectiveness would 
not be to adapt the leader’s behavior to the 
group, but to change the group. Finally, a third 
problem with such attempts at theoretical 
reconciliation is that they may not be necessary 
in that a specific mode of leadership behavior 
may have more wide-ranging applicability than 
previously thought. The findings of these 
studies suggest considerable support of this 
third position (without negating in the slight- 
est the fact that the other two factors may also 
have considerable importance). 

There are, of course, some limitations to the 
research reported here, with the two most im- 
portant being (a) the sample size reported in 
the field studies; and (b) the fact that these 
latter studies were correlational, thus suggest- 
ing some ambiguity in interpretation. Our re- 
sponse to these limitations would be as follows. 
First, while there is little doubt that high ex- 

ectancies might result from high performance 
as well as the other way around, as we ourselves 
(cf. Korman, 1966), this 


e argued elsewhere 
annot account for the findings 


. On the other hand, 
have advanced will 
all the studies. 


priate 
that, in a different si 


hav c 
interpretation C: 
of the laboratory studies. 
the interpretation we 

account for the findings of 
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Second, while there is little doubt that the sam- 
ple sizes in the last three studies are quite small 
when viewed separately, the studies are highly 
consistent in their findings. Similarly, if we had 
standardized each distribution and then pooled 
them, then the samples would have more 
closely approached respectability, although 
still on the small side. However, we choose to 
present them this way in order to be able to 
show the consistency of findings across different 
types of work situations. Hence, while further 
replication of these findings with larger samples 
is greatly needed, the consistency of the ob- 
tained findings and their integration with the 
first two studies should also be kept in mind. 

One final note concerning needed further re- 
search concerns how far we can reasonably 
push the expectancy effect. It is clear that an 
asymptote will be reached, but where? How 
high? Can we formulate some general law? For 
example, might it be some ratio of the level of 
performance a person will engage in when he is 
under little or no external expectancy? Or will 
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the general law, if there is one, depend on the 
original level of performance? 
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practical strategy to aid in identifying 


satisfactory vocational adjustment. This article 
menting subjective clinical information concern: 


more objective data for formulating specific pl 
| items differentiated placed successes, placed failures, 


ninitial and replication analysis. Validation and replica- 
he final items comprise an effective instrument for select- 
o will experience failure in employment. 


revealed that 15 behaviora 
and nonplaced patients on a 
tion results indicated that tl 
ing the mentally restored individual whi 


hospi ational counselor in the mental 
balance Ayers must strike an optimum 
who will cl ieee expenditure between patients 
he unen med benefit from his efforts and 
enefit res oyable client who might better 
Continued r some other hospital procedure. 
toa sare reliance upon clinical judgment as 
Te t employability is not only time- 
criving i a dilutes the potentiality for 
Services, ull advantage from placement 
these ens of this study was to examine 
Core ines Sea of patients classified as hard- 
o ological S oyables and to develop a meth- 
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determine i investigation was undertaken to 
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ata for eee with more objective 
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ntally restored patient is an area in need of a 
those who would have difficulty in making a 


examines the tenability of supple- 
ing a patient’s employability with 
acement procedures. The analysis 


1964). However, there are no investigations 
representative of the opposite approach ; that 
is, developing criteria by which to identify 
patients most unlikely to make a successful 
vocational adjustment. 

The setting for this investigation was the 
Fort Logan Mental Health Center (Mental 
Health and Manpower Project, 1968) in 
Denver, Colorado. As an integral component 
of the Center’s vocational services, a place- 
ment department operates to develop jobs in 
the community and to place restored psychi- 
atric patients in appropriate positions. In- 
formation collected by this department was 
the primary source of data relevant to the 


unemployable patient. 


METHOD AND RESULTS 


Phase 1 

The placement department began by review- 
ing its records of patient-clients who had been 
both successfully and unsuccessfully placed 
and employed. A pool of items was then com- 


piled listing behaviors incompatible with 
successful employment. The items consisted 
of both preemployment (i.e., interview situa- 


tions) and on-the-job behaviors. An 
items to minimize 


After screening the 

ambiguity and item overlap, a total of 40 
behavioral variables were selected for analysis. 
These items were then classified within the 
following categories: (a) motivational: for 
example, “late for placement interview”; 
(b) employment experience and stability : for 
example, “no experience” ; (c) vocational goal: 
for example, «unrealistic in terms of training 
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TABLE 1 
ANALYSIS OF ITEMS INcoMPATIBLE 
WITH VOCATIONAL SUCCESS 
Item x 
Motivational 
Misses appointments for placement or papis 
counseling interviews n 
Usually late for placement or counseling n 
interviews 4.46 
Resistance or refusal to accept job rege 
referrals x 
Procrastination on following-up job leads 12.70**++* 
Vocational 
No work experience 3.83* 
Vocational goal too limited 3.65* 
Vocational goal unrealistic 3.56* 
Incomplete or no vocational or technical 
training, and no education beyond high 
school 8.99+44* 
No transportation, or dependent on 
public transportation which is unrelj- 
able or difficult to use 3.52* 
Personal 
Inappropriate behavior during place- 
ment or counseling interview 4.61** 
Extreme under- or overweight 4.35* 
Appearance of having a lack of physical 
vitality 9.15044 
Limited because of physical handicap 3.56* 
Difficulty in making self understood 
(poor communication of thought) 4.96** 
No evidence of being able to handle 
leadership responsibilities 7.734 
Indicates Preference to bei 
ing a follower 
rather than a leader 8.668% 
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and experience”; (d) perso 
example, “extremely unk, 
ality: for example, 
alertness, self expr 


nal appearance: for 
j 

ly empt” ; (e) person- 

antagonistic “a (6) mental 


ession: for example, “apa- 


“E00 
thetic”; (g) miscellaneous: for example, “to 
old.” 


Phase 2 


After selection of itums, a group a 4 
patients was drawn, all of whom had ar He 
time during hospitalization received pee, | 
from the placement department. This gr E 
was then divided into three tego n 
vocational success defined by the solowy 
criteria: (a) placed failures (PF): ee } 
who had terminated within 3 mo. after (ps)! | 
ment (N = 25), (b) placed successes (Pa 
patients still employed 3 mo. after piacan 
(N = 32), (c) nonplaced patients Cra 
patients who had utilized placement ogre 
but were not placed (N = 30). The d on 
groups were then statistically compare : 
the 40 behavioral items selected in cp the 
The initial analysis indicated that 16 sh o 
variables either tended to differentio®: gg 
differed significantly among the three Er iE 
These items were classified into the categ 


} 
ee s orson 
of motivation al, vocational, and pe t 


characteristics. Table 1 summarizes 
findings. 


Phase 3 


ing 
: ego! 

The 16 items retained from the i g 
analysis were then cast into the forn yas 


z ale ¥ 
checklist. Tt was Presumed that the sc? g 
an effective instr 


the three vocat 
effort to increase t 


nS 
ting an 
ument in differentiating ® 3 


- jn 4 
ional groups. However, sity, 
he reliability and sensit; 
of the present instrument, a rep T pat 
analysis was Performed utilizing the ite™ 
ferences in Phase 2. cor 
Anew sample of patients was selecte p. jca 
sisting of the following groups for the a p5 
tion analysis: (a) PF (N = 41), O of 


(N = 49), (c) NP (N = 39). This grou? a 
Patient-clients w 


on the 16 item 
validation group: 
analysis demons 


showed di p7 


as then statistically AR thf 

s found to different! atio? 
s in Phase 2. The repli a 
trated that 15 of the pe 
again produced Meaningful differen (or? 
final listing of items comprising the pa mi” | 
Unemployable Scale (HUS) and v4 tion: 
data are Presented in the following S°° 
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Phase 4 


Poa validation and replication (not a true 
data) of pe due to unavailability of S 
of the shee se took the form of rating each 
stituting pna groups on the 15 items con- 
rate each m —_— Counselors were asked to 
group, 7 Pe oe in the two groups (validation 
the Sones ol, replication group, # = 129) on 
tational a Pei og with successful vo- 
the ee c Justment The mean differences for 
mean i ate summarized in Table 2. The 
is siete Table 2 indicate that the scale 
Patients as he selecting the three groups of 
of ce istinct groups according to degree 
al success. 


DISCUSSION 


Bent = 15 items were classified as motiva- 
The bið cational, or of a personal nature. 
definitely T e memgloyiis patient-client is 
ment proc sa ing in motivation for the place- 
Counselin er This person is prone to missing 
job mee interviews, reluctant 1n accepting 
on Rear and procrastinates 1n following-up 
The un a employment leads. 
istory a i person. has a poor work 
too fers pically, the vocational goal is much 
or ivune e E in terms of his abilities, 
negative ertain. This individual also creates & 
Situation impression during the interview 
‘Nation. 
ey Ra consequential implication of 
Present a such behaviors as @ part of the 
tial for eth the illust ration of the poten- 
ment m mploying objective data. to supple- 
concernj ore clinically oriented information 
Peridence= a patient’s, employability. De- 
result ir e on clinical judgment alone = 
energy n notable wastes of counselor time m : 
Winer l when dealing with the chronica y 
Dtoack oyable patient-client. The practical ap- 
aid in offered by this research can serve = 
accordi better distributing counselor tM 
Emne ge client needs. f 
Specific ination of the item content rev 
Unenplorable or trouble areas for 
May be yable client. For example, 
Partly i ated as potentially unemploy aye 
y to missing employment interviews OF 
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TABLE 2 


Svanrary OF MEAN DIFFERENCES PRODUCED ON 
HARD-CORE UNEMPLOYABILITY SCALE RATINGS 


Group Bs F 

Validation 

NP (n = 30) 12.77 2232% 

PF (n = 25) 10.08 

PS (n = 32) 4.19 
Replication 

NP (n = 41) 12.34 19.62* 

PF (n = 39) 9.95 

PS (n = 49) 4.12 


Note.—Abbreviations: NP = nonplaced patients, PF = 


placed failures, and PS = placed successes. 
* p =.001. 


procrastination in following-up on job leads. 
This immediately informs the counselor of 
specific behaviors detrimental to this person’s 
successful employment. Intensified counseling 
and retraining by the vocational counselor 
could increase the probability of these indi- 
viduals obtaining successful employment. 
Subsequent validation with larger groups 
should produce clearer and more accurate 
normative data. Also, inspection of the content 
of items indicates that they are not necessarily 
behaviors that are characteristic of mental 
illness. Many unemployable people are char- 
acterized by qualities such as “no work 
experience” or “lack of education.” There- 
fore, the items appear to envelop and are 
applicable to the whole range of unemployable 
people, including the mentally disturbed client. 
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SOURCES OF REFERRAL AND EMPLOYEE TURNO\ 


MARTIN J. GANNON? 


University of 


In this study seven methods or sources t 
pared in terms of their influence on 


One of the presistent theoretical and practi- 


psychologists is 


r may be the 
re drawn. In 
research has 
eferral or re- 
g them to em- 
Mandell (1956) 
500 or fewer 


S generally h 


ave recourse to 
ertisements 


newspaper ady (Mandell, 1956 F 


Malm, 1954), 
The objective of 
hypothesis that r 


METHOD 

The personnel departm, ew York bank has 
‘mployee referral for 
in this article have 
department categorizes 
f l as follows: (a) The em- 
tisement for work ina newspaper. 


B. Miner and 
nd Suggestions, 
Clark. on by the out- 
uld be sent 
, College oj 
versity of 


to the author, 
usiness and 
and, College 


Public Admi 


Maryl 
Park, Maryl ary] 
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Maryland 


(b) The employee w: 


cy 
‘or hiring age? 
as hired by the major hiring 

under contract wit 


ree Wa 

h the bank. (c) The Pine unde 
hired by hiring agencies other than the a keka referred 
contract with the bank. (d) The employ a employee bog 
to the bank by a present employee. (e) ne a from m 
been reemployed after a period of ke high school; 
job. (f) The employee was referred by enrouâh som! 
and (g) The employee came to the bank the perso” 
other method, mainly by walking into ef referta 
office and asking for work. For each sourci ihe number 
the bank then develops a quit rate, that P within 4 
of employees hired through a particular Soni aes ee, 
Specific year divided into the number of en A that bie 
that source who were hired but quit wi rder tO di 
Thus, the quit rates can be compared np 
cover the best and worst sources of referra jn the 

In order to ascertain if the differences onstruct z 
rates were significant, 2X 2 tables were € al and tY 
the rows of which were two sources of i R n n 
columns of which were the number of ore was a 
quitting and the number who quit. ahs (see Tab 
used to test the significance of the differences 
2). 


quit 


et, 


Resutrs 
Four years 
able 1, the 
Sources of re 
6,390 


jo 
ót. 

are investigated: pei sever 
quit rates for each of t total ° 
ferral are presented. EM period 
es were hired during thi the! 


rang? 
S wer? 


reemployed to 
who came to th 
agency. Table 
showing the a eren 
referral, differ” 


significant 1iscoV’ 


Ources of referral were d ni 
Ources of referral were ae 
antly different from a 3 
Sources of referral (p < 001), The m 
are the reemployment of former worke 
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y 


oom ra as of individuals referred by their 
br TA, s, the hiring of individuals referred 
e aps, and others (primarily 
los ag ese four sources of referral have 
a, ce me ranging from 21.3% to 28.8% 
Gablayee e pa as suppliers of stable 
fleri nA “Hee three sources of 
37.8%, ‘aa have quit rates ranging from 
eee 0% can be viewed as suppliers of 
the em Reign To hese three sources are 
the hone menk of individuals who come to 
thot through the major hiring agency, the 
BE nes PA se of individuals who come because 
. OF jetta advertising, and the employment 
agenci uals who are contacted through hiring 
T cies other than the major one. 

Me are no significant within-group differ- 
le the three poor sources of referral. 
there a 13) within the group of good. sources 
B three significant differences. “Others, 
ane is a poorer method for re- 
< 03)" employees than reemployment (p 
ati or recommendations by high school 
inistrators (p < .01), although it does not 


signi : ; 
nificantly differ from recommendations by 


MATRIX or CHI-SQUARE VALUES SHOWING TH 


E ASSOCIATION BETWEEN TH 


TABLE 1 


PERCENTAGE Quit RATE BY SOURCE OF 
REFERRAL: 1961-1964 


Source of referral | No. hired | No. quit | % quit 
Reemployed 253 54 21.3 
Referred by high 

school 602 131 21.8 
Referred by present 

employees 2320 615 26.5 
Others (mainly 

walk-ins) 1212 349 28.8 
Hiring agencies except 

“major” 637 241 37.8 
Newspaper advertising 512 202 39.4 
Major hiring agency 854 342 40.0 

Total 6390 1934 


The final methodological point of importance 
concerns the stability of quit rates for each 
method over time. In the analysis thus far the 
total figures for the period 1961-1964 have been 
presented. It is entirely possible that a par- 
ticular recruiting method is characterized by a 


low quit rate one year put a high quit rate the 
ies would tend to 


fe p 5 

ty employees. Reference by present em- next. Such inconsistenc 

2 Oyees is a poorer source than recommenda- undermine the analysis. However, as shown in 

lons by high school administrators (p < .05). Table 3, there are no inconsistencies when the 
TABLE 2 


E SOURCES OF REFERRAL 


Higher quit rates? 


à 21.3%-28.8%. 
¥ 37.8 %-40.0%6. 
001 


* 
pes 

a2 S 

? DZ. 
| wee S | 


Lower quit rates" 
Hiring 
t Source of referral Referred Referred Others be Newspaper |Major hiring 
Reemployed| by high | by present (mainly except | advertising | agency 
school employees walk-ins) “major” 
Reemployed 04 3,40 6.141**  22.83* 25.57* 30.46* 
Referred by high à m n g 
school 5.677** 10.23** 38.06 41.33 53.83 
efe 
ei presen 2.10 31.16" 34.24" 54.32" 
Other. : 
et 1570" 18.80" 28.49" 
iring dendie aa 
N except major” 31 75 
€wspaper i 
advertising s 
ajor hiring agency 
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TABLE 3 


CONVERSION OF THE Qurr RATES ror EACH RECRUITING 
METHOD TO RANKS FOR EACH YEAR 


Ranked data 
Recruiting method 


Ne ee 
1961 | 1962 | 1963 | 1964 


| 


Newspapers 

Major hiring agency 

Other agencies 

Referred by fellow employee 
Reemployed 

Referred by high school 
Others (mainly walk-ins) 


BNR WUD 
OREN US 
Pe WNM ATU 


meme | fe 


* No employees were hired through these Sources in 1964, 
quit rates are analyzed 
the quit rates for each 
verted to ranks. In eac 
methods of recruitment 
quit rates than the fou 
cruitment. While there 
within 


by year. In this table 
year have been con- 
h year the three poor 
uniformly have higher 
r good methods of re- 


ixth in 1962 
there are no exceptions when the two groups of 
good and poor recruiting methods are 
compared, 

Discusston 


re predicti 


employees, namely, the use of 


other than the maj 
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: Das . ee, 
estimate of such costs is $1,000 per empio 
The significant difference in the quit rates 1] 
the seven sources is 9.0%, since the two group 


of sources identified as good and poor in ee, 
ence to employee turnover are divided in S 
their respective components by “othien 
(mainly walk-ins)” (28.8%) and “hiring agen 


I ; In 
cies other than the major one” (37.8%). 


é ; 
the 4 yr. under examination, approximately 
2,000 employees left the bank without ve, 
pleting their first year of employment. Gna 
a quit rate of nine per 100 that could have ie, 
avoided if the three poor sources had not be 
used, the total cost to the bank is $180, 
(9 X 20 « $1,000). m- 

This analysis assumes that enough fowr 
ployees could be gathered through the ar- 
good sources of referral. In a tight labor istid 
ket, such an assumption may be unreal 
However, the analysis does suggest iets in 
if a firm were to spend $40,000 or $50,0 


F r : ividud pire 
increasing the number of individuals me 
through the four good sources rather 


through the 


x tme? 
three poor sources, the inves 
would proba 


bly be a good one. ices of 
Although further research into the sou ii 
refer: 


ral is necessary so that the demograp 
and attitudina] characteristics of these wate? 
can be assessed, the present analysis 19 ‘ut 
that a substantial Proportion of employer jous 
Over can be eliminated through the jue sating 
use of vari È : 
employees, 


= tr 
lous procedures for at 
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PERSONNEL SELECTION AND THE PREDICTION OF ERROR! 


I. van W. RAUBENHEIMER? 
University of South Africa at Pretoria 


A few trends in current personnel selection 


backdrop against which the rationale for a pre 


ability and ov 
dictability is 
le into unde 


the concept of underpredict: 
noncognitive predictor of pre 
group a cross-validation samp 


dictables (n = 51), and overpredictables (» 
between the unweighted sum of six cognitive 


Clerical Adaptability Test) and a criterion 


sulted in rxy = .26, rxy = 48, and rzy 
This compared favorably with an rzy = - 
involving the above six cognitive predictor: 
(used to develop the predictor of predictabi 


ee the last few years some interesting arti- 
such Pene on various off-quadrant topics 
1956 agri predictability (Ghiselli, 
1967) a, 1960b, 1963; Ghiselli & Sanders, 
the rib rs poopie variables (Saunders, 1956), 
iter oe phenomenon (Fisher, 1959), 
oderat variables (Dunnette, 1963), and 
Tn ne a (Guion, 1967). 
about nl sc to reflecting a growing uneasiness 
Predict; = has been called the “traditional 
a fresh A pon ’ all of these articles call for 
istic a and probably more complex but real- 
impli ah oach toward prediction problems. By 
bility aed they also repudiate the applica- 
adhe well-established psychometric theories 
Ales to in this field. 
eh some authors (e.g, Storms, 1960; 
ideas fi 1969) take issue with some of the 
the deat Sard in the aforementioned articles, 
S oe from traditional or conventional 
Keerai strategies seems to have some 
The if not rational appeal. 
Posed anes and prediction model to be pro- 
Marily a the present article were developed pri- 
conce, against some current research on the 
Speca of predictive error and its relation, to 
Situatic individuals in specific prediction 
ations, 
Pe hice argued that if error 
in ee and therefore if it appears n S 
e (or group) after anol her, 
OUutgoin, article was accepted for publication by the 
2 g Editor, Dr. Kenneth E. C lark. 
Bahena for reprints should be sent 
e Ales of emg 
uth Africa, outh Africa, P.O. Box > 
a 


of prediction were 
o recur 
it 


to I. van W. 
Psychology, 
2, Pretoria, 
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AND 


JOSEPH TIFFIN 


Purdue University 


research are summarized to constitute a 

diction model, taking advantage of 
verpredictability, is being presented. A 
developed ( = 224) and used to sub- 
rpredictables (» = 25), accurately pre- 
= 25). Product-moment correlations 
predictors (subtests on the Purdue 


measure of overall job performance re- 


= AO for the three subgroups, respectively. 
25 before subgrouping and an r $y= 34, 


s and two noncognitive predictor scales 


ility). 
should be possible to develop a predictor of 
such error. 

According to Ghiselli, error of prediction is 
defined in terms of the difference (D) between 
edicted and observed criterion scores for 
a given group of Ss. Although Ghiselli did not 
attach any signs to this difference, it is sug- 
gested here that the algebraic difference 
between the observed and predicted criterion 
scores would make for a finer analysis, since it 
will not only indicate the magnitude of pre- 
dictive error, but will also tell what the direc- 
tion (overpredicted or underpredicted) of the 
predictive error for each S would be. Further- 
more, the difference scores would seem to be 
predictable by something like a noncognitive 
predictor scale (P) consisting of biographical 
items validated against the difference scores as 


the criterion. 
Jf the afor 


the pr 


ementioned predictor (P) could 
be developed, it could be used to estimate the 
extent to which each individual in a specific 
situation tends to be overpredicted or under- 
predicted by a predictor (X) or battery of pre- 
dictors already in use. 

In addition, it is suggested that any group 
of individuals in a given situation actually 
consists of three relatively independent sub- 
groups— (a) those who are relatively accurately 
predictable by a given battery of predictors, 
(b) those that are relatively overpredictable, 
and (c) those who tend to be relatively 
underpredictable. 

The predictor of error (P) as discussed 
above, can be used to define these subgroups. 
This can easily be done since the predictor of 
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error estimates for each individual the magni- 
tude as well as the direction of predictive error 
involved. Thus, it could be inferred that all 
individuals who score above the mean on the 
predictor of error (P) tend to be overpredicted 
and all those that score below the mean tend to 
be underpredicted. This is the specific advan- 
tage of obtaining the algebraic and not the 
absolute difference between the predicted and 
observed criterion scores. 

The subgrouping itself would be rather arbi- 
trary. One can, for example, regard the over- 
predictables as the 25% of the group that score 
highest on P, and the 25% that score lowest on 
P can be considered underpredictable. This 
will leave the middle 50% of the group defined 
as predictable. It can readily be seen that these 
cut-off points could be altered depending on 
the number of employees to be selected, the 
number of applicants available, etc, 

Once the subgroups are 
battery of cognitive predic 
to each group separately, 
the correlation between th 
criterion will be higher 


Separately than for the 
together. 


ation was presented 
regarded as a Major 
anizations 


educational s 
tion at all levels, 


V J e be expecte, 

extent of difficulty Imposed on S óy wig ee 
ame for all, 
Organizational stru 
formed, as furnished þh 
the daily 
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P in 
of smaller groups with one person (the me ete 
each group responsible for the performance of ime 
bers of his group. The supervisors in all the ifi a 
branches were requested to act as judges of os a 
ployees’ overall job success by using a paired-comp’ 
son rating system (Tiffin & McCormick, 1965). ai 

All criterion ratings were gathered before any va 
dictor information was obtained to reduce the ante 
bility that performance on the predictors could in 
the criterial ratings made by the supervisors. 


re- 


Predictors 


; , ility 

Cognitive predictors. The Purdue Clerical Adapted cri- 
Test was administered to all employees for W 50 eri 
terion data were obtained. The test, covering S ion 
range of clerical activities such as spelling, coe 
checking, word meaning, copying, and reasoning the | 
considered to provide meaningful information 
purposes of this study. ing subtest 

Although in the computation and reasoning s! is col 
reference to the dollar is frequently made, this “4 sin 
sidered to have created no substantial problem on 
South Africa's currency (the rand) also operates 
decimal system, 

Noncognitive (biographical) predictors. £ 
100 biographical items was drawn from AG ci 
Life History Items (Glennon, Albright, & Owe! yan ing 
with the number of response options per P were 
from two to eight. The content areas cover mo! oy 
habits and attitudes, health, human relations ies, 
family life (past and present), recreation, za 
interests, school, education, self-impressions, class 
opinions, preferences, work history, and som 
cation data, 

Response cate; 
stituted 
others, hi 
lying respo 


was 


g P Jearly 
gories of some of the items © ev 


value of o 
if not. T 
binary S 


i a d 
ne if the response option was peba 
he procedure generated a total num 


. . -ealth re 
hich provided a wea!’ al P 
front. whieh biographie ci ssd 
developed, as will be 


- sjol! 
jel! 
Accuracy pred 


Y of Some Conventional 
Techniques 


t ression W. won 
ermine 7 ing regres? a 
on an experi d by developing reg ght? yn 


The wes a 
then applied ton! aoe Coan = w ý 
observed and pred M 
computed. cage the acct 
tna eo termes 
dictors in this study ihe soint-biserial Co we 
between 494 binary ` items ia the crite nsis” 
generated and two Scales were derived. O” 
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po poe which correlated positively with the cri- 
ea and the other of 20 negatively correlated 
leval “Seg correlations were significant beyond the .01 
lates Tee the two noncognitive ales were ob- 
dai. Ei 33 in an experimental group. After regres- 
TET E were developed for the two scales on the 
on, kn al group, they were applied to a cross-valida- 
betwen n ensi of 110 Ss, and the correlation 
calla be e observed and predicted criterion scores 
i de determined, 
aoe ma to get an indication of the predictive accu- 
tors comi iS cognitive and two noncognitive predic- 
the po pi a regression analysis was performed for 
= 224) val pepe on the experimental group (n 
evens oy the weights so established were applied to 
ss-validation group (2 = 110). 


Im prow icli 
provement of Prediction 


Fi Noche to define the magnitude and direction of 
braic diffe er of the cognitive predictors, the alge- 
on the Sets between the standardized total scores 
tends Clerical Adaptability T est and the 
eXperim se criterion scores of all individuals in an 
he eae group (n = 224) were obtained. These 
which all p scores constituted the criterion with 
then ear la 94 binary-scored biographical items were 
biserial re ated. Only those items Go 41) with point- 
05 eee coefficients significant beyond the 
error sez were selected and combined into a predictor of 
ale, 


A tota P ‘ : 
total score for each individual in the experimental 


r 
Piha obtained by assigning a + 1 if the individual 
lated witht item which happened to be positively corre- 
a negati ag difference score (D) and a — 1 wh never 
ing nee correlated item was endorsed. (TI 
ranging frag yielded 224 predictor of error scale scores 
a i hale + 10 to — 10—a positive score indicating 
Just the cy to be overpredictable and a negative score 
pa reverse.) 
he apredictor of error (P) was then correlated with 
tion of ee of error (D) in order to obtain some indica- 
real test pee It was realized, however, that the 
and it we of P would come only under cross-validation, 
validation therefore decided to apply the scale to a cross- 
that th on group consisting of 101 Ss. (It will be noticed 
i E Sann of the cross-validation groups used in the 
instan nt situations were not always the same. In some 
Puter ces cases had to be deleted since one of the com- 
Missi programs used read blanks on the input cards as 
Sing data instead of zeros as it was intended to be.) 
Re to establish the three predictability sub- 
Coss bee defined before, all individuals (x = 25) ina 
-validation group of 101 who scored above +2on 
digit of error scale were considered ala 
aoe es, all those (» = 25) who scored below — © en 
ane z regarded as underpredictables, and the resi 
TI 51) constituted the accurately predictable p 
X) a unweighted sum of the six cognitive pre ie n 
correls as used as the predictor in this analysis, ei 
frou ation between X and Y was obtained for mer aa 
th P Separately and combined to determine whe! : 
© ‘subgrouping had any effect on the accuracy ©! 
Prediction, 
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TABLE 1 


MULTIPLE CORRELATION COEFFICIENTS AND THEIR 
CORRESPONDING CROSS-VALIDATION CORRELATION 
COEFFICIENTS FOR SIX COGNITIVE AND Two 
NONCOGNITIVE PREDICTORS SEPARATELY 
AND COMBINED 


Experimental Cross- 
Predictor group validation 
(R)* group (rys)? 
6 Cognitive (A) 30 27 
2 Noncognitive (B) 45 33 
A+B 50 34 
an = 224 in each predictor. 
bn = 110 in each predictor. 


RESULTS AND DISCUSSION 
Accuracy of the Conventional Prediction Tech- 


niques 
The multiple Rs obtained for the six cogni- 
tive predictors and the two noncognitive pre- 
dictor scales separately and combined appear 
in Table 1 along with their magnitudes under 
cross-validation. All Rs were expected to shrink 
under cross-validation to some extent. There 
owever, more shrinkage for the two non- 
cognitive than for the cognitive predictors. 
Also the multiple correlation coefficients based 
on both cognitive and noncognitive predictors 
decreased more under cross-validation than the 
cognitive predictors alone. 
The two noncognitive predictor scales seem 
to have done substantially better than the six 
cognitive predictors (R= 45 vs. R= 30) 
but were less stable under cross-validation 
(rey = 33 VS. My = .27). Combined into a re- 
ight predictors yielded 


gression equation the ei 
ai 50 whi e to roy = 3H und 
an R = .50 which shrank to ry = + under 


cross-validation. The latter correlation coeffi- 
cient seemed to be the highest that could be 


obtained through an ordinary combination of 
the cognitive and noncognitive predictors into 
a regression equation. Any improvement In 
predictive accuracy therefore had to be com- 


pared with this figure. 


was, h 


Im provement of F ?>rediction 

includes several correlation coeffi- 
ate the extent of predictive accu- 
s proposed in 


Table 2 
cients to indic: 
racy under the different analyse: 
the foregoing sections. ¥ 

Predictor of error. A product-moment correla- 
tion of xy = -57 between the predictor of error 


232 


I. van W. RAUBENHEIMER AND JOSEPH TIFFIN 


TABLE 2 


2 r = Pass 'E 
P; MOMENT CORRELATION COEFFICIENTS ĪNDICATING THE PREDICTIV 
sae Accuracy or DIFFERENT MODELS oF PREDICTION 


a 
Coefficient Coefficient 
for the for n tion 
Predictors involved experimental |cross-valida 
Model | i group group 
| n= 
pe : at 4 Thy = - 
Multiple regression 6 Cognitive and 2 noncognitive predictors R= se bed = 20 | 
Beare y Predictor of error , rep = .57 em Ja 
Total group [Total on the 6 cognitive predictors | | j | 
Underpredictable [Total on the 6 cognitive predictors | | 
Overpredictable [Total on the 6 Cognitive predictors | 
Accurately predictable /Total on the 6 Cognitive predictors 


= 224. 
bn = 110, 
en = 101. 
dn = 25, 
en =5)1. 


and the criterion of error for the e 
group, and one of rzy = .22 for the 
tion group clearly indicates (alth 
to considerable shrinkage) 
possible to develop a predic 
study, 


If this outcome is to be interpreted with 
reference to the individuals (as 


xperimental 
cross-valida- 
ough subject 
that it had been 
tor of error in this 


viduals tended to interact 
six cognitive predictors 
differential interaction ten 
Occur in the experimental 
indicate that it could be 
non), but it also proved t 
cross-validation sample, 

A word about the con 
seems in order at this juncture, Unpredicta- 
bility, as indicated befor 


differently with the 
than others, This 
dency did not only 
group (which would 
a random phenome- 
o be predictable ina 


implies a loss 

d very well turn out to 

be valuable employees, (Incidentally this is 
also a point of critique 


on the off-qu 


Study of Hobert & Dunnette, 1967 raj 


McNemar, 1969.) 


.  givell 
By defining unpredictability in a twee? 
situation as the algebraic difference ae 
Predictor and criterion scores, the ee 
unpredictability (overpredictable or — - 
dictable) is determined and also can 
dicted, as the results above eT us 
additional information can therefore in the 
effectively in Prediction as discussed 
following sections, 


pr 
This 
e! 


Subgr ouping to I. mprove Prediction veld” 
f the total group into pane has 
ndent predictable pubar the 
Prove prediction more enous!) 
Tegression technique Pre ation® 
ording to Table 2 the cor dic’ 


otal on the six cognitive 


dation 
ida 
criterion under cross-V® and 


A division 0 
tively indepe 
shown to im 
Conventional 

iscussed, Acc 
between the t 
tors and the 
for the under 
accurately pr 
be .26, .40 
an improve: (cb, 

E rsy = 34 obtained through the en 10"? 
equation involving the six cognitive ) sco 
and the two Noncognitive (biographica y e sub 
Even riy = | for the underpredicta ta atii 
group is an improvement on the co idat? 
between X and Y for the total cross-V 25. d 
Soup which aPpened to be rsy reac 
vas if classifying peop el pa” 
ictive tendencies can 


appreciable exte 
regard that the 


Sie as no | 
Classification was [ 
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"sae on the cognitive¥predictor composite 
ad D in the off-quadrant study by Hobert 
velo maenig (1967), but involved the de- 
3 Pment of a predictor scale against algebraic 
sorte thng evading the critique voiced by 
n (1969) against the mean differences 
Enpo loa. danaa groups on the predictor 
and Du e scores as employed in the Hobert 
i nnette study. i 
eee some interest to know,what kind of 
ie p ical items differentiated between the 
EE e AR This information could be ob- 
that ba a content analysis of the items 
well, Tf re the three predictive tendencies 
which wer the items in the predictor of error 
Prediction . negatively correlated with error of 
Would be Joo endorsed by an individual, he 
Converse) i assified as relatively predictable. 
items Y, an individual endorsing all the 
ew a ely correlated with error of pre- 
re Nai be classified as being relatively 
ility te able; The direction of unpredicta- 
=e or under) will not, however, be 
The | om this analysis. 
es me that separated | the predictables 
ind of eines vaguely describe the following 
ot —— i an He (or she) is married, has one 
organized children, conservative, stable and 
dued) zed, not outgoing, reserved (even sub- 
sider neon overly popular, and does not con- 
6 Imself a leader. 
able” a grier hand, the relatively unpredict- 
ollowin ma ual possesses more or less the 
in A characteristics which are amazingly 
Single i those listed above: The person 1S 
Personally. no dependents, not well organize 
respect mi moves around frequently (with 
unchall o residences as well as jobs), bored by 
Pendent ee outgoing, popular, inde- 
and was considers himself a leader of others, 
school considered somewhat of a problem at 


CONCLUSION 


The g 
he results obtained in this study seem to 


e 3 
ee one major point, namely, that a3 
at r of job success or academic success oi 
il nt can be improved beyond levels 
study”, attainable through a more thorous! 
re m of the individuals for whom pea 
and a de, Interactions between the indivi i 
Error € prediction instrument which make for 
t of prediction should not be assumed to be 
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random. This, however, does not imply that 
error of measurement per se should not be con- 
sidered a random phenomenon. The results of 
this study deal specifically with error of predic- 
tion due to systematic predictive tendencies 
among individuals rather than with mispredic- 
tion due to random errors of measurement, 
which could be the result of differences in test 
environment and/or testing equipment and 
procedures. Failure to make the latter distinc- 
tion is most probably directly responsible for a 
great deal of discord among personnel psy- 
chologists regarding the assumption of random 
error of measurement. 
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This investigation examined the exi 
and compared the operation of the 
study habits) according to three di 
differential prediction oj 


In an effort to increase predictive efficiency, 
investigators have given considerable attention 
to techniques involving moderated prediction, 
In particular, subgroup analysis, differential 
prediction of predictability, and moderated 
regression have been used, 

The variables under investig: 
techniques are generally referr 
tor variables. Banas (1964) defined a “modera- 
tor variable” as a quantitative or qualitative 
variable which improves the usefulness of a 
predictor by isolating subgroups of individuals 
for whom a predictor or set of regression 
weights is especially appropriate, Implicit in 
the moderator variable technique is the notion 
of dividing 
homogeneou: 


ation in these 
ed to as modera- 


a heterogeneous 


S, homogeneous 


subgroups should show differential patterns of 


validity, 


able concept is particu- 
ons in which a validity 
but where 


hom more accu- 


rate predictions can þe made. The practical 


implications of moderator techinques are in- 
Requests for reprints should be sent to Sh 
i eld 
Zedeck who is now at the Departme = 


leck y ) nt of Psycho] 
University of California, Berkeley, California 04729. 
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creased flexibilit swe 
z ae ; Js, a 
Increased efficiency of selection models, 4 


aesa 
y in the use of predictors é 


as the isolation of 


addition 
groups for whom additi 
or different predic 


tors should be tried. 
Prediction Techniques 


jille 

Subgroup analysis, Frederiksen and Mely pe 
(1954) hypothesized that the usefulness fying 
interest test can be improved by age on 
Subgroups of engineering students for ne e 
the test was especially appropriate aga erest 
dictor, Specifically, the usefulness of pase 
measures would be ; eed 
compulsive group. A measure of reading SP d 
and an unspeede: 
to identif 


~ar 


Blank, correlations bem f 
and grade point averages sive 
higher for the noncomP™ ica 

the compulsive group. A rP ged 
tion (Frederi sen & Gilbert, 1960) 3 
similar results, but only for the occup? 

z ally related to engineenne 
for Subgrouping have ine job 
-solving Styles (French, 1965) ‘ 
characteristics (Peterson 1964), scores is- 
knowledge test (Steinemann 1964), jo 961) 
faction (Dawis, Weiss, Lofquist, & Betz, ‘ 
and anxiety and aptitude (Pervin, 1967). 561 

Lifferential Predictability, Ghiselli’s 
1960) technique involves obtaining # "gize 
difference Scores (D) between standa" 


tion? 
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criterion and standardized predictor scores. 
The magnitude of D serves as an index of pre- 
dictability; the smaller the D, the better the 
en r ween criterion and predictor. Corre- 
and bach pe D sone: are subsequently identified 
high on pi edictors of predictability. A 
BAR abe correlation indicates that the 
dh: the > ould be higher for those scoring low 
the enti predietar g? predictability than for 
Sites Ire group. The 4 lowest and 3 highest 
tie re on the predictor of predictability test 
‘elite tested for significantly different 
AA Prag igi A significant difference in 
s dictor is e low group indicates that the pre- 
than for eel useful for the lowest 4 scorers 
for the remaining Ss. 
os (1956) demonstrated the potential 
predicti ss of the technique in a study involving 
vers ie of job proficiency of taxicab, dri- 
those p occupational inventory differentiated 
ae ee for whom a tapping-and- 
fictenes test was a good predictor of job pro- 
itself ns The measure of occupational level 
elf had a negligible correlation with the 
Criterion, oS 
ane s moderated multiple regression. The 
the Re ae variable is a means of maintaining 
U of the total population while still 
Ts ae a statistical control on each indi- 
infinite membership in one of a continuous, 
cick senma of subpopulations defined by his 
Hence S, the moderator (Saunders, 1955). 
duae > san (1956) treats the cross pro- 
a Hew es moderator and predictor scores as 
pve he seme in any standard multiple-re- 
echnique. The regression equation 15: 


T J+ E amt Eby + Eestis 
and mie b, and c are weights, Xi the predictors, 
2; the moderators. 
an guunders (1956) reanalyzed the F rederiksen 
tegr Telville (1954) data using the moderated 
Sa technique. In 5 of 10 comparisons, 
Ne crea regression resulted in increases 
erat validity over the usual multip e- 
liere ation method. However, in no tase 
Coe any significant improvement over i 
the Tes for the noncompulsive subgroup 1 
rederiksen and Melville study- 


Pris Bile 


Zis 


The literature pertaining to moderator vari- 
F “joint 


“bles has largely ignored the effect of 
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moderators.” Joint moderators may be defined 
as two or more variables, quantitative or 
qualitative, that interact to influence a validity 
coefficient. The traditional one-variable-at-a- 
time approach does not take into consideration 
the possibility that several moderators may 
have to be combined. It is conceivable that two 
potential moderators, for example, study habits 
and anxiety, will not, when operating sepa- 
rately, influence validity of a predictor. It is 
also conceivable that the validity coefficient of 
a predictor for a homogeneous group composed 
of students with high anxiety and good study 
habits will be significantly different from that 
of a homogeneous group composed of students 
with low anxiety and good study habits. 

An approach which is relevant to the notion 
of moderator variables, and particularly to 
joint moderators, is that of configural scoring 
or actuarial pattern analysis (Lykken, 1956). 
Both moderated prediction and configural 
scoring identify homogeneous subgroups of 
individuals; moderator variables operate in the 
multiple-regression sense, whereas actuarial 
prediction permits the forming of subgroups by 
combining predictors in the configural sense 
and then investigating the criterion perform- 
ance of each subgroup. This article discusses 


the use of joint moderators in terms of regres- 


sion models. 
There have been few attempts to use joint 


moderators. Stricker (1966), in a replication of 
the study by Frederiksen and Melville (1954), 
formed four subgroups from the combination 
of the two dichotomized compulsivity mea- 
sures. Results indicated that the joint use of 
the two variables did not enhance the modera- 
tor effect. This result may be attributable to 
the fact that both moderators were measures 
of the same construct. 

Klein, Rock, and Evans (1968) employed a 
different approach for joint moderators leading 
to differential prediction of law school grades. 
Five items (age, major field of study, degree of 
preparation by college for law school, father s 
occupation, and time at which decision was 
made to attend law school) were used to form 
multiple moderators. Essentially, individuals 
were placed into groups on the basis of the 


similarity of their profiles on a combination 
of items. A hierarchial grouping technique 


yielded four homogeneous groups. In only one 


_ 
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of the four groups was the validity coefficient 
increased over the total validity coefficient. 

The present investigation examined the 
existence of potential “joint moderator vari- 
ables in an academic setting. In addition, since 
the three techniques have the same intent, the 
operation of the same potential moderator 
variables was compared according to the three 
prediction techniques. Two variables, Study 
Habits Inventory (SHI; Wrenn, 1961) and 
Anxiety Scale Questionnaire (ASQ; Cattell & 
Scheier, 1963), were examined to determine 
their potential moderator effects on prediction 
of college grade point average (GPA) from the 
American College Test (ACT). 

To determine the separate usefulness of SHI 
and ASQ as potential moderators, the follow 
hypotheses were tested: 

Hypothesis 1. The use of ACT 
dictor of GPA 
depending on the 


ing 


Scores as a pre- 
is differentially appropriate 
operation of study habits as 
a moderator. That is, different validity coeffi- 
cients for ACT exist depending on whether S 
demonstrates good study habits (GSH) or 
poor study habits (PSH), 

Hypothesis 2. The use of ACT Scores as a pre- 
dictor of GPA js differentially appropriate 
depending on the operation of anxiety as a 
moderator. That is, different validity coeffi- 
cients exist depending on whether § is high 
anxious (HA), moderate anxious (MA), or low 
anxious (LA). 

To determine the usefulness of SHI and ASQ 


as potential joint moderators, the following 
hypothesis was tested : 


Hypothesis 3. The u 


co- 


R 70 
: fulfilling a research 
participation Tequirement of an introductory psy- 
chology course, Because of incomp 


criterion informatio; 


lete predict 
n, the final sai i ue 
= 393, 


mple size was NV 
Procedure 


Subgroup analysis. To test Hypothesis 1 
habits groups were formed Pork gs 


using cutoffs Suggested by 
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the test manual. If S received > +15o0n SHI, he wa 
placed in the GSH group (x = 209). If S e 
< + 15, he was placed in the PSH group = wen 
To test Hypothesis 2, three anxiety groups ‘ded 
formed. Less extreme cutoffs than those recommen 
in the manual were used so that sufficient sample si "a 
were obtained in the extreme groups. The HA a 
n = 104) consisted of Ss with scores > 37; ie ha l 
group (n = 192) with scores > 23 but < 36; an 
LA group (n = 97) with scores < 22. y. 
To test Hypothesis 3, joint moderator groups f the 
determined based on six possible combinations a s 
two study habits groups and three anxiety group 
sulting in the following: 


Joint moderator 1 (n 
Joint moderator 2 (n 
Joint moderator 3 (n 
Joint moderator 4 (n = 30): PSH and La 
Joint moderator 5 (n 90): PSH and on 
Joint moderator 6 (n = 64): PSHfand HA. 


ted 
Diferential predictability. All scores were oe 
to z scores. To test Hypotheses 1 and 2, D Scoren gera 
correlated separately with cach of the potential n 
tors, SHI and ASQ. mi with 
To test Hypothesis 3, D scores were correlate" aris 
each of the following two moderators: (a) Joint Meve 5, 
tor A, composed of Ss with GSH at all anxiety jet 
and (b) Joint moderator B, based on PSH at all Te 5 
levels. Joint moderator score was the sum endent 
standardized scores on SHI and ASQ. Two indep r 
joint 


ensato 
moderators were used so that the compe ion 
effect of the two potenti 


would be controlled and 


40): GSH and LA 
103): GSH and MA 
66): GSH and HA 


aod wea 


al moderators in joint ee a 
y the psychological meaning 
significant effect would be facilitated. and 2 
Moderated regression. To test Hypotheses 1 ate ie 
SHI and AS » respectively, were used in aon A cT 
gression equations, in a cross-product term wit e 
Scores. To hesis 3, two cross-product ijg 
ACT X SHI, were used sm ditio™ 
r ted regression equation. In @ 5) were 
all variables (predictor and potential moderator Oy 
examined in a multiple regression to assess yi sens” 
buted to prediction in a linea 


rm5, 


Res: Lrs je 

The validity coefficient of ACT as & pretal 
tor of GPA Was 49 (p < .01) for the esti 
group: WV = 393). The standard error © r 
- Table 1 presents the interco 
S een the Criterion, predictor, 4” : 
tential moderator Scores (single and joint 


Subgroup A nalysis i. 
s pcie? 
Table 2 Presents the correlation coef ind 


+ a 
between ACT and G anxielY fi 
Study habi ER dop the s 


no volts subgroups, The correlation gie”, 
cient for the M group was significantly “phe 
ent (p < 05) f 


rom that of the HA grouP” 
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TABLE 1 
INTERCORRELATIONS 
Variab 
riable GPA ACT SHI ASQ ACT X SHI | ACT X ASQ 

GPA 

ACT = .49* a —.01 .38* Tt 
SHI 27" —.07 eo 34 
ASQ 32 .99* —.20* 
ACT X SHI = io 
ACT X ASQ E 

SHI = Study Habits In- 


Note.—N = F 
ventory, EEs 393. Abbreviations: GPA = grade point av 
*p S01. Anxiety Scale Questionnaire. 


` standar is . 
dard errors of estimate for the two sub- 


” 
eden .80 and .95, respectively. The 
rami etween the study habits subgroups 
Tabl significant. 
for T n presents the correlation coefficients 
Vise Joint subgroups. None of the coeficients 
ignificantly different from any other. 


Diferential Predictability 


ore presents the correlations between 
tors. The k ana each of the potential modera- 
Scores was a relation between D and anxiety 
Study habit ignificant (p < 05). Neither the 
Correlated s scores, nor the joint scores were 

Since am Ne with the D scores. 
Oy tiated (p < 08) 
ASO scor scores, two fractionations related to 
ighest ie were made: (a) 4 lowest and i 
ighest AS Q scorers, and (b) § lowest and 3 
lation þ i Q scorers. Table 5 presents the corre- 
ractio etween GPA and ACT for each of the 
Saata groups. The difference between 
grou enis for the lowest 3 and highest 3 
eg PS was significant (p < .05). The standard 
1 0S of estimate for the lowest $ and highest 


l 
Stoups were .85 and .93, respectively: 


erage, ACT = American College Test, 


Moderated Regression 


Table 6 presents the multiple correlation 
coefficients between GPA and the predictors 
for both linear and moderated conditions. Item 
4b indicates the relationship between GPA and 
ACT as affected by joint moderators; Items 
2b and 3b indicate the relationship between 
GPA and ACT as affected by single 


moderators. 


It is appare 
6 that the use of SHI sc 
tion by the linear model (increase over the 


simple r is significant in both cases, p < .05), 
whereas the use of ASQ scores (Item 3a) does 
not. Comparison of the linear and moderated 
models indicates no further contribution gained 
by the inclusion of the cross-product terms. 


nt from Items 2a and 4a in Table 
ores improves predic- 


DISCUSSION 
e of this study was to 
compare the operation of potential moderators, 
single or joint, in three prediction techniques. 
The results of this study provided insights 


into the operation of, and problems associated 
each of the moderator variable 


The essential purpos 


with, 
techniques. 


TABLE 2 
Cotneranows eee GPA ano: KEL TOR Suscrours Basep ON ANXIETY AND Srupy Hasits 
Total LA GSH PSH 
grou MA A 
N = 303) a ae | (u = 192) (n = 97) (n = 209) (w= 184) |) 
49 m 36 45 50 37 
Be 
= moderate anxious, LA = low anxious, GSH = good study habits, 
ifferent at $ < .05. 


Bg Ne 
Sy \ote.—Abbreviations: HA 


= Door study habits, Commonly underlined 


= high anxious, MA. are significantly d 
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TABLE 3 


vt Su: ses BASED ON 
a ATIONS BETWEEN GPA anp ACT ror Joint SusGroups Base! 
S BOTH ANXIETY AND Stupy Hasits 
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A 
MA | GSH + H? 
MA PSH + HA GSH + LA GSH +} —"40) 
ea 03), E EES eg Fo (n = 64) (n = 66) (n = 103) (n 
w= = ğ 
5 5 46 
49 22 52 30 52 50 


Note.—Abbreviations: PSH = poor study habits, GSH 
HA = high anxious. 


Improving upon a validity coefficient that is 

already relatively high (r = .49) is both diffi- 
cult and somewhat impractical. While the basic 
purpose of any moderator variable approach 
is to improve validity in situations in which 
predictors are poor, whether a gain is worth the 
effort involved in applying a moderator tech- 
nique is a matter to be decided by the individ- 
ual making the selection decisions, 

Another problem is that of sample size, 
particularly in the subgroup analysis proce- 
dure. Reduction in sample size necessitated by 
any subgrouping approach reduces the power 
of a statistical test, and hence, without large 
initial samples it is difficult to determine the 
effectiveness of a moderator technique. 

Anxiety operated as a “predictor of predict- 
ability” in the differential predictability tech- 

validity was obtained 
and not for the 4 
er group included 


TABLE 4 
CORRELATIONS BETWEEN D 


AND POTENTIAL 
Moperator Sco: 


cious, 
5 = arate anxio' 
= good study habits, LA = low anxious, MA = moder: 


sonificant 
the subgroup approach, where the aati 
difference between the MA and HA g 


indicated that anxiety was a moderator. i. 
Further evidence for the operation of omnia. 

as a moderator is found by examining ait 

standard errors of estimate. In the subg! 


4 + stability tec | 
analysis and the differential predictability t°% 


«1 yaliditi® 
nique, in instances where differential tee ar 
were obtained, more accurate prediction ait 
made for the subgroup with the higher hea o 
coefficient. However, as there is a pean , 
improving an already high validity st accu" 
there is also the problem of improving t iy the 
racy of prediction. In the present stu ” tech” 
improvement in accuracy by moderator i was 
niques over that accuracy already existing 
minimal, ressio® 

Saunders’ (1955, 1956) moderated reg ag t 
technique provided information een 
with that demonstrated by the Sue ech 
analysis and differential predictability 
niques. The moderated 


j ch 
regression tec 
indicated that 


alone 
nan’ 

as © 
abits 2? 
nclusion of study habi f iw 
id enhance validity, the lac 


i anxiety as a predictor n 
m a cross-product term did not © 
validity, While į 
predictor d 


ce 


TABLE 5 


gr F 
RES CORRELATIONS BETWEEN GPA AND À! 
TRACTIONATED ASQ SCORERS 
Grou 
p r Fractionated ASQ group r 
D & ASQ: 
is 25 
D & SHE —03 meet r 2 
D & ASQ + SHI/GSH» 06 wars 55 
D & ASQ + SHI/PSHe -00 atei 3 
Highest a ee i 
difference Scores, ASO = Anxi N i 
Sienas k Co = nes Ate 


is 
setane als, 
é ations: GPA = grade point aVersti08Z 0 
= American Colle H =g S, 


j i Be Test, = Anxiety Scale Sat ? 
Underlined Correlations Se cae Yjifferent 
an = 135, Sie is 
n = 258. 
on = 268, 


an = 


iC 
ique 
hi 


p 


i 


r~ 


i 
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ro ; 
aan in the moderated- evertiemiltinle 
a angen indicates that study habits 
multiple-pr ra an ordinary predictor in the 
tor varieele iction sense and not as a modera- 
conclusion e. Further evidence for the latter 
99 (see rat ae aor by the correlation of 
X SHI. able 1) between SHI and ACT 
e Pesii l explanation for the difference 
sion ae. results for the moderated-regres- 
E e. on the one hand, and the two 
other femal involving subgrouping, on the 
ing quantit my lie in the difficulty of compar- 
Fiatilonatty ive and qualitative techniques. 
mine differe. a E predictability test to deter- 
of the me Be validities allows the treatment 
variable.) EDY variable as & qualitative 
is essential] ans moderated-regression technique 
nen nat an, attempt to describe the multi- 
that is an ok ere surface with a model 
equation fs ET in the predictors, using one 
values of dt describe the relationship over all 
Variables es predictor (including moderator) 
ence of a m ss ana in the experiment. Pres- 
orm of th noderator is thus indicated by the 
hificance of equation, specifically by the sig- 
Moderate e terms. Perhaps the 
tended to į age equation should be ex- 
order term piney squared and other higher 
of other f akar- allowing for the detectability 
ie orms of nonlinearity. 
each id technique, however, considers 
Sie: g ae of the original population to 
Predictors leent linear relationship between 
detected Rice criteria, and moderators are 
coefficients A by differences in the correlation 
ationshi pan index the degree of linear re- 
qualitative e moderator is thus treated as a 
inearity i variable, and the effect of any non- 
the oe a Its relationship to the criterion and 
spondence predictors is ignored. Close corre- 
niques ce between results of the two tech- 
icity nonn be chiefly dependent on the 
Subgro of choice of the composition of the 
ups. 
FE agence were not found in this 
habits = us, the a priori hypothesis that study 
in this = nd anxiety interact was not confirmed 
Ver ied ‘The results do support, how 
ence. ie contention that a variable can influ- 
Criteri e relationship between a predictor and 
on without being related to either. 
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TABLE 6 
LINEAR AND MODERATED CORRELATION 
COEFFICIENTS BETWEEN GPA AND 
MULTIPLE PREDICTORS 
Model R 
Linear 
(1a) ACT 49 
(2a) ACT & SHI „54 
(3a) ACT & ASQ 49 
(4a) ACT & SHI & ASQ 55 
Moderated 
(1b) 
(2b) ACT & SHI & (ACT X SHI) 55 
(3b) ACT & ASQ & (ACT X ASQ) Ad 
(4b) ACT & SHI & ASQ & 
(ACT X SHI) & ACT X ASQ 255 


Note.—N = 393. Abbreviations: ACT = American College 
Test, SHI = Study Habits Inventory, ^ Q = Anxiety Scale 


Questionnaire. 


since discovery of moderator 


variables cannot be guaranteed by their 
relationships with other variables or on explicit 


theoretical bases, future research should be 
chniques for 


Consequently, 


concerned with developing te 
the systematic and rapid identification of 
moderators. 
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COMMUNICATING DIRECTIONAL INFORMATION WITH 
AN AUDITORY DISPLAY? 


J. RICHARD SIMON? anp JOHN L. CRAFT 
University of Iowa 


This study was concerned with determining whether right or left commands could 
be communicated more effectively using symbolic, directional, or combined direc- 


tional and symbolic cues. In a choice reactio 


or left-hand key in response to: (a) 1,000 Hz 
lated by a single pure tone (directional cue), 


(symbolic cue), (b) the ear stimu 


(c) 1,000 Hz or 4,000 Hz monaural tonal commands in the ear corres 
mbolic content of the command (combined directional and symbolic cues). The 
cue condition produced the fastest RT, while RT to the di- 


sy 
combined (redundant) 


n time (RT) task, Ss pressed a right- 
or 4,000 Hz binaural tonal commands 
and 


ponding to the 


rectional cue alone was faster than to the symbolic cue alone. 


va present study was concerned with the 
ewe of communicating spatial 
Gales, aa by means of an auditory display 
twofold 63). Specifically, this study had the 
simple purpose of determining (a) whether 
ane commands (i.e, right or left) can be 
tional ate more effectively with direc- 
a i D with symbolic cues, and (0) whether 
Weel a of relevant directional and 
r ic cues can facilitate information 
Processing, 
oe is now abundant evidence that in- 
May a processing from auditory displays 
na les 3 ected by irrelevant directional cues. 
to Fe y concerned with reaction time (RT) 
Rudell (190 verbal commands, Simon and 
Processi 967) discovered that the speed of 
trishte, the symbolic content of a command 
Which th or “left”) was affected by the ear in 
research ne command was heard. Subsequent 
hare (Simon & Small, 1969) established 
directi e ear stimulated provides an irrelevant 
tion onal cue which interferes with informa- 
With oe on trials where it conflicts 
à the symbolic content of the commanc. 
Maan (1968) attempted to facilitate the 
Usin tp of verbal directional commands by 
e e ear stimulated to provide a — 
respo; e cue. He asked whether Ss woul 
directi faster to a display which combined 
Sieh tonal and symbolic cues (monaural 
ght” and “left” commands in the ear 


lq: 
This article was accepted for publication by the 


Outro; 

t8oing Editor, Dr. Kenneth E. Clark. . 
Simo, €quests for reprints should be sent to J. Richard 
low, n, Department of Psychology; University of Iowa, 
è City, Towa 52240. 
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corresponding to the content of the command) 
then they would to displays which involved 
either a symbolic cue alone (binaural “right” 
and “left”? commands) or a directional cue 
alone (monaural pure tones). Results showed 
that, while RT to the combined cues was 
faster than to the symbolic cue alone, it was 
not faster than to the directional cue alone. 
Unfortunately, these data could not be used to 
support generalizations concerning the relative 
effectiveness of symbolic versus directional 
cues or the general effect of combining these 
cues. Such generalizations would have been 
use qualitatively different 
stimuli (i.e., tones and verbal commands) were 
used for the different conditions. The present 
obviates this difficulty by employing 
t both directional and 


unwarranted beca 


study 
pure tones to presen 
symbolic cues. 
METHOD 
aratus. The apparatus measured choice RT to a 
wee tones which were presented to S through 
Telephonics TDH-39, 300-ohm earphones. The ear- 
phones were mounted in NAF-48490-1 cushions and 
fixed to a standard headband. The Ss’ task was to press 
the correct one of two finger keys as quickly as possible 
after hearing the tone. Two Hewlett-Packard Model 
200AB audio oscillators were used to generate 95-db. 
tones of 1,000 and 4,000 Hz. Silent switches permitted 
E to present either tone to the left or right ear alone, 
or to both ears simultaneously. A Hunter Klockounter 
started when the tone was presented and stopped when 
S pressed a key. Depressing the key also turned off the 
tone and signaled Æ as to which key (or keys) had been 
pressed. The keys, which S operated with his right and 
left index fingers, were mounted 10 in. apart on a table 
in front of him. f 
Subjects. The Ss were 36 male and 36 female Uni- 
versity of Iowa undergraduate volunteers. All Ss were 
strongly right-handed and reported normal hearing. 
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TABLE 1 


Reacrion Tre (N Msec.) To TONAL DIRECTIONAL 
Cosmtanps as A FUNCTION or CUE PROPERTIES 
OF THE STIMULUS AND RESPONDING 
Bopy MEMBER 


Responding hand 


Cue y 
Left Right x 
3 i 5 71 
Symbolic (S) only 368 374 37 
Directional (D) only 309 298 304 
D & S combined 297 293 295 
xX 325 322 


Procedure and experimental d, 
on three blocks of trials cor 
experimental conditions, A 
used to balance the order 


esign. Each S performed 
responding to the three 
3X3 Latin square was 
of presentation of the con- 
females were randomly 
quences, 


were simply told h: 
block am now going to ch; 
- Your task, h i 


tional and symboli 
terms of the nu 


e all comparable in 
presented. 


and low-pitched tones 


random sequence of 44 tonal 
called for a right 


J. RICHARD SIMON AND JOHN L. CRAFT 


i nd 
The first four responses in each block, two nigar 
two left responses, were practice trials. A bi aici a 
was presented 1 sec. prior to the onset of eac , 
there was a 9-sec. interval between trials. 


RESULTS 


Median RTs were computed for each 5 
both his right- and left-hand responses U ihe 
the three treatment conditions Cen 
symbolic, the directional, and the m A 
directional and symbolic blocks). Table 
marizes the means of these median — hly 

An analysis of variance revealed a hig 


cue 
Sant . i f cu 
significant overall main effect 0 


an | 
(F = 226.02, df = 2, 132, p < .001). „mbolic 
be seen in Table 1 that RT to the sy? | 


cue (371 msec.) was markedly slower thor a 
the directional cue (304 msec.) and (295 
combined directional and symbolic att also 
msec.). However, the analysis of varianc tan 
revealed a significant Cue X Hand 01) 
action (F= 5.24, df = 2/132, p ter fot 
Table 1 shows that while RT was fas 


inde 
the right- than the left-hand response tions 
th 


e directional and combined cue con ic cu 
the reverse was true under the symbo oe 
Condition, Additional analyses ee 
that this reversal of the hand effect un f 
symbolic cue co 
the overall Cu 
when only 
conditions 
Cue X Ha 


ndition was the major n i 
e X Hand interaction, ed cue 
the directional and combine’ the 
Were included in the analys i ifr 
nd interaction was no longer more 

1.73, df = 1/132), Further, 

o the symbolic cue (3 slow? 
cantly slower than et yo ue 
(309 msec.) to either of the other 7 000): 
conditions (F= 114.76, df = 1/132, p it $ 

basis of these findings, 


was still signifi 


Abed 


ter than RT to the dire 
cue (F = 5.34, yy 1/132, p < 025): ; 
The analysis revealed two addition ap 1 
Cant sources of variance. Males T°% Ne 
faster than females (F = 7.65, i cees” 
’< 01), and RT decreased over S% 


ti 
. e 
ted 
the 
i | 
at ® | 
cu 
l 
t 


ba 
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blocks of 
p < 001). 


trials (F = 12.73, df = 2/132, 


Discussion 


P mee ae of the present study are worthy 
a Pa ne is the finding that directional cues 
hema more effective than symbolic cues for 
= ing simple right and left commands. 
ie “ae was decoded much more rapidly 
ee pe le cue was provided by the ear stimu- 
a han when it was provided symbolically 
find as a frequency discrimination). This 
value A underlies ʻ the potential pracne 
Leg vilag directional cues to provide 
cias plore, in auditory displays. It 
fama again that the concept of S-R 
pi aloe ity (Fitts & Seeger, 1953) is as 
ane e to auditory displays as to visual 
Plays (Simon, 1967). 

di secor finding is that information proces- 
aie o statistically, significantly faster when 
vi a directional and symbolic cues were 
e "present in the display than when either 
teh aae was present, This finding fits into a 
“a a y of literature which suggests that, 
T a retain ponditiong, providing redundant 
manot _in a display or simultaneous 
Slee, o different modalities may facilitate 
rmation processing (McCormick, 1964; 
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Woodworth & Schlosberg, 1954). Because the 
observed difference was so small, however, 
this finding is of perhaps greater theoretical 
than practical significance. 
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LINGUISTIC DESCRIPTION 


OF AUDITORY SIGNALS! 


PATRICIA WRIGHT? 
Medical Research Council, Applied Psychology Unit, Cambridge, England 


In three experiments, Ss gave verbal descriptions of tones differing in pitch, ae 
mittency, and modulation, and the extent of the agreement between Ss was nored e 
In Exp. I there was greater consistency across Ss when descriptions referred to the 
tone’s physical characteristics than when onomatopoeic and illustrative terms 


(e.g., bleep, hooter) were used. Onomatopoeic labels appear to convey no more 


specific information than general terms 
ing physical characteristics there was us 


two physical dimensions, the consistency 
the predictions of a model which assum 
dimension independently, then combine 
closely examined the distribution of verl 
data indicated 95% agreement between Ss 
(heard only once and therefore 
were above 600 Hz or below 200 


e 


Hz. 


Although the meanings of many auditory 
signals are easily learned, there are Some oc- 
casions when people must be told before they 
ever hear the signal that sound X means em- 
bark upon a particular course of action. Fire 
alarm signals are a typical example. Inter- 
national telephoning is another (Shoeffler & 
Sherridan, 1966). In this latter case the caller is 
faced with a completely new range of auditory 


stimuli among which he must discriminate, 
Perhaps the simplest w 


EXPERIMENT 1 


Procedure. Answer papers were distributed to 
Ss and the following instructions were given: 
1 This article has been 
outgoing Editor, Kenne 
? Requests for rep 
Medical Research 
15 Chaucer Road, 


accepted for publication b; 
meth E. Clark, i 
rints should be sent to the auth 
Council, Applied Psy, i. 
il, cholo; 
Cambridge, England, Er vk 
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without background or refe: 


(e.g., sound). However, even when label- 
ual 
of the stimulus, although the salient dim 


multiple-choice procedure in Exp. II showe 


lly high agreement for only one feature 
ension varied with different signals. A 
d that if Ss had to label each tone along 


between Ss increased in accordance with 


s that judgments are made along each 


d probabalistically. Experiment III more 
bal labels along the pitch dimension. The 


for the description of isolated pure tones 


rence tones) if these 


crib? 
This experiment is concerned with how people seal 
sounds. I want you to imagine that you kavan y them 
a written message for someone, a message telir a for 
to listen for a certain sound then do something., oF 
example, “When you hear the bell it’s tea tome; 
“When you hear the hooter pack up and o pape” 
ete. This message is written at the top ot for 2° 
only the words saying what you have to liste? ee nea 
missing. (The message written was “When y y pes? 
+ + - do something.”) Your task will be to SUPP or you 
words as you listen to sounds on a tape recor’ n each 
can use as few or as many words as you like, “he an 
sound will be repeated several times, but 


a 
1 pt W 
between different sounds is rather short a sta, 
for a sound to finish before you start writ estion? 
writing whil 


© you can hear the sound. Any 44 ted 
Because it was clear that some S$ ngd = not 
out of the experimental situation and w opp 
writing anything at all, the tape wee „ucted 
after the nineteenth tone and Sswereins't” jp? 
to “think of the tones as being call er st 
hospital. Imagine that you're writing # io 
l refers to which doctor an 


of which cal PEG 
describ ‘ an E This, 
Scribe the tones with this in min “eal? 


orientation had the desired effect of inc 


the number of written descriptions. o 11 
The data were obtained from a group. tb 
adult Ss who were paid for taking P?” 1 
experiment, sig? 
Materials. There were 31 tape-recorde ste 
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pee to the last 12 signals. The physical 
Tabl a of these signal tones is given in 


Results 


Ae be E apparent from the data 
apply ai no readily available vocabulary to 
ee th auditory signals such as telephone 
is blee cd Ss used onomatopoeic terms such 
ee as B uzz, pop, umph, etc. Others made 
kose w neni ion for example, horn type 
Wie’ ume e, siren, etc. These two categories 
erdie od but not necessarily, mutually 
electric a or example, The buzz of an 
either cat or bell. Nor was it necessary to use 
tone. Ge uk ¥ for example, short high-pitched 
‘iieae that o 208 opportunities for response; 
104 en =a e largely onomatopoelc wereused 
this cat S, i ere being 18 different terms within 
101 tin a y; illustrative examples were given 
and the S, there being 36 different illustrations; 
s Ta et gn terms such as tone, note, or 
tities a even no term at all—occurred 34 
trative early, the onomatopoeic and illus- 
ee _Setegories predominated. However, 
eat a little consistency across Ss in their 
diste zese labels. Table 2 shows the frequency 

ribution of the five most common responses 


TABLE 1 
PHYSICAL CHARACTERISTICS OF THE SIGNAL TONES 
Ampli- 
Signal tude 7 1 
tone Frequency modu- Time on Time off 
no. (in Hz) lation (in sec.) | (in sec.) 
(in Hz) 
20 310 0 1.00 1.00 
21 850 32 1.50 1.50 
22 425 16 15 15 
23 620 16 33 67 
24 425 0 2.00 4.00 
25 620 50 50 350 
26 620 16 .03 .27 
27 620 50 .60 5.40 
28 425 50 .03 27 
29 425 16 .60 5.40 
30 425 0 50 50 
31 440 + 480 0 2.00 4.00 


Table 2 shows that response terms are used 
with very little discrimination. For example, 


each of the 12 signal tones was at some time 


described as a bleep, and even within each of 


the two categories there were usually three or 
more labels being used to describe a single 


tone. 

Sometimes Ss ref 
teristics of the soun 
or short, fast or slow. 
1ency distribution 


erred to the physical charac- 
ds, such as high or low, long 
Table 3 shows the fre- 
of references to these 


in bot : : 
‘area the onomatopoeic and the illustrative qu 
SBOHIES: characteristics. 
TABLE 2 
FREQUENCY OF THE Five Most COMMON Descriptions USING 
ONOMATOPOEIA AND ILLUSTRATIONS 
Signal Onomatopoeic descriptions Illustrative descriptions 
S Ei d| Ringi 
ao s Morse engaged.| Ringing | sj 
Bleep Buzz Pip Brr Blip code Hooter | tone tone Siren 
ee e 
D 2 3 A ee be, 2 2 |= 1 
21 pF Z i k 3 & 
5 2 1 = a ; = 
Ee 4 = 3 3 = 2 = 
5 3 — 1 1 2 = — = 1 -= 
24 z : 4 3 2 = 1 r <= ql 
28 3 = z = 1 1 1 2 | = 2 
26 3 = 2 = 1 3 5 = z a 
27 3 3 ae ae ed po 2 z= a 2 
28 1 = 1 a = 3 as E = a 
29 2 3 = 3 — — 1 1 4 = 
30 2 2 ie = 1 2 1 2 Sa ee 
31 1 3 1 3 = = 3 1 1 — 
Total 33 17 11 10 5 11 i 9 9 6 
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TABLE 3 
FREQUENCY oF REFERENCE TO PHYSICAL CHARACTERISTICS OF THE SIGNAL TONES 
R ¥ 
i 
Pitch Rate Duration Interva! 
Signal tone Me j Me- Long 
1O; High g Low | Fast Foa Slow | Short dium | Long | Shor dium | 
i ee _———— es 1 
20 = 1 a se N 3 — 3 2 - re 1 
21 Ie | — oe = a Ai = = $ _ — E 
22 — 2 2 108 -— — 3 — — ; i E 
23 ei tet a T t | = j = 3 
24 — 2 9a = = 2 = = 11° = nee 1 
25 4 2 _ 4 1 = 2 = = = pa z 
26 1 4 to (oe || a 2 Se Wee pE = 1 
27 1 3 1 => — 4) 4 = 3 = — — 
28 — — Os 7 — — — — 4 = S 7 
29 — 1 1 1 — 6 3 — 1 ome 1 F 
30 4 1 — 5 1 — 1 1 3 2 i 2 
31 2 2 2 — 1 1 — 2 8 re 5 26 
Total 35 16 | 36 | 4 3 16 19 6 38 9 
No. references to 
physical char- 0 
acteristic 87 60 63 4 
* More than half the Ss gave this response, ; 
o 
: mber 
From Table 3 it can be seen that for most fast. On such occasions the ee it fot 
Signal tones there was one Predominant re characteristics mentioned was 2.5 her there 
sponse; for example, Tone 24 was high, the present data it is not clear whet : 
whereas Tone 22 was fast, The Notable ex 


expected that if verb 


al labels 
or pips had þ 


fen used, this 
tions such as 
> a Series of high, 


onomatopoeic labels). Th 
characteristics mentioned in conjunction with 
bleep was 1.73 and i j 


jgn 
was a general tendency to deserihe the.” i 5 
tone more fully when not co ego E rerenc? 
noun of some kind, or whether the per: 
Was Primarily a function of indiyiuali" ten ed 
i 8 techniques, for a few peop e thers 
to omit nouns in most of their reponses al 
used illustrations for the majority of any ay 
occasions was there 108 A 
Y a verbal reference to e sud! 
ignal tone, and then ter! 
Y were used, 


Discussion ding 
ae"; 

difficulty experienced in geo 4 i 
down descriptions of the » vere 
at iħey did not have, sighi 
Available for the kinds alts K 
tones used in 4 İs experiment. The rest i 
this out, «sistency across Ss tende 


‘ sjther 
ow, Particular] when people used eith 
trative lables 


es or Nomatopoeic words. ee 
the latter did n appear to convey any sas bei 
Information since the number of adject! nu? n 
with these terms was no less than the «tone 
used With more Seneral terms such as 


From the 
Ss to Write 


was eviden 
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sie of agreement about moms 
the likelihood that en unexpected in view of 
be relatively Ta fara ag symbolism should 
quotes Jespe ng in this field. Weiss (1964) 
lated td a to the effect that words re- 
Symbolism an = show more phonetic 
it is possible T nany other words. However, 
olism are are effects of phonetic sym- 
range of valerie a only under a specified 
and the ke enta conditions (W eiss, 1964), 
tives Present situation might be too insens!- 
Ti o detect such effects. 
these between Ss in the use of 
the findings ae is also low and supports 
and Wood} =a orcoran, Carpenter, Webster, 
that rond (1968). Corcoran et al. showed 
scriptions EF listeners with illustrative de- 
engine noi eads to poorer identification of 
mation at ses than does the provision of infor- 
sounds Chea physical characteristics of the 
Man iay i arly, what is “morse dots” to one 
the ae i a series of pips” to another, and 
interperson © not serve as a useful basis for 
Alt communication. 
when bo agreement between Ss was highest 
there Eee: attributes were _mentioned, 
lowes 3 the problems of which physical 
Without lo it 38 possible to code verbally 
many ahs a, this consistency, and also how 
can usefy ie imensions of a particular stimulus 
scription Q be incorporated into a single de- 
tone” co Le, Is “a high, loud, fast, wavery 
0 mprehensible?). 
ale me first of these problems, data are avail- 
Bet tat study by Markel (1965) which sug- 
along th ERE is high agreement between Ss 
tempo  uecesage of pitch, loudness, and 
substan tis he data from the present study 
tate os this, since pitch and tempa or 
red ty. re the main physical categories refer: 
— the second problem raised above, the 
take nt data suggest that most people seem to 
of t Cognizance of only one physical dimension 
aae Ouni, Tf it should be the case that the 
d of untrained Ss can only refer a 
nen ex signal to one dimension, than this 
e rely limits the number of different tones 
mong which it is feasible to ask people to 
ee ante. It therefore seemed necessary to 
Xamine whether or not Ss could be persuaded 


Pitch Rate 


High Fast 


Medium 


Slow 


Medium Fast 


Medium 


Slow 


Slow Fast 


Medium 


Slow 


None of the above 


Cannot tell 


Tıc. 1. Illustration of page from answer 
booklet used in Exp. IL. 


to use more than one physical dimension for 
categorizing the auditory stimuli. 


ExPERMENT II 


Procedure and materials. Another 22 adult Ss 
h the same signal tones used 
ime they were given answer 
re 1). In half the books 
as replaced by Rate 
and 


were presented wit 
in Exp. I, but this t 
booklets (see Figu 


the heading Duration w 
and the rows were labeled fast, medium, 


slow. 

The Ss were instructed to write the number 
of each tone on the row they thought that it 
belonged, and were told that there were no 
restrictions on how often the same row could 
be used. The numbers 1-31 were written on a 
blackboard as the tones were presented, one 
number for each tone, and Ss used these num- 
bers to designate the tone to which they were 
listening. After every fifth tone, S turned the 
page of the answer booklet. This was done to 

reluctance there might be in 


overcome any 
already contained a number. 


using a row that 


Results 

The consistency between Ss is shown by 
Table 4 which gives the percentage of Ss who 
e modal response on each dimension ; 


gave th 
% indicates perfect consistency. 


that is, 100 
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TABLE 4 


PERCENTAGE OF Ss GIVING THE MODAL RESPONSE 
IN THE MULTIPLE-CHOICE TASK 


Pitch-Rate Pitch-Duration 
Signal 
tone 
no. | Pitch | Rate | Both | Pitch oo Both 
20 82 73 | 64 91 55 | 45 
21 82 55 | 55 91 13. | 73 
22 82 | 100 | 82 82 | 100 | 82 
23 | 100 | 64 | 64 | 100 | 64] 64 
24 64 | 100 | 64 82 | 91 | 73 
25 73 || 35 | 36 82 | 55 | 45 
26 733 | 91 | 64 45 | o1 | 45 
27 73 | 91 | 64 64 | 45 | 27 
28 73 | 100 | 73 91 | 100 | 94 
29 73 | 64 | 45 82 | 55 | 36 
30 45 | 55 | 27 82 | 55 | 55 
31 45 | 100 | 45 45 | 100 | 45 
x 72.1 | 78.0 | 569 | 78.1 | 73.7 56.8 


Table 4 shows that 
ference between the co 


igh, 
ne being 
here was 


only 50% would have agreed to the to; 


there is a sense in which 


us tone along two dimen- 


shown by compari 


] people 
who gave a Tesponse outside the modal cate- 
TABLE 5 

TYPICAL PERCENTAGE OF DISTRRUTIO; 
N OF C 
RESPONSES TO A HicH, Fast Sanr tone i 
Pitch 
Rate a Total 
High Not high 
Fast 50 25 
Not fast 25 == = 
Total 75 25 
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response on either dimension, 


X 
gory in Exp. I and II (see Table Os 
people gave a nonmodal response 1N 

than in Exp. 


z : ate an! 
Test). This was true both for pitch-rate © h 
Pitch-duration, w 


si 
Ss Spontaneously labeled only one phy 


sions of a Signal 
in more accura 


TABLE 6 a 
ESPONS! 
PERCENTAGE OF Ss Nor Givinc THE MODAL RESP 


Exp. II* 
Signal E ——— 

tone Exp. I Pitch- 
no. Pitch-Rate Duration 
20 35 2 0 

21 18 9 0 

22 41 0 0 

23 47 0 9 

24 35 0 9 

25 76 9 0 

26 41 0 18 

27 35 0 0 

28 47 0 0 

29 59 9 18 

30 71 27 0 

31 53 18 3.5 

X 46.5 47 


the modal 
a Ss who gaye a response that did not include 
fore 


axact 
II (p < .01, Fisher Yates Ex 


[thous 


hich suggests that alt cal 


ch 
i Jo SU! 
dimension, many Ss were able to label tw 
dimensions, 


Discussion t- 

1g av 

The data show that drawing people the 
tention to the Physical characteristics 


stimulus and Specifying the features tO ret) 
they must attend, increases the pon w 
between Ss, even when considering ‘agint 
Sponses along a single dimension. Ave cate? 
the items with Superscripts in Table 3 ine c, jn 
approximately 


Exp. I; Table 


pich 


ures such as pitch “ae ane 
© not always do so SP ist” 
pusl Nevertheless, the degree of consi? ine 


iw? 


L 
erefore, referring to T resul! 
l tone will not inevitab m ; 
te identification of the § 
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p os if the stimuli to be discriminated 
Ebene a e along two dimensions, then 
oe. dimensions can be expected to 
Eston’ of aumben of people who select the 
reduction signal (e.g., Table 6 shows that this 
Obtained x ig rom over 40% incorrect choices, 
3% iio using pitch alone, to less than 
either n when using pitch together with 
two he e or duration). Thus, reference to 
ee RE more useful for the response 
PSA which it excludes rather than for 
7h which it includes. 
(Beet ea data on the effect of combining 
of the midi can be accounted for in terms 
auditory a or recognition of two-dimensional 
This ge proposed by Corcoran (1967). 
predicts the termed the independence model, 
stimulus at 5 an observer’s judgment of one 
fluenced Ppt (e.g. pitch) is unin- 
(ES. rat y the value of the other parameter 
the ae e), then the probability of perceiving 
pro inet nt stimulus correctly is simply the 
Table th perceiving each part correctly. From 
Soa Bi is clear that the both columns 
eolmne. ate the products of the other two 


Experiment II 


One fe i : ee . 
ne factor which might limit the generality 


o A 
eo 1 ae from the two previous experi- 
of Sonal hat in both studies Ss heard a range 
few a sonte Te Js possible that the first 
against i might create a frame of reference 
his Wa hich later signal tones are evaluated. 
agreeme turn might mean that the observed 
due lar, nt between Ss is spuriously high, being 
refere gely to all Ss using the same terms 0 

nce, 

eos eeu to check this, „to obta 
of lin etailed information on the distribution 
sion oe categories within the pitch dimen- 
jie noe i asked to label the pitch of pure 
pa tones heard only once. 
adult ra and materials. 
told t s (numbering 13-17 Ss per st 
tele has imagine that they had just di 
a Phone number, that they were listening for 
Minin tone, and that they then heard the 
Ss nd that was about to be played to them. The 
S were instructed to write down whether this 
Ound was high or low. A tape which recorded 


and also to obtain 


Eight groups of 
oup) were 
dialed & 
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“VARIATION IN VERBAL RESPONSE AS A 
FUNCTION OF THE FREQUENCY OF AN 
ISOLATED SIGNAL TONE 


a xX 


fe} 
fe) 


E 
ho “High” Responses 


Frequency of Signal Tone 
a function of the 


Fic. 2. Variation in verbal response as 
frequency of an isolated signal tone. 


22 sec. continuous pure tone was then played. 
The pitch of this tone was different for each of 
the eight groups. No S heard more than one 


tone. 


Results 


The percentage 
responded “high” 


of the signal tones. 
Clearly, there is agreement between Ss that 


600 Hz are high, and those below 


D 


of Ss in each group who 
is shown in Fig. 2 for each 


tones above 
200 Hz are low. 


Discussion 

sults show clearly that agree- 
nt between Ss was not dependent upon a 
eference having been introduced 
ntal situation. People would 
stablished reference points, 
ch dimensions, prior to 
entering the laboratory. Moreover, there was 
fairly wide agreement on what these reference 
points are (viz, 200 and 600 Hz). 

All this is not to argue that the label given 
to a signal is invariant over contexts, there is 
ample evidence to the contrary (cf. the effects 
of context, duration, intensity; Woodworth & 
Schlosberg 1954, PP- 229, 335.), but rather to 
suggest that when there is no readily available 
context, then 


The present re 


mei 
frame of T 
into the experime! 
appear to have e€: 
at least along the pit 


a tone above 600 Hz can con- 


fidently be described as high. 
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Perhaps there are also reference points com- 
mon to the majority of adult Ss in other physi- 
cal dimensions (e.g., rate, modulation, ete). 
Little seems to be known about this aspect of 
verbal communication, about what stereo- 
types are common to adults, and how widely 
any particular commonality extends particu- 
larly across occupations. (The majority of Ss in 
the present experiments were housewives.) 

In conclusion, the results of the present 
studies suggest that, in general, Ss do not have 
a readily available vocabulary for describing 
auditory signals such as those used by tele- 
phone companies, although there may be popu- 
lation stereotypes for some Physical character- 
istics of such tones, and it is possible to direct 
people’s attention to the relevant physical 
dimensions, 
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BINOCULAR COORDINATION IN READING? 


KARL U. SMITH, ? ROBERT SCHREMSER, anp VERNON PUTZ 


Behavioral Cybernetics Laboratory, University of Wisconsin 


red the differences in time between the two eyes, using labo- 


This experiment measu 
ratory real-time compu 
peaks of binocular saccadic movements inr 
of being completely 
ment, the eyes must 
that govern their feedback guidance and 


time differences between 


including no difference and left eye leading 


msec.; and (c) left eye leading by 14 msec. 


to the difficulty of the reading materi 


zontal rotation of the reading display. 


the eyes are perfectly conjugate in saccadi 


data toward a dynamic li 
disabilities in visual perception. 


The classical view defined by von Helmholtz 
(1924-1925) is that the two eyes are exactly 
conjugate in space and form perception. The 
modern statement of this view holds that 
exact time synchronization is characteristic of 
both saccadic and pursuit movements (Alpern, 
1962), The only dissent from this classical 
belief has been from Stromberg (1938), who 
found large differences between the positions 
of the two eyes prior to and after saccades. 
Alpern (1962) and Yarbus (1967) reported 
near perfect identity of timing of the two eyes 
in binocular motion. Yarbus, however, con- 
firmed Stromberg’s observations on differences 
m eye positions, but concluded that these 
differences were related to the fact that a 
change of fixation in binocular space percep- 
tion is composed of two independent move- 
Ments—that is, convergence and divergence 
and a saccade—but that binocular movements 
during a saccade are identical. 

The problem of synchronism of 


In different types of movement—Sace 
ae 


the two eyes 
adic, 
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Rehabilitation Administration and was carried out in 
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feedback doctrine o 


ter methods to detect and measure the time between velocity 


eading. The hypothesis was that instead 


conjugate as indicated by prior methods of ocular measure- 
be coordinated in directional motion by small time differences 
relative velocity. Results indicated that the 
the eyes clustered around three values: (a) near synchrony, 


by 1 msec.; (b) left eye leading by 7-9 
These time differences were not related 


al, but were changed significantly by 15° hori- 
The results change the established views that 


c motion and provide initial suggestive 
f coordinate eye motion and functional 


pursuit, divergence, and convergence—repre- 
sents an almost unexplored field because exact 
methods for recording binocular eye movements 
and timing them have not been developed. 
However, in vision, as in all other psychophy- 
siological systems, exact timing may be 
assumed to be fundamental to all critical 
dynamic operations related to the control of 
velocity and integration of movements. The 
view behind this research is that feedback 
timing—the interval defining the dynamic 
closed-loop control of retinal input by ocular 
motions—is just as critical as ocular and 
optometric factors in governing space percep- 
tion. This hypothesis was suggested initially 
by results of experiments which showed that 
eve-movement-retinal feedback delays of .1 
sec. and less impaired vision and accuracy of 
eye tracking (Smith & Molitor, 1969; Smith, 
Putz, & Molitor, 1966). 

The present experiment has been conducted 
to develop real-time laboratory computer 
methods to measure the time differences 
between the two eyes with high precision, and 
thus to evaluate the classical view that there is 
exact synchronism between binocular saccadic 
vements. The creation of an analog-digital- 
analog computer system as a machine for 
dynamic vision research has made possible 
letailed types of experimental measure- 


mo 


many € 3 
ments on vision, of which measurement of 
binocular time synchronism is only one of 


many possible innovations (Smith, 1966; Smith 
& Henry, 1967 ; Smith, Putz & Molitor, 1962), 
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» Cepending on whether 
ingly or Successively in a line. Roughly, 
havea laten: 0 msec., 

ubsequent in-line Movements being 
Teduced to less than a third of that value. The 
e 
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r {j 
and a special mana 
of pa 
mental program used to detect the occurrence Pi 


A igh rates 0 
experimental computer programming hen the 
digital sampling to measure the interva i 

i inocular saccades, in th 
eo ae eye-movement transducers used 
study are shown - photoresstrs fo 

tores 
each goggle aperture. The light and the E 
are located below the line of sight yin small re 
So that S has clear vision in each eye. sistors, whic 
lights illuminate the corneas, The eo in the 
are arranged in a bridge circuit, sense sa E ariel 
i of light reflected from th 
the eye Moves in a horizontal plane, f computer 
he experiment involved two miet a phase 
control of experimental operations, In t ai edule trials 
the hybrid computer system was used to sc i data 0” 
and to control recording of the eye-moveme sonents 0 
magnetic tape, The arrangement of the ve aw n 
the computerized vision laboratory are a módi ie 
Figure 2, The S was seated comfortably ad Sona no 
dental chair and arranged so that the hea cers We 
e moved after the eye-movement nr geo Jocate 
arranged in Place. The reading material rot in 
i Ss eyes at a distance o! nt and t0 
to monitor the experime ragnetiC 
of the eye movements on D ‘digita! 
(Control Data Corporation whic 
as Programmed to generate trial d on 
ed both on the oscillograph Ee on the 
tape recorder, The recorded mar. ntation 
oscillograph were used by Æ to control presen 


3 ne W! 
8, while those on the magnetic ie 
used to identity trials in the recapturing of om a 
in Figure » the electrical signals ne elka 
transducer were regulated ai, p 
gain control and then amp! “chan e 
were then recorded on rl of th 
and on Separate channel iso W 
J Tecorder, These same signals rentiated 
transmitteq to di erentiators and these differ 

d 


The dis! 
orded on the oscillograph. The CÈ of 


each 


roel; 

o, Ogrammed to sequence the jals oi 

rials an, generate signals to code these tri 

ird channe] of the magnetic tape recorder. run, 
After series of trials for a particular S men np d 
ata for € trials Were processed by the a pai a 

System Sure the time differences betwee a 

Saccades of t in 


e p 

These al 

i ar eye movements. Aa inter 
ere made y determining the tim Fige 
between the peak velocity of paired saccade syste 
Ae + Components of the 4 te make MA 
Of the Vision laboratory Were arranged to aire re 
ene log signals of the Teder, va 
S, as regenerated by the tape “timterentiate 
and di “rentiated. These diffe e 
als were 


verte 
ital computer conv 


igital Cpe | 
en multiplexed for digital “i, 
An analog-digi 1 


Mp 
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DIGITAL 
COMPUTER 


DIGITAL-ANALOG CONVERTER 


FM TAPE RECORDER 


"1G, 2. Components of the real-time computer syst 


READING DISPLAY 


em used 


in recording eye movements. 


multi 

ae ye-movement signals to digital form at a 

each of a of 1,000 samples per second and measured ¥ 

Measured ese signals on a nine-bit digital scale. These 

computer ponals were then selected by the digital 
The di Ti the differential time measurement. 

Deak vel git al computer was programmed to detect the 

velocity ocity value of the two digitized eye-movement 
etw. y signals and to compute the time interval 
etween the two values at the sampling rate used 


i, 
i 1,000 samples per second). The computer program 
-S0 included procedures for indexing these computed 
After the 


i 
epee and for storing them in memory. \ 
Tun, th ic tape record for a complete series of trials was 
Pape time data were punched out in digital form on 
“he tape and in a form for later statistical analysis. 
operat actual program for the time measurements 
blip a in terms of several steps. The in velocity 
a valu the two digitized movement signals was assigned 
Or th © of 0000 and the computer thereafter searche 
Sena occurrence of a second peak value in the incoming 
bredet Ir „this second peak value occurred pist 4 
‘alu ermined elapsed time of 20 msec., the sampe 
exaro of this elapsed time was indexed and stored. For 
mple, if the second peak velocity value occurred 
msec, after the first peak, a value of 0005 was assigned 


this second peak value. However, if the second peak 
‘value occurred in the same signal channel as the first 
peak, the program blocked and stopped scoring a 
particular trial. Eye blinks typically caused such" out- 
lying data and accounted for most of the failures to 
obtain data for a number of trials. To improve the 
success in obtaining measured data from recorded trials, 
only the last 36 sec. of each 1-min. trial were processed 
for time measurements. 

An example of the records on which the time mea- 
sed is shown in Figure 4. In addition 
to the time line indicating seconds, the two top lines 
of the record represent the right and left eye move- 
ments, respectively, while the bottom lines represent 
the differentiated velocity records of these two 
movements. 
FF The reading material used was taken from a collec- 
tion of readings, each passage of which was ranked 
according to the Flesch Ease of Reading Scale. Three 
levels of reading difficulty were used. In the ranked 
scale used, easy material had the higher numerical value. 
As expressed by this scale, the three easy passages used 
had ranks of 85, 84, and 79; the three medium level 
passages had ranks of 62, 63, and 64; and the three 
difficult passages had ranks of 33, 41, and 43. Each of 


surements were ba: 
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MPLIFIER 
LEFT EYE MWE. 


FM TAPE RECORDER 


DIGITAL COMPUTER 


MULTIPLEXER 
Fic. 3. Components of the real-tj 


-time computer system used in measuring the 
time differences between the two eyes, 

the nine 
1-min. 
period oj 


Passages was used for two reading trials of 
each which were run consecu 


tively, A rest 
f 30 sec. occurred between trials. 


„ced, 
ntar 


The reading m 
and had a Jine | 
tions from the o 


: : le spé 
aterial was typewritten, dout inde 
ength of 6 in. The pe ETP ig prov! 
riginal passages were altered 


TIME (SECONDS) 
A 


ALET A \ 
TTT Mie 


Y Was based on 
any other 


| at a a, 
LEFT EYE ae | 


= i mi aH Md ba i Wl yaya s | 


f 
i A jsion 
ing and velocity movements. (The high Preity 
detecting the very Short intervals of the ve cates 
Part of the Velocity signal would not have been as accur@ 


peaks, Measurement based on 


=. 
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eo ott R reading display, so that each display 
Gch ie i a two paragraphs 8-10 lines in length. For 
materi ie reading display included 18-20 lines. The 
aitelepe was typed on yellow paper and displayed on 
the sr es This device provided rear lighting of 
passage -a end allowed Æ to advance from one reading 
iie bas next by remote control. The S was 
ite ofc Ea read the material silently at his normal 
completed. me In all cases the passage material was 
ala within the elapsed time of the experimental 
M varying the difficulty of the reading material, 
ihe fee S of the reading display was varied by rotating 
Three È rompter through 15° to the right and the left. 
Suet used and were run ina repeated-measures 
reading pa al design that encompassed two trials per 
medium’ aie three levels of reading difficulty (easy, 
reading freee difficult), and three orientations of the 
all, A Baay ay (normal, 15° right, and 15° left). In 
suitisi vere 18 trials for each S. The following scheme 
itidjedtes tE, the order of trials (T) for each S and also 
discaraea x trials which gave outlying data that was 
ioy ai n the scheme, Dı, Dz, and Ds stand for 
respecti ie and high levels of reading difficulty, 
normal vely, and Ou, Os, and Os represent right, 
Asterisk and left orientations of the reading display. 
Tecaptu: s indicate trials from which data could not be 
red, 


S Dy A O: 0: 
Di: Ds Dz Di D; „Də „D: „Di 
TiTa Tits * Ta TiTa Tr * TiTe TiTa Tr * TiTa 

S: O: O: O: 
1 Ds Di Dz Ds „D, „D: „Dı 
TiTe Ti * TiTa 


RESULTS 


Pind — of the experiment consist of 
which me which represent the extent to 
aus e right or left eye led the other eye 
sinters ths saccadic reading movements. The 
nk, the time data obtained may be indi- 
tion y the overall means for Ss and orienta- 
ti conditions. The grand mean binocular- 

me difference for all three Ss was 6.7 msec. 
rue represented a leading time value for the 
det eye. The precision of this measurement, as 
isles by generating and processing of 
Se ificial electrical inputs to the computer 
fo Stem, was +1 msec. The mean time values 
in all three Ss were 4.0, 9.0, and 7.1 msec. 
Sen Sə, and Ss, respectively, all repre- 

Ing a left eye lead. 

The mean time differences for the three 

S at the three different levels of orientation 
of the reading display are summarized in 

‘able 1, All of these differences represent left 
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TABLE 1 


Mean Lert EYE TME LEADS FOR THE THREE 
Ss av THE THREE DISPLAY ORIENTATIONS 


Orientation* 
Subject 
Right Normal Left 
Sı 4.0 5.6 4.4 
S2 9.9 79 9.2 
Ss 8.0 5.5 7.2 


_ Note.—All of the values in the table represent left eye leads 
(in msec.). 

a For orientation conditions F =3.08, p >.05. 

eye leads. The detailed data on which these 
mean values are based are summarized in 
Figure 5. These curves plot frequency of mea- 
surements of left eye and right eye leads in 
relation to time in milliseconds. Perfect 
synchronism or identity of binocular move- 
ments is indicated by a zero time value, right 
eye leads by values given to the right of the 
zero values, and left eye leads by values to the 
left of the zero values. The top curve in 
Figure 5 represents right rotation of the dis- 
play; the middle curve, normal orientation; 
and the bottom curve, left rotation. 

The meaning of the differences shown in 
Figure 5 can be described in still another way 
in terms of the percentage of left eye and right 
eye leads that occurred in the measurement. 
These values are summarized in Table 2 in 
relation to the three conditions of orientation 
of the reading display. The table indicates 
that the left eye led the right eye 71%, 80%, 
and 86% of the time in the three conditions, 
whereas the right eye led the left in 8%, 9.5%, 
and 16 % of the measurements. There were 
no differences between the two eyes in 6%, 
10.5%, and 13% of the time in the three con- 


ditions of orientation. 
TABLE 2 
PERCENTAGE OF LEFT EYE AND RIGH 


T THE THREE CONDITIONS OF ORIE: 
OF THE READING MATERIAL 


T EyE LEADS 
A NTATION 


Left eye | Right eye No 


Orientation , 
cond. lead lead difference 
15° left 80 9.5 10.5 
Normal 71 16 13 
86 8 6 


15° right 
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case of the right rotation condition, these 

modes occurred at 1, 8, and 14 msecs. 
The trimodal distribution of eye leads which 
appeared for the distribution of overall mean 
values for orientation conditions was found 
RIGHT -ORIENTATION for each of the three Ss. Figure 6 summarize 
the distributions of eye leads for the three Ss 
for the least difficult reading passages. These 
curves show that S, differed from the other 
two Ss in giving a modal right eye lead at 
about 5 msecs. This S showed a relatively low 
modal lead value at 16 msecs., and a very high 
frequency left eye lead at 1 msec. In contrast, 
S2 showed a high frequency left eye lead at 
NORMAL ORIENTATION 14 msec., and a low left lead mode at 1 msec. 
Sz showed a somewhat different distribution, 
with the highest modal frequency at 0 msec. 


DUH MEENA 202468 DE HE Be 


D BSwUPYBEY 20 2 


FREQUENCY 


SUBJECT 1 


LEPT ORIENTATION 


D BÉURE 202 ub BEE 


Fic. 5. Distribution of the frequency of time-leads 


of the left and tight eye for different conditions of 
orientation of the reading material, 


To determine if the 


SUBJECT 2 
the three orientatio, 


however, was different from the normal orien- 
tation condition, 


The actual distribution of tim 


e differences 
between the two ey 


SUBJECT 3 


ng display, For the left 

(bottom curve), three 

curred at 1,9 ae ie 
at Fic. 6. Distribution of gh 

al condition, the three of the left Pn care. 

modes occurred at 1, 9, and 15 msec: 


. s a5 
X ight eye for different Ss with the lë 
S. In the difficult of the reading passages, 


modal time values oc 


fasana 


and nearly equal modal frequencies at 14 and 
msecs. These S differences in modal fre- 
quencies at different time values were dupli- 
Cated almost exactly in the other two condi- 

tions of difficulty of the reading material. 
eee are limitations to the data besides the 
at a considerable number of trials could 
be completely scored. In addition, the 
Omputer procedures detected a considerable 
number of unpaired eye movements, as 
defined by the criterion of time pairing of 
5 oe Table 3 summarizes the number of 
= Ss ae measures for the three Ss and 
ners ifferent orientation conditions. Table 3 
which ne separately the number of examples in 
ment m was no pairing for a left eye move- 
io and for a right eye movement. It is not 
Own to what extent these failures to find a 
Eo movement represent errors in detec- 
of te actual independent movements of one 
in the eyes. The table shows that Ss differed 
efta hero of unpaired movements for the 
o ipa right eye. Sı showed the fewest number 
a Sapate movements and three to four times 
meate a left eye and right eye unpaired move- 
RS oe S2 showed roughly four to five times 
ap wy unpaired movements as Si and 
'] Proximately 1.3 times as many left eye and 
t eye unpaired movements. In the case of 
— number of unpaired left eye move- 
eben about the same as that for right eye 
id mene The program used in this study 
Proport; make it possible to determine what 
teprese ìon of these unpaired movements 
ut f nted actual independent movements, 
uture research is planned to determine 

Cse Proportions. 


SUMMARY 


wa tl-time laboratory computer methods 
Ment developed for high precision meagre 
Ves į o: the time differences between the two 
as te n reading and other visual tasks. The aim 
at pidetermine the validity of the hypothesis 

N a 'nocular time differences must occur as 
din of dynamic feedback yoking of the 
result, yes in coordinate ocular motions. The 
es; "S found are in keeping with this hypoth- 
Sener. very small time differences = the 
eadi al order of 6.6 msecs., with the left pr 
dif ing, were found in all three Ss and under 
erent conditions of difficulty and orienta- 
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TABLE 3 


NUMBER OF UNPAIRED MEASUREMENTS FOR DIFFERENT 
SS AND DIFFERENT ORIENTATIONS OF 
THE READING MATERIAL 


Subject or Left-eye Right-eye 
orientation unpaired unpaired 
Subject 
Sı 115 32 
S: 256 193 
S: 202 223 
Orientation 
15° left 185 131 
Normal 171 129 
15° right 207 172 


Note.—These frequencies represent about 20% of the total 
number of paired movements which were scored. 


tion of the reading material. The precision of 
the computer measures was determined to be 
of the order of 1 msec., and was achieved by 
the special procedures of velocity measure- 
ment of eye movements and computer pro- 
gramming used in the study. 

The main finding was that not one, but three 
modal frequencies occurred in the distribution 
of the time differences between the two eyes, 
with the left eye leading in each of these cases. 
These modal values of 1, 8-9, and 14-15 msecs. 
occurred at different orientations of the reading 
material. This trimodal distribution of bi- 
nocular time differences was found consistently 
in all three Ss. The basis of this unusual dis- 
tribution is not known, but possibly may be 
related to the time discrepancies for the short 
and long saccadic movements in reading. The 
results permit the assumption, to be tested in 
further study, that the longer saccades of 
reading produced the longer left eye leads, 
and that the shorter saccades produced the 
shorter leads. 

The finding that the left eye led the right 
eye in the great majority of the measures made 
in this study has a limited counterpart in some 
observations by Hyde (1959). She found that 
“outward” movements of the eye from a 
center position were of lower velocity than 
“inward” movements from an outlying to the 
center position. The difference was 58° per 
second. Hyde judged that the eyes are com- 
pletely conjugate since she could not detect 
time differences within the range of precision 
that her film methods permitted (i.e., 1/164 
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sec.). However, her data suggest that if 
velocity differences can occur in ocular move- 
ments of different direction, binocular time 
differences also may occur for “inward” and 
“outward” movements. Since most motions, 
including reading, may be biased directionally 
by sidedness, it is possible that in eye motions 
the left eye may be required to perform 
“inward” direction movements for a greater 
period of time than the right eye, and hence 
could display a greater velocity of motion and 
a corresponding time lead in motion over the 
right eye. 

The results complement the findings of other 
studies on delayed eye 


govern eye motion, 
well as direction Specific in dete 
guidance and Coordination of the two eyes, 
The results also give some added substance to 
previous findings (Smith & Molitor, 1969) that 
space and form Perception and its develop- 
mental deficiencies may depend on precise 
modes of feedback timing in control of saccadic 


and pursuit movements, This research em- 
phasizes the need for mo 


specific as 
Tmining the 
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standing the basis of 


visual 
disabilities. 


perceptual 
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EMPLOYEE REACTIONS TO JOB CHARACTERISTICS! 


J. RICHARD HACKMAN? anp EDWARD E. LAWLER III 


Yale University 


A conceptual framework specifying the conditions under which jobs will facilitate 
the development of internal motivation for effective performance was developed and 
tested. The Ss were 208 employees of a telephone company who worked on 13 
different jobs. Primary independent variables were: (a) a measure of strength of 
desire for the satisfaction of “higher order” needs (e.g., obtaining feelings of accom- 
plishment, personal growth); and (b) descriptions of jobs on four core dimensions 
(variety, autonomy, task identity, feedback). It was predicted and found that when 
jobs are high on the four core dimensions, employees who are desirous of higher 
order need satisfaction tend to have high motivation, have high job satisfaction, be 
absent from work infrequently, and be rated by supervisors as doing high quality 
work. A number of supplementary analyses were reported, and the implications of 
the results for future research on job effects and for the design of jobs were discussed. 


_ Researchers and managers alike are increas- 
ingly attending to the way jobs are designed 
as an important factor in determining the 
Motivation, satisfaction, and performance of 
employees at work, This is not to say that 
Jobs Previously have been seen as irrelevant 
to Organizational administration. On the con- 
trary, earlier in this century when scientific 
Management was in its prime, considerable 
research effort was expended to find ways that 
Jobs could be simplified, specialized, standard- 
ized, and routinized. At the same time, in- 
dustrial Psychologists were developing rather 
Complex and sophisticated procedures for de- 
Scribing and analyzing jobs in terms of their 
Simplest components, as a means of evaluating 
the skill levels required for different jobs. The 
results of job analyses have been used to 
aires ) 
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establish fair rates of pay, for training pur- 
poses, and in personnel selection (see, e.g., 
Ghiselli & Brown, 1955; Lytle, 1946; Stigers 
& Reed, 1944). The general expectation of the 
scientific management approach was that by 
simplifying jobs, work could be carried out 
more efficiently; less-skilled employees would 
be required; the control of management over 
production would be increased ; and, ultimately, 
organizational profits would be enhanced. 

In recent years, numerous scholars have 
documented a number of unintended and un- 
fortunate consequences of the trend toward 
work simplification (e.g., Argyris, 1964; Blau- 
ner, 1964; Davis, 1957; Friedmann, 1961; 
Guest, 1955; Herzberg, Mausner, & Snyder- 
man, 1959; Walker, 1950; Walker & Guest, 
1952). In brief, it has been shown that simple, 
routine nonchallenging jobs often lead to high 
employee dissatisfaction, to increased absentee- 
ism and turnover, and to substantial difficulties 
in effectively managing employees who work on 
simplified jobs.’ The expected increases in 
profitability from work simplification have not 
materialized as had been hoped, and the 
reasons apparently have very much to do 


3 These observations have not, however, gone un- 
challenged. See, for example, Kilbridge (1961) and 
MacKinney, Wernimont, and Galitz (1962). 
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with the human problems encountered when 
jobs are standardized and simplified. . 

Partially in response to the above findings, 
a number of researchers began experimentally 
enlarging various jobs to determine whether 
or not worker productivity and satisfaction 
would increase if jobs were designed so as to 
be more generally meaningful and challenging 
to employees. By and large, those job enlarge- 
ment experiments which have been reported 
in the literature have been considered success- 
ful (see, e.g., Biganne & Stewart, 1963; Conant 
& Kilbridge, 1965; Davis & Valfer, 1965; 
Ford, 1969; Kilbridge, 1960; Pelissier, 1965). 
With few exceptions, however, job enlargement 
experiments have been case studies and often 
have lacked appropriate experimental con- 
trols. Hulin and Blood (1968) review the re- 
search literature on job enlargement in some 
detail and are especially attentive to possible 
difficulties in procedure and methodology which 
may cast doubt on the generality or the va- 
lidity of the findings reported. 

Perhaps equally as disturbing as the uneven 
level of methodological rigor which has char- 
acterized job enlargement studies is the 
total absence of any systematic conceptual or 
theoretical basis for the studies which have 
been done. As a result, after doz 
ments, little cumulative knowledge has been 
gained regarding the effects and effectiveness 
of job redesign. Job enlargement experiments, 
for example, typically have involved a number 


of simultaneous changes—such as 
amount of variet 


of responsibility req 
working with other: 


almost 


ens of experi- 


anges, 
of job enlargement 


n (e.g., whether they 
rtain types of workers 
or whether they are relevant only to certain 
kinds of jobs). More case studies are not 
likely to contribute very much to the develop- 
ment of answers to cr 


ucial questions such as 
these. Instead, what appears to be needed are 
Conceptual frameworks y ate test- 


which gener. 
able propositions about job character- 


how 
istics affect employees under various circum- 
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stances, and empirical research which is de- 
signed explicitly to test these propositions. 
This article proposes one way of conceptualiz- 
ing the impact of job characteristics on indi- 
vidual work behavior and attitudes. It then 
reports data which were collected to provide a 
preliminary test of that conceptualization. 


Previous Theory and Research 


Some progress toward the development o 
theory relevant to job design has been made 
in recent years. The well-known eg 
theory of Herzberg (Herzberg, Mausner, ° 
Snyderman, 1959; Herzberg, 1966), for ex- 
ample, can be used to derive general pee 
tions regarding conditions on the job which 
will be motivating and satisfying to employe 
In particular, the theory suggests that a jo 
should enhance employee motivation to the 
extent that it provides opportunities for (e 
achievement, (b) recognition, (c) responsibil 
ity, (d) advancement, and (e) growth in COM: 
petence. These principles have given rise to s 
series of generally successful job enlargemer 
experiments in the American Telephone an 


Telegraph Company (summarized by Fords 
1969), 


Unfortunately, 
been unable to pi 
some of the maj 


a number of researchers hav® 
rovide empirical support Jor 
or tenets of the theory from 
which the principles used in the AT&T studies 
were derived (e.g., Dunnette, Campbell, 5 
Hakel, 1967; Hinton, 1968; King, 1970), an 


the general conceptual status of the theory 


to specify how characteristics of workers 
interact with t 


e 
: esence or absence of m 
nditions in qetermini” 


TH Be neasured for existing joP* 
his makes it very difficult to test the pa 
mbi ictions 11" 
it about the effects pie a which 
may be contemplated for existing jobs. a 
Measuring job charact® 
licitly and carefully dey 
d Lawrence (1965). Th of 
operational measures 
attributes” which, 0” 


i 


i 


ee Oe TTT 
[= a i 


Se oF a review of existing literatureand an 
a conceptual framework, were predicted 
antl ee related to worker satisfaction 
—— endance. The six attributes are: (a) 
(a) ge n0) autonomy, (c) required interaction, 
skill ‘sen interaction, (e) knowledge and 
on e aie, and (f) responsibility. Scores 
Kae a attribute for each of 47 different jobs 
lie from field observations and 
ecient by the researchers, and precise 
attriby ha of how scores on each of the 
Ration, F is determined is provided. Exami- 
requisite the relationships among the six 
ealed a task attributes for the 47 jobs re- 
related me the attributes were very closely 
awra o one another. Therefore, Turner and 
the R ce developed a summary measure called 
Tails, ‘oy be Task, Attribute Index (RT A 
of the 3 y formulating a linear combination 
Sinn s3 separately measured attributes. T 'his 
the be) Bae was then used in ascertaining 
the eg between the attributes of 
a and worker job satisfaction an 
ance. 
foes authors’ expectation 
hies on jobs which were on th 
ser would have higher job satisfaction and 
Tater Eee was not fully supported. 
tionsh d, it appeared that the predicted rela- 
E between the RTA Index and employee 
boat held only for workers from factories 
Settles in small towns. Workers m urban 
jobs a reported less satisfaction with their 
Tide k iei the jobs were high on the RTA 
ont and the RTA Index was unrelated to 
cee eeism for urban workers. Turner and 
Aenean: (1965) argued that the obtained 
Tode ences in reactions to good (i.e. high RTA 
diff x) jobs were substantially moderated by 
erences in the cultural backgrounds © 
employees, 
peod and Hulin (1967) and Hulin and 
Üi od (1968) provide additional data on the 
ee of subcultural factors 1 aae 
th oe worker responses to the makeup 2 
it us Jobs. These authors hypothesize that an 
ee tant moderating factor is alienation zom 
the traditional work norms which character- 
ìze the middle class. When employees hold 
traditional values regarding the value of work 
and achievement in work settings as wou 
be expected of the employees in small town 


that employees 
high on the RTA 
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factories in the Turner and Lawrence study), 
more complex jobs should be responded to pos- 
itively. When employees are alienated from 
these norms (as might be expected of urban 
workers), more complex jobs should be re- 
sponded to negatively. Blood and Hulin 
(1967) provide data supporting this general 
proposition and propose & three-dimensional 
response surface (Hulin & Blood, 1968) which 
specifies the expected interrelationships among 
worker alienation, job level, and satisfaction 
with work. 

The data of Turner and Lawrence (1965) 
and of Blood and Hulin (1967) are sufficiently 
compelling that the generality of the strong 
proposition that enlarged jobs (in the sense of 
being high, e.g., on the dimensions which make 
up the RTA Index) lead to improved satis- 
faction, attendance, and/or performance on 
the job must be called into question. Instead, 
it appears that certain characteristics of the 
employees themselves must be taken into 
account simultaneously with the character- 
istics of their jobs in order to generate valid 
predictions about the behavioral and affective 
responses of employees at work. 

If the above conclusion is accepted, thorny 
questions then arise about how the relevant 
differences among employees are to be con- 
ceptualized and measured. Both Turner and 
Lawrence (1965) and Hulin and Blood (1968) 
choose to deal with individual differences on 
a subcultural or sociological level (i.e, in 
terms of differences between town and city 
workers or in terms of the alienation of city 
workers from middle-class work norms). 

An alternative strategy would be to attempt 
to conceptualize and measure the relevant 
individual differences directly at the individual 
level of analysis. The town-city conceptualiza- 
tion assumes @ substantial homogeneity of 
worker characteristics and response tendencies 
for employees within the two cultural settings. 
To the extent that there are substantial in- 
dividual differences among town workers and 
among city workers, an attempt to measure 
relevant individual differences directly at the 
individual level would seem to have consider- 
able merit. The difficulty in implementing this 
alternative approach, of course, is that it 
requires prior specification on a conceptual 
level of what specific differences among people 


262 


are responsible for the results reported by 
Turner and Lawrence (1965) and Blood and 


Hulin (1967) (i.e., what it is ab 
moderates the way they react 


out people that 
to their jobs). 


In the following paragraphs, we will propose 
such a conceptualization, and derive from it a 


number of predictions about 


the effects of 


job characteristics on employee satisfaction 


and motivation. 


Jobs and Individuals: A Conceplual Framework 


The present conceptualization of the inter- 
action between job characteristics and indi- 


vidual differences is based prim: 


expectancy theory of motiv. 


and as applied to work s 
(1964), Porter and L 
In particular, five p 
pectancy theory are sug 
address the specific probl 
motivation can be enh 
of jobs. 

1. To the extent that an ind 
that he can obtain 
engaging in some particular b 
of behaviors, the likelihood 


actually engage in that behavi 


Relevant valued ou 
trinsic (e.g., feelings 
personal worth) 
goods); the only 
comes be v 
individual anticipates obt 
outcome as a result of 
course of action, that 
an incentive for eng; 

2. Outcomes 
the extent th 


requirement 


a 
al to do so, Such n 


R be in the 
Interest of the indi 


ing states of 


ndividual, Peo 


arily on the 


ation, as formu- 
lated by Lewin (1938) and Tolm 


an (1959), 


ettings by Vroom 
awler (1968), and others. 
Topositions based on ey- 
ggested below, which 
em of how employee 
anced through the design 


ividual believes 


an outcome he values by 
ehavior or class 


that he will 
or is enhanced, 


tcomes can be both in- 


of accomplishment or of 
and extrinsic (eg., 


material 
is that the out- 


alued by the individual. When an 


aining some valued 
a contemplated 


outcomes 
re expected by 
eed satisfaction 
objective best 
ple frequently 


affairs which are 


long ralis 
being (Locke, 1969), Neverth n 


eless, if g 
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3. Thus, to the extent that conditions at 
work can be arranged so that employees can 
satisfy their own needs best by working effec 
tively toward organizational goals, employ i 
will in fact tend to work hard toward the 
achievement of these goals (McGregor, 1960 

4. Most lower level needs (e.g., ape 
well-being, security) can be (and often me 
reasonably well satisfied for individuals re 
contemporary society on a continuing pas 
and, therefore, will not serve as Je. 
incentives except under unusual ge geese 
This is not the case, however, for eer 
higher order needs (e.g., needs for penr 
growth and development or for feelings pi 
worthwhile accomplishment). A person m 
experience higher order need satisfaction ni 
continuing basis without the strength of den 
for additional Satisfaction of these needs ¢ 


i che Š ‘tional 
minishing, Indeed, it may be that addition 


. 5 f ally 
satisfaction of higher order needs actui 


increases their Strength (Alderfer, 1969). T 
is an important possibility since it BUER o 
that the Opportunity for the development g 
continuing (and possibly even incr Ha 
motivation is much more a reality when hig fot 
order needs are engaged than is the case ere 
More easily satisfied lower order needs. be 
is, of course, a major cost associated ner 
any motivational approach in which ag 
order needs are central: Not all employees tis- 
or will respond to Opportunities for the wer 
faction of higher order needs, and thus ™ 


vational ; an 
ational approaches based on these 
cannot be 


L Ne Jow 

e€ applied indiscriminantly. ee 
(1943, 1954) and Alderfos (1969, 1971) distr 
re complete detail the i 
t needs and their motivati 


5. Individuals w 
order need Satisf 


shel 
ho are capable of hig 
c action will in fact expe 
such Satisfaction 


rienc? 


? 

j r Meaningful (see Argytiss 4). 

Lewin, Dembo, Festinger & Sears, i her 

Specttioally, individuals vio aeciee nighe! 
order need satisfaci: 


sfactions should be most re 


J ack ? 
ch provide feed bat ties 
personal work es mot” 
s for internal wor 


= =F nn eel pede) TD 


y 
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a = it appears that a job must: (a) 
$ poe to feel personally responsible 
A ate Tepi oom and meaningful portion of 
= sn >) provide work outcomes which 
Bone eS ly meaningful or otherwise ex- 
ee E as worthwhile, and (c) provide feed- 
a ae me performance ’ effectiveness. The 
ob we better an individual works on such 
erener more opportunities he will have to 
Re as ce higher order need satisfactions and 
Ractive. incentive there can be for continue 
one : Se Higher order need satis- 
reste i theretäre; are seen both as (a) a 
hive aed oe than a determinant of) effec- 
and (b) ag (Lawler & Porter, 1967), 
perfor ie incentive for continued efforts to 
Th ectively.* 
to a propositions outlined above lead 
So a it may be possible under 
igh sa E e simultaneously to achieve 
effort E oyee satisfaction and high employee 
the ie organizational goals. Specifically’ 
and hi Jera congruence of high satisfaction 
the a effort is seen as depending upon (a 
order n E of employee desires for higher 
the job ee on and (b) conditions on 
EA such that working hard and effectively 
ic ale garners goals will bring about 
action of these needs. 


C PONT 
haracteristics of Motivating Jobs 


ER e general job characteristics identi- 
betwee ve as central in developing à congruence 
nizational dividual need satisfaction and orga- 
able rae goal achievement must be describ- 
of the more measurable terms if the validity 
tested pap one proposed here is to be 
cach | i the following paragraphs, therefore, 
are a the three general characteristics 
ute na in somewhat more detail. In 
th ition, it will be proposed that four of 
e requisite task attributes proposed by 
ape and Lawrence (1965) are likely to 
job useful as measures of the three general 
characteristics. 
e The job must allow a wor 
sonally responsible for a meaning 


~ 


orker ‘to feel 
ful portion 


4 
we should be noted that only highe! 
a jot are predicted to be increased by el 2 
satists with the characteristics outlined al 
sw ‘actions (e.g Pay satisfaction, $2! 
pervision) may not be affected. 
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of his work. What is accomplished must be 
through the individual’s own efforts. He must 
realize that the work he does is his own. And 
he must believe that he personally is responsi- 
ble for whatever successes and failures occur 
as a result of his work. Only if what is accom- 
plished is seen as one’s own can an individual 
experience @ feeling of personal success and a 
gain in self-esteem. This does not mean, of 
course, that feelings of personal responsibility 
for work outcomes cannot occur in team 
projects; all that is required is for team mem- 
bers to feel that their own efforts are important 
in accomplishing the task at hand. 

The autonomy dimension, aS specified by 
‘Turner and Lawrence (1965), would seem to 
tap the degree to which workers feel personal 
responsibility for their work. In jobs high on 
measured autonomy, workers will tend to feel 
that they own the outcomes of their work; 
in jobs low on autonomy, & worker may more 
often feel that successes and failures on the 
job are more often due to the good work (or 
to the incompetence) of other workers or of 
his supervisor.° 

2, The job must provide outcomes which are 
intrinsically meaningful or otherwise experi- 
enced as worthwhile to the individual. If a 
he results of his efforts are 
tant, it is unlikely that he will 
good if he works effectively- 
Instead, he must achieve something that he 
feels is worthwhile and important 

able to experience positive 
i f as a result of his efforts. 
ple to indicate for people 
job characteristics 
utcomes seen as 


not very impor 
feel especially 


if he 
feelings 4 
It clearly is not possi 
in general what 

will be likely to provide’ © 
meaningful and worthwhile; people differ too 
much in the kinds of things they value for 
any statement of such generality to be made. 
It is possible, however, to provide some such 
job does not, of 
y imply that one will have major 


e of work outcomes, over which the 
meaningful control. For example, 2 
ck has high autonomy in selecting 
level of personal control 


worker has little 
football quarterba 
plays but only a moderate 
over the outcomes obtained from execution of the 
plays- ‘Thus, work autonomy is probably best viewed 
as a necessary but not sufficient condition for feeling 


personal responsibility for work outcomes. 
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specifications for individuals who have hipi 
desires, for higher order need satisfaction a ’ 
of course, these are the individuals a! zi 
the present conceptualization is intended to 
i... are at least two ways that work can 
come to be experienced as meaningful for 
employees with relatively high desires for 
higher order need satisfaction. The first is or 
the job to be a sufficiently whole piece of work 
that the worker can perceive that he has 
produced or accomplished something of con- 
sequence. In terms of a Turner and Lawrence 
task attribute, this would be expected to be 
the case when a job is high on task identity. 
According to Turner and Lawrence (1965, p. 
157), jobs high on task identity are character. 
ized by (a) a very 
closure—the job pro 
the beginning and e 


tude. For a worker 


developing and usin 
with such ch 


In addition, the experienc 
of work may be enhanced w 
a worker with the opportunity to accomplish 
Something by using skills and abilities which 
he Personally values, For example, a Strongly 
motivated duffer feels go i 
solid tee shot 


ed meaningfulness 
hen a job Provides 


; 
those skills are important to him; he performs 
well—and that, in itself, 


Jobs high on the T 
(1965) dimension of vari 
to provide Opportunities 
ence this kind of meani 
since high variety jobs t 


urner and Law 
ety 


5 working on high variety 
jobs may become pers 


some employees throu 


ous to that which makes golf meaningful to 


€ noted, however, that 
es in fact challenge the 
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worker will be expected to be oe 
meaningful to workers with desires tor qed 
order need satisfaction ; screwing many dl of 
ent sizes of nuts on many different ae - 
bolts, if this could be considered — K 
would not be expected to be experience! 
meaningful.’ 

To summarize, it may be that jobs an e 
to be experienced as meaningful to emp: s Bie 
to the extent that they involve doing a T H 
piece of work of some significance (i.e. a ta 
high task identity) and, at the same m a 
the extent that they give employees the “ht to | 
to use their valued skills and abilities en 
be challenged) in doing the work. In a abe 
cases the latter condition may be met on j 
which have high variety, pout 

3. The job must provide feedback ee 
what is accomplished. Even if the two gen m- 
conditions discussed above are met, an a 
Ployee cannot experience higher order k un- 
satisfaction when he performs nag S 
less he obtains some kind of feedback pa 
how he is doing. Such feedback may ale 
from doing the task itself (e.g., when H long 
phone operator Successfully. completes 2 rfor- 
distance person-to-person call), but pe ome 
mance feedback also may come from S A 
other person—an esteemed co-worker, a PA 
visor, etc, The crucial condition is ee to 
back be present in a form that isn pelieva 


ists for 
the worker, so that a realistic basig €X!S 
the Satisfaction (or f 


aT p order 
rustration) of higher 
needs, ‘ f the 
_ It should be emphasized that, for afl! a not 
Job characterist; i k 


Wy they 
wrdless 
y the employees, Regia, of 
amount of feedback (or varietiigally 
a worker ilies 
uch he percent h® 
S reactions to P60 


as in his work, it is how m 
that he has which wi i 
will affect hi 


It is also Possible, of Course 
much variety, Activation 


Suggests that when variet 
experience a 


ne: 
hypertension whic 
effectiveness, In 
found that among research scienti: 
Berge en with ow job Satisfaction, apparent 
: de iR hig Variety also tended to be Ie 
in task identity and feedback r 


e 
» for a job to hay 966) 
theory (e.g., Scott, may 


i 5 al 
y is too high, employee je! e 
e of muscular and nktan 


Sts, high job varig / | 


Y 


ol LER” 
> pete job characteristics are impor- 
and ne they do affect the perceptions 
often ew of employees. But there are 
job oaasi ial differences between objective 
lon eristics and how they are perceived 
that sim Age and it is dangerous to assume 
istics of of oie the objective character- 
that the z have been measured (or changed) 
Ployees bee that job is experienced by em- 
Tn oe, Ss een dealt with as well. 
F oe then, it has been argued that 
itions whi get ef jake Ean establish cons 
tion of en will enhance the intrinsic motiva- 
Satisfaction. FK who desire higher order need 
Sested, in ter n particular, it has been sug- 
awrence (1965 of a subset of the Turner and 
viduals will 5) dimensions, that such, indi- 
Personal sati pe able to obtain meaningful 
on jobs wh 1s action when they perform well 
Variety Ae they experience as high on 
Further G og task identity, and feedback. 
Performs € harder and better an individual 
on these dim a job which is perceived as high 
tour core as (hereafter will be called the 
imensions), the more satisfaction he 


is likely to feel. 


Str 
ategy of the Present Research 


Oleg a oepinalization presented above pro- 
Tesearch c basis for the present empirical 
acteristic Fe the relationships among job char- 
strength s; individual differences in need 
tion ; and employee motivation, satisfac- 
job. Te erformance, and absenteeism on the 
here foll articular, the research to be reported 

1. Meson the strategy steps listed below: 
sions Ta sures of the following six job dimen- 
o which » developed : (a) variety, the degree 
a wide a job requires employees to perform 
and/or range of operations in their work 
Use a the degree to which employees must 
ip ont of equipment and procedures 1n 
emplo for (b) autonomy, the extent to which 
ae yees have a major say In scheduling 
Use work, selecting the equipment they will 
lee deciding on procedures to be fol- 
employ (c) task identity, the extent to which 
and yees do an entire or whole piece of work 
ike. clearly identify the result of their 

emplos (d) feedback, the degree to whic 
Wome receive information as they are 
ing which reveals how well they are per 
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aling with others, 


forming on the job; (e) de 
equires employees 


the degree to which a job ri 
to deal with other people (either customers, 
other company employees, or both) to com- 
plete the work; (f) friendship opportunities, 
the degree to which a job allows employees 
to talk with one another on the job and to 
establish informal relationships with other em- 


ployees at work. 

The latter two dimen 
permit exploration of the impac 
personal characteristics of job design. These 
dimensions were adapted with very minor 
revision from the task attributes “required 
interaction” and “optional interaction” pro- 
posed by Turner and Lawrence (1965). They 
are not, however, directly relevant to the con- 
ceptualization about job-based work motiva- 
tion proposed above, and no specific predic- 
tions regarding them were made. Thirteen 
different jobs were described on these six di- 
mensions by the researchers, by employees 
who worked on the jobs, and by members of 
the management of the telephone company in 
which the research was carried out. 

2, A measure was developed which was 
expected, on an a priori basis, to reflect the 
level of employee desire for the satisfaction of 


higher order needs. 
3. Based on the mea 


sions were included to 
t of the inter- 


n scores of the employees 
on the measure of need strength, predictions 


were made regarding the expected relation- 
ships between the job characteristics as mea- 
sured by the four core dimensions and the 
dependent variables: satisfaction, performance, 


and absenteeism. Relevant data were collected 
from 208 employees, and correlations between. 
each of the four core dimensions and each of 
the dependent measures were computed. 

4, The theory outlined above indicates that 
how a job is experienced or perceived by an 
individual employee should determine his re- 
actions to it, rather than the objective char- 
acteristics of the job. This possibility can be 
examined by analyzing the relationship be- 
tween the characteristics of a single job (as 
they are perceived by employees) and the 


behavioral and attitudinal reactions of indi- 
viduals who hold that job. By restricting the 
analysis to individuals working 
it is possible to rule out ob- 
explanation for 


scope of the 
on the same job, 
jective differences in jobs as an 
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obtained empirical relationships—and, fercb 
to address the possibility that the perceive 
characteristics of jobs affect employees ay the 
same general fashion as do objectively meas 
sured characteristics. Therefore, relationships 
between perceived job characteristics and the 
dependent variables of interest were computed 
separately for employees who worked on each 
of the 13 objectively different jobs included in 
the study, and these relationships were com- 
pared to those obtained in the overall analyses 
involving all Ss and all 13 jobs. 

5. The theory implies that satisfaction, per- 
formance, and attendance should be highest 
when all four of the core dimensions 
present. That is, being high on, say, variety 
and task identity but low on autonomy and 
feedback should not provide the conditions 
that are necessary for positive behavioral and 

_ affective responses. Under such circumst 
a worker would be expected to have neither 
a sense of personal ownership of his work 
activities and outcomes (low autonomy) nor 
a trustworthy gauge of the adequacy of his 
performance (low feedback). The conditions 
for the development of internal work motiva- 
tion specified earlier would therefore not be 
met. The importance of having all four core 
dimensions present was tested by comparing 
the dependent variable Scores of employees 
who saw their jobs as moderately high on all 
four core dimensions both with the scores of 
employees who saw their jobs as high on some 
dimensions and low on others, and with the 

ployees who saw their jobs as 
moderately low on all four dimensions, 

6. The theory states that individual differ- 
ences in desire for higher order need satisfac- 
tions should moderate the relationships be- 
tween job characteristics and the dependent 
variables, In order to test this Possibility, 
relationships were computed Separately and 
compared for the third of the employees high- 
est on desire for higher order need satisfaction 
and for those employees lowest on desire for 
higher order need satisfaction 

7. Finally, exploratory analyses w 

of the relationships betw 

personal job dimensions ( 
and friendship opportuni 
dent variables. 


are 


ances, 


ere made 
een the two inter- 
dealing with others, 
ties) and the depen- 
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METHOD 
Research Setting and Subjects 


e 

The research was carried out in an eastern telero 
company, and focused on employees who worl m a 
different jobs in the plant and traffic departmenta g 
the company. The jobs were selected so as to ne ea 
(a) several varieties of operators, ee Ey, 
office repairmen, and cable splicers; (b) to rang sal re 
in complexity and in the level of employee a aia 
quired; and (c) to be located in both rural an 
settings. 2 

Tote were collected from 208 employees FNA 
Supervisors. Employees in the traffic copri jant 
(about one-third of the sample) were female; all p! 
department employees were male. 


Procedure 


job 

All data were collected on site at each of the ee 
locations. One to three days were spent at each ation 
tion, and all data were collected from each oe col- 
before moving on to another location. Data wer tion, 
lected over a period of about 9 mo. At each pene 
the following five procedural steps were fo! p 
(although the order of the steps sometimes Va 
because of local circumstances). was 

1. Local second- or third-level_ management from 
visited to obtain permission to collect data “anise 
employees working on a particular job. When Pager 
sion was secured (it was never denied), the mana job 
were interviewed about the general nature of the 
as they perceived it, inter- 

2. First-level Supervisors on the local job were i soe 
viewed about the nature of the job and employ ch 
reactions to it and were given a questionnaire oye’ 
tapped the supervisors’ perceptions of the emp’ 
job in a format similar to that used for obtaining 
employees’ own job perceptions. d and 

3. Employees working on the job were observed ter- 
interviewed informally. These observations and ne 
views were conducted by two researchers an no 
tinued until the researchers felt that they were the 
longer obtaining substantial new information about ne 
job. Observation typically lasted approximately | the 
working day. Ratings of the Job characteristics PY var 
researchers were made on the basis of these obs® 
tions and interviews, 


f 
. . e E 
4. A questionnaire was administered to a sampl ok 
15-20 employees on the job. The questionnaire 


between 4-1 hr, to Complete and usually was & 


$- 
'Stered to employees in groups of three or four | 
general nature of the research was explained t° 


the 
group of employees before they began work Pdf 
questionnaire, although the hypotheses of the y 
and the depende: 


r 

i nt variables to be analyzed, wor’ ipat 
mentioned. It was emphasized to each individual s did 
participation was voluntary, and a few individu% yere 
decline to ; 


Par e: b 
participate. In addition, employe’ ipes, 
told that putting their 


jonna 
j A names on the questio! yolu”: 
while desirable for research purposes, also was jp 
tary. About 10% of the employees who partic 
declined to give their names. 


5. 
who ak i i 
ee taken the questionnaire and provided their 
alata z were made by first-line supervisors. Absence 
were collected later from company records. 


Th 


- How 
Tow much autonomy do you hav 


- To what extent do you do a‘ 


EMPLOYEE REACTIONS TO JoB CHARACTERISTICS 


Ratings of the performance of those employees 


T 
nstruments and Measures 


pe b a Each of the 13 jobs examined in the 
limensiohe a escribed in terms of the six general 
identity k ASen earlier (variety, autonomy, task 
anaes ack, dealing with others, and friendship 
Lenso ies). The jobs were described on all six 
of emplo 5 using four different methods: (a) 
naire Pa ees on the job completed several 
and Fae ziyani to the six dimensions; (b) first- 
identical ¢ -level management on the job completed 
observin Ausshonnale scales; (Q) the researchers, after 
supervisers e job and interviewing employees and 
searchers a completed similar scales; and (d) the re- 
adapted fror ee a set of objective coding procedures 
a set of non Purner and Lawrence (1965) to obtain 
The job a jective assessments of each job. 
Süpervisors í escriptions completed 1 yees 
oF five ay rae aie three separate questionnaire items 
Rete the six dimensions; two items were 
EEE aaa dealing with others.” One sec 
job on aerate involved making direct ratin t 
of the ei eee scales. The scales used in this section 
attached p oyee questionnaire, togeth 
Poit aren each of the scale end points 
, isted below. 
a. 


a sample 
question- 


by employees and 


used for 
tion of 
tings of the 


ether with the labels 
and the mid- 


Aow much variely is there in your job? 

ery little; I do pretty much the same things over 

and over and use the same pieces of equipment 

aad procedures almost all the time. (Scored 1) 

odere variety (Scored 4) 

wide much; I do many different things and use a 
e variety of equipment and procedures. 

(Scored 7) 

e on your job; 


h 
ow much are you left on your own to do your 


own work? 

voe little; I have almost no “say” about schedul- 

ag my work; the work and the procedures are all 

F out for me in detail. 

` po frete autonomy; I make some of the 

z out my work, but many of them are 
e. 

Very much; I have almost all of the “say” about 

the scheduling of my own work; I alone decide 


what procedures will be used. i 
whole” piece of work 


a job which is 


decisions 
made for 


(as opposed to doing part of 
finished by some other employee) ? 

I do one small part of a job; there are many 
others who do other parts of the job; I may not 
See the final result. 

I do a moderate size “chunk” 
are others involved too, but my 
clear, 

I do an entire piece of work; I do the job from 
start to finish, and what is done is clearly “mine: 
To what extent do you find oul how well you are 
doing on the job as you are working? 


of the work; there 
contribution 1s 


A 
ploye 


attributes was actua! 
scales ranged from 
(scored 1) through 


to “a 
used 


(a) 


() 


(Q] 


(a) 


(e) 
(f) 


The items desc 
throughout a 23- 
which asked ab 
of heterogeneous aspec! 


Sco! 


dimensions were 
summary scores 
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Very little; I often work for long stretches without 
finding out how I am doing. 

Moderately; sometimes I know how I am doing 
and other times I do not. 

Very much; I get almost constant 
my performance as I work. 


“feedback” on 


. To what extent do you have the opportunity to 


talk informally with other employees while at 
work. (That is, is your job arranged so that you 
can chat with other workers while on the job— 
even though the job does not require you to talk 
to these people?) 

Very little; there is almost no chance to talk to 
other employees except about “business.” 
Moderately; there is some chance to talk, but 
you may have to arrange it ahead of time. 

Very much; there is almost always an opportunity 
to talk with other employees about non-business 
topics. 
To what extent is dealing with other people a part 
of your job? 
Very little; working with other people is nol a 
very important part of my job. 

Moderately; I have to deal with some other 
people, but this is not a major part of my job. 
Very much; probably the single most important 
part of my job is working with other people. 


n of the questionnaire asked em- 
ales how much of the various job 
lly present on their jobs. These 
“none or a minimum amount’? 
“a moderate amount” (scored 4) 
t” (scored 7). Scales which were 
uestionnaire include: 


second sectio: 
es on 7-point sc 


maximum amoun! 
in this section of the q 
of variety in my job. 


For variety: The amount 
a number of different 


The opportunity to do 
things. 
For autonomy: The opportunity for indepen- 
dent thought and action. 
The freedom to do pretty mu 
my job. 

For task identity: 
from the beginning 
a whole job). 

The opportunity to com| J 
For feedback: The opportunity t 


well I am doing in my job. 
The feeling that I know whether I am perform- 


ing my job well or poorly. i x 
For dealing with others: The opportunity, 1n 


my job, to give help to other people. 3 
For friendship opportunities : The opportunity 
in my job to get to know other people. m- 

The opportunity to develop close friendships in 


my job. 


ch what I want on 


The opportunity to do a job 
to end (i.e., the chance to do 


plete work I start. 
o find out how 


ribed above were randomly spread 
item section of the questionnaire 
out the presence or absence of a number 
ts of the work setting. 

res on all items describing each of the six job 
averaged to arrive at a set of six 
from the employee ratings described 
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TABLE 1 regarding this dimension is that some fecha 
THE Six provided directly by the job as an employee wo 
ieee aire faa (e.g., an installer successfully installing and testing 
/ESCRIPTIVE NSION 


telephone), while additional feedback may be provided 


k by supervision (e.g., a foreman telling an a 

Source of ratings that he has taken too long to do a particular job). 
Se ee may be that employees, supervisors, end, oe 

i i T- were not attending to the same aspect of the feedba 
cae Emz Sup er ce Process in making their judgments, thereby lowering 
Ployees | visors ence the level of agreement. -EA 
In addition to the problem with the feedback Be 
Variety 90 91 86 sion, there is a tendency for correlations inyo 
Autonomy -17 -68 -89 supervisors to be lower than others. This is apral 
Task identity 17 -86 .95 true for the two dimensions involving ciao ops 
Feedback 715 15 97 relationships: dealing with others, and friends i Pes 
Dealing with others 47 17 88 portunities. Examination of the mean scores o isa 

Friendship opportunities AB 42 92 ploy ees and supervisors on these two jobs reveal l 


iabilities for employee anq 
ratings are internal 


supervisor vi consistent], 
es. Each dimension 


i s 
y more or consistently =, 
i Mponent than employees—th 


vith 
ell with the employees (or wit 


naire items, ex 
age of two i 
ence proced 


d ərsonal 
7 Ete t the extent of Antoine 
judges, adjusted by Sp 1 edures. Ratings he job. Part of the reason 
the researchers (the 7 ) we 
laboratively and reliab le. 


may be the low internal oe 
Supervisors on the dimena 
In addition, on-site observes 
and large, supervisors were i f 
ntive to and concerned with hat 
uction aspects of the jobs Tea 
ith interpersonal issues. This sity 
nt for both the unreliabi ae 
“dealing with others” dim 


s perviso + Sion and the low level of a 
terjudge reliabilities for the re- and theother rate 


Table 2 shows tł 


de 
of supervisors’ d P hat em loyee: elves provid 
pi rs’ data o n “dealing with jud f p! yees thems ts 


others,” the reliabilities are adequate, 
Omparison of the Scores obtained ; ir super 
7 e exceptions a their sur 
the four rating Procedu igh agreement on Visors, It ig not, 2 cotta, Pid 
c= 
conclusively that the employee judgments are obje 
tively accurate, because no Unambiguous standar! on 
accuracy is available, Nevertheless, the strong a 
vergence of the employee judgments with the ass¢ 
TABLE 2 


CORRELATIONS AMONG THE FouR Rating PROCEDURES 


Job dimensions 
Correlation between Friend- 
Vari Task : Dealin ip OP” 
ii Autonomy identity | Feedback with aoe 
others S ties i 
Employees & Supervisors .87* 85* 65* 49* 
Employees & researchers .94* `94* Po .09 31 “age 
Employees & Turner-Lawrence .89* `72* ie ~ 22 -91* iz 
Supervisors & researchers .94* “79 ‘63 ~.66 .95* 38 
Supervisors & Turner-Lawrence 84* “49* E * =15 37 35 
Researchers & Turner-Lawrence .90* “80+ p m N 23 26 3 2 
7 a 16 -93* à 
Note.—N = 13 jobs, 
*p <.05 (one-tailed test), 
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TABLE 3 
CORRELATIONS AMONG THE SIX DESCRIPTIVE DIMENSIONS: EMPLOYEE RATINGS 
. R ; . Dealing | Friendship 
Dimensions Variety Autonomy ay Feedback with opportuni- 
B others ties 
Variety aus 
Autonomy .67* — 
Task identity i7 .24* — 
F cedback Al .06 12 = 
Dealing with others 05 —.05 .19* .26* — 
‘riendship opportunities .41* 21* 07 Ja Ag = 


Note.—y 
+ ote.—N = 208 employees. 
b <01 (two-tailed teat), 


nie a ae from the researchers, from supervisors, 
that the ita Turner-Lawrence procedures does suggest 
Rondistortes Dloyees were able to provide generally 
jobs. In ed descriptions of the characteristics of their 
employe most of the analyses reported in this article, 
of job oy © judgments are used as the primary measures 
given ae eee This is appropriate and necessary, 
ception at it Was argued earlier that employees’ per- 
acteristi of their jobs (rather than objective job char- 
emplo ac) are causal in affecting the reactions of 
} a Ble eae to their work. Fortunately, the data in 

Conver, establish that employee perceptions have 

ore, Gary validity as descriptors of jobs—and, there- 


thee’ relationships among the six job dimensions 
is some ves are presented in Table 3. Although there 
related tendency for the six dimensions to be positively 
Of sub, 8 ope another, only two of the correlations are 
variet a magnitude: jobs seen as having high 
in felon are seen as being high in autonomy and 
Ship an Ship opportunities. The level of interrelation- 
resent rece the six dimensions as measured in the 
I research is lower than that reported by Turner 
use E (1965), and does not mitigate against 
job tin ne six dimensions separately as descriptors of 
Ta aracteristics. 
Mirae ble 4 presents a listing of the 13 jobs studied, the 
mber of employees on each job who participated in 
> research, and the mean employee ratings of the 
aracteristics of each job.? 


ch, 


. dine *Stiptions of the 13 jobs on Turner-Lawrence 
Be eee are not included in Table 4 for two ram, 
om » the Turner-Lawrence procedures were modi ei 

newhat for use in the present study, and therefore 
© scores obtained are not directly comparable to 
we reported by Turner and Lawrence (1965) for 
he Sample of 47 jobs. Second, the primary use ies ee 
en ner-Lawrence scores in the present research was 

nonstrate the convergence of employee assessments 
F eir jobs with more objectively based job A 
A Table 2). Substantive analyses a Ea 
in ified Turner-Lawrence scores are not zepo Ti 
ar ce the scores of specific jobs on the six dimensi 

© confounded both with traffic versus plant functions 


Individual need strength. To obtain a measure of the 
degree to which Ss were desirous of obtaining higher 
order need satisfactions from their work, a question- 
naire was administered which asked how much of 
various opportunities and attributes the employees 
“would like” to have on their job. In a space beside 
each item, employees entered a number ranging from 
1 (would like to have none or a minimum amount) to 
7 (would like to have a maximum amount). Content 
of the questionnaire ranged widely, and included items 
relevant to pay, promotion, security, working condi- 
tions, peer relationships, and supervisory relationships. 
Twelve of the items were judged on an a priori basis 
to measure desire for higher order need satisfactions. 
These items are: 


(a) The opportunity for personal growth and de- 
velopment on my job. 

(b) The opportunity for independent thought and 
action on my job. f 

(c) The opportunity to find out how I am doing. 

(d) The opportunity to complete work I start. 

(e) The opportunity to do challenging work. 

(f) The feeling that I know whether I am perform- 
ing my job well or poorly. L 

(g) The opportunity to do a number of different 
things. oe 

(l) The opportunity to do a job from the beginning 
to the end (that is, the chance to do a whole job). 

(i) The freedom to do pretty much what I want on 
my job. y 4 

(j) The amount of variety in my job. k 

(k) The feeling of worthwhile accomplishment in my 
job. 

() The opportunity, 
the determination o. 
goals. 


in my job, for participation in 
f methods, procedures, and 


Scores of each employee on the 12 items were 
summed to obtain an overall measure of the level of 
ee 
in the telephone company and with employee sex 
(see Table 4). It was concluded on these grounds that 
the sample of 13 jobs was not adequate for analysis 
of relations between objective job descriptions and mean 
dependent variable scores using jobs as observations. 
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TABLE 4 
SUMMARY OF THE CHARACTERISTICS OF THE JOBS STUDIED 
X employee ratings of job characteristics 
“riend- 
7 Task Deal- A 
Job description Se ft Vari- | Auton- ss Feed- | ing oppor- 
ety omy | ! poe back | with tant 
Biy | others | ies 
| 
; 5 | 4.13 
Director assistance (DA) operators F 16 3.08 3.64 5.66 369 
Toll operators F 18 2.87 3.06 5.15 =a) 
Combined DA & toll operators F 17 3.97 3.29 5.87 323 
Traffic service position (TSP) operators F 18 2.78 2.91 5.06 105 
TSP operators with “enriched” jobs F 19 2.94 2.96 | 5.26 aie 
Tip & ring (residence) installers M 18 4.78 5.23 6.38 | oT 
Key (commercial) installers M 19 5.72 5.06 6.26 187 
Private branch exchange installers M 15 6.31 5.61 | 5.25 hea 
Cable splicers M 16 | 5.14 | 467 | 5.04 eee 
Combined tip & ring & key installers M 17 4.94 5.09 6.11 | Sda 
Central office framemen M 8 4.50 | 3.89 4.42 | ei 
Central office repairmen: Step equipment M 11 5.30 | 484 | 5.32 | ei 
Central office repairmen: Crossbar equipment | M 16 | 5.91 | 4.92 | 5.93 9. 
so 


higher order need strength. Internal con: 
liability of the 12-item scale was .89. 
Employee reactions to their jobs. Four types of em- 
ployee reactions to their jobs were obtained from the 
questionnaires employees completed, for use as depen- 


dent variable measures. The four types of reactions are 
described separately below, 


1. Experienced w 
derived 


sistency re- 


degree to whi 
affective outcome: 


ble for what you 
Producing a large 


own work. (b) 
quality work. Eac 


(c) Doing high 
Cparate 7-point 
lenced pressure 
worker on this 
aW experienced 


r high experi 
important for a 
ty work”) to Ja 


pressure (e.g, “It is not especially important for & 
worker on this job to do high quality work”). 

2. Job involvement: three questionnaire items, a8 
the same format as the intrinsic motivation ee 
were averaged to yield a score indicative of the deg" 


in 


ed 
: employees complet 


all jo 
ms designed to tap overall J 


ame 
= . he sa 
also were in t wer 


jtting 
f (b) I frequently think of qii 
my job (reversed Sco 


5 ing 
é ring). (c) Generall speaking 
am very satisfied with the kind of WoT have t° 
on my job, 


a job 
ms used in the general satisfaction te 
and intrinsic motivation scales "ye 


r 
a conceptual level, ae jon 
intrinsic og he 
ey demonstrated N pr 
Scriminant validity ™ syen 
employee attitudes and behaviors- 
though the vari 


es do have di 


tion, and the three items reflecting the specific focus 
of employee motivation are included in Table 5. 

4. Specific satisfaction items: Employees indicated 
on 12 questionnaire items the degree to which they 
Were satisfied with particular aspects of their job. The 
items ranged from “extremely dissatisfied’? (scored 1) 
to “extremely satisfied” (scored 7). The 12 items were 
Not summed but were retained as separate indicators 
of Specific satisfactions of the job. Median intercorrela- 
tion among the 12 items was .40. The items queried 
how satisfied employees were with: 


(a) The feeling of self-esteem or self-respect a person 
gets from being in my job. 

(b) The opportunity for personal growth and de- 
velopment in my job. 

(c) The prestige of my job inside the company 
(that is, the regard received from others in the 
company). 

(d) The amount of close supervision I receive. 

(Q) The opportunity for independent thought and 
action in my job. 

(f) The feeling of security in my job. 

(s) The pay for my job. 

(h) ps feeling of worthwhile accomplishment in my 
Job. 

(i) The opportunity, in my job, for participation in 
the determination of methods, procedures, and 

„_ goals. 

(j) The opportunity to develop close friendships in 
my job. 

(k) The opportunity for promotion. 

® The amount of respect and fair treatment I 
receive from my boss. 


Performance measures. Supervisors rated the per- 
Pee of each employee on several dimensions, 
work of which are used in this report: (a) quantity of 

ork produced, (b) quality of work produced, and 

o) Overall performance effectiveness. 
wé e quantity and quality scores for each employee 

Te obtained by averaging the ratings he received on 
aaa rating scales from his supervisors (when more 

an one supervisory rating was available). T he overall 
Performance measure was obtained by summing aver- 

se Supervisory ratings across seven separate per- 
tmance scales, including quantity and quality. Inter- 
Boe reliabilities of the quantity and quality scales 
cae ‘71 and .79, respectively, and the internal con- 
€ncy reliability of the overall performance effective- 
Ness scale was .92. The correlation between the quantity 


and the quality scales was .63. , 
nchors for the performance ratings were phrased in 
absolute terms, and supervisors were encouraged to 
ies each employee against the highest ome 
a Vel of performance possible for the job in making 
Ssessments. Nevertheless, the degree to which ratings 
ee by different supervisors on different oe <a 
i rectly comparable is open to question, an resi = 
nvolving the performance ratings must be interprete 
With caution. 
bsentecism measure. The numb 
wp Ployee was absent during the 12 
ich the study took place was derive 
Payroll records. Occasions absent ra 


er of occasions an 
-mo. period during 
d from company 
ther than days 
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absent were used to discount the effects of single long 
periods of absence. For example, if an employee were 
ill for a 2-wk. period, the data would show the event 
as one occasion of absence, rather than as 10 working 
days of absence. 

Summary of measures. The various measures used in 
the research are summarized below. The means, 
standard deviations, and intercorrelations of all de- 
pendent variable measures are presented in Table 5. 
The measures include: (a) Job descriptions: variety, 
autonomy, task identity, feedback, dealing with others, 
and friendship opportunities. Descriptions were made 
by employees, by supervisors, by the researchers using 
the Turner-Lawrence procedures, and by the re- 
searchers subjectively after job observations and inter- 
views. Because there was generally high convergent 
validity among the four sets of job descriptions, and 
because the conceptual basis of the study suggests 
that jobs as experienced by employees should be most 
directly causal of employee reactions to their jobs, 
the employee job descriptions are used in the analyses 
to be reported in subsequent sections. (b) Level of 
higher order need strength of employees: a summary 
score of employee reactions to 12 need strength items. 
(c) Employee reactions to their jobs and work. Ques- 
tionnaire-derived measures of: the amount of intrinsic 
motivation experienced by employees, and the focus 
of their motivation; general job satisfaction; personal 
job involvement; and 12 specific satisfactions or dis- 
satisfactions with the job or the work situation. (d) 
Rated performance effectiveness of employees, in terms 
of quantity of work produced, work quality, and over- 
all performance effectiveness. (e) Absenteeism, mea- 
sured by the number of occasions an employee was 


absent during a 12-mo. period. 


RESULTS 


General Relationships between Job Character- 
istics and Employee Reactions 


According to the conceptual position out- 
lined earlier, the nature of the relationships 
between job characteristics and employee re- 
s to their work (including satisfaction, 
performance, and absenteeism) will depend 
upon the need states of the employees. In 
particular, it was predicted that, if employees 
are desirous of higher order need satisfactions, 
there should be a positive relationship between 
the four core dimensions (variety, autonomy, 
task identity, and feedback) and motivation, 
satisfaction, performance, and attendance. 
The mean score of the 208 telephone com- 
pany employees on the 12-item scale indicative 
of higher order need strength was 6.01. Given 
that the maximum possible score on the scale 
is 7.0, it would appear that the employees who 
participated in the present research had strong 
higher order needs, or at least that they felt 
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TABLE 6 
GENERAL RELATIONSHIPS BETWEEN JOB CHARACTERISTICS AND EMPLOYEE REACTIONS 
Core dimension uate eteoral 
dimension 
Dependent variable 
: Dealing | Friendship 
Variety | Autonomy ane b Feedback with opportuni- 
s identity others ties 
evel of intrinsic ivati 
Egue of —< al ial asl i a K 
me personal responsibility .14* 12° 19* 06 08 05 
Doing large quantities of work 10 | =12 01 102 o | -17 
z high quality work i r t i 
“ae tet ae i si ae aap 2 se lz 
uantity 
Quality =m, R 05 00 —.02 —.18 
aes r 7 r .07 = = 
banaa effectiveness .20* .26* ne E = oe 
icneral job satisfaction 38* 39* :20* ‘28 17 aie 
‘ee ica LS 24* 22* 12" 2a" 03 "16" 
Sentecism (no. of times absent ‘02 14 20 — í 
r satisfaction items j ~ “ i p E 
FE j obtained from job a2 32" 15* 35" .15* .27* 
pansa growth and development .36* 3d 14* 31* 11* .29* 
rare of job inside company .30* ae 15% Pk Si .14* .28* 
Inde nt of close supervision received 3r 35e za" 30* .07 .16* 
Seen thought and action 53* .62* 25 15* 00 2s% 
Pay uty 22% 27* .19* .39* alo” .28* 
Peeling 04 05 04 E 25" .24* 
pore of worthwhile accomplishment ay 32% .28* 42" 23* aL 
aloes in job-related decisions .28* Bi .20* 34* K vaa 25? 
Pro. opment of close friendships 25* Aar .09 .29* .09 47* 
Ree 17* ‘20* “15* 34" 21* “19* 
pect and fair treatment from boss .19* .26* .22* 35* 14 24 


Note.—N = 208 
P <.05 (one-tailed test). 


It was 7 
to to express high desires for 
Positive ot oo tg Therefore, generally 
Ween job Fy 1OAEHIpS would be expected be- 
sions ence Becuphons on the four core dimen- 
Torla dependent variables. = 
teises a between the six job description 
Soreni he btained from employees and the 
AS ites employnes on the dependent variable 
distussed i presented in Table 6. Results are 
KEko elow separately for the four core di- 
ionsand the two interpersonal dimensions. 


Co : 
re Di ? si, 
Dimensions: Toward Internal Motivation 


i Pe positive relationships were ob- 

depend etween the four core dimensions and 

satisfa ent measures indicative of motivation, 
È action, performance, and attendance. 

a erformance motivation and actual perfor- 
nce, The higher jobs are on the core dimen- 


nore employees tend to report feel- 
s to take personal respon- 
k and to do high quality 
when jobs are described 
jety, autonomy, and 
re rated as doing 
generally 


sions, the r 
ing internal pressure 
sibility for their wor 
work. And, in fact, 
as being higher on var 
task identity, employees @ 
higher quality work and as being 
more effective performers on the job. 

Further, the data provide support for the 
interpretation of these relationships which 
was proposed earlier in this paper: when jobs 
are high on the core dimensions, employees 
report having higher intrinsic motivation to 
perform well. That is, employees indicate that 
when they perform well on the job, they ex- 
perience positive internal feelings, and when 
they perform poorly, they feel badly. Appar- 
ently jobs seen by employees as being high 
on the core dimensions do establish conditions 
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on the job so that it is possible for workers to 
obtain personally rewarding experiences by 
doing well on the job. The data Suggest, 
moreover, that “doing well” is interpreted in 
the job context as having much more to do 
with high quality performance than producing 
large quantities of work. The core dimensions 
do not relate either to felt pressure for high 
quantity production, or to the actual quantity 
of work which is produced. This is consistent 
with the notion that employees with strong 
higher order needs feel positively when they 
have “accomplished something that they feel 
is meaningful”; it is not unreasonable that 
such workers would see doing high quality 
work as much more of a meaningful “accom- 
plishment” than simply turning out large 
quantities of work. 

General Satisfaction, job 
absenteeism. The core dimensions are, as ex- 
pected, strongly and Positively related to 
overall job satisfaction and to the degree that 
employees feel personally involved in their 
work, It is not, therefore, surprising to find 
that employees whose jobs are seen as high 
on the core dimensions tend to have better 
absence records as well. Work apparently js 
a satisfying place to be for employees with 
jobs high on the four core dimensions, and one 
Way they behaviorally demonstrate this is by 
coming to work regularly, 


Specific satisfactions. Nearly all of the spe- 
cific satisfaction items were significantly posi- 
tively correlated with the descri 
on the core dimensions. This is 
from the fact th 


involvement, and 


ork situation ad 


i j dressed by the 
2 specific Satisfaction items, 


i on these dimensions, 
whereas those items with weak relationships 
should be considered not ; i 


J. Ricuarp Hackman AND EDWARD E. LAWLER HI 


ing the kinds of effects that the core dimensions 
have on employees’ reactions to their jobs. st 

The four specific satisfaction items E 
strongly related to the core dimensions a 
(in descending order): (a) The appor aon 
for independent thought and action in my A 
(b) The feeling of worthwhile accomplishmeh 
in my job. (c) The opportunity for ge 
growth and development in my job. @ a 
self-esteem and self-respect a person gets fro 
being in my job. 

The four ‘eel least strongly related to a 
core dimensions are (in ascending order): 10 
The pay for my job. (b) The opportuni a 
develop close friendships in my job. (c) at 
opportunity for promotion. (d) The an 
of respect and fair treatment I receive fro 
my boss. a 

It appears, as expected, that the four co! 
dimensions seem to be most strongly w 
to the satisfaction of higher order needs. a 
satisfaction items which are most weakly = 
lated to the four core dimensions seem to ds 
most relevant to the satisfaction of nee $ 
classified as lower-order in the hierarchies ° 
Maslow (1943) and Alderfer (1969). 


Two I nler personal Dimensions 


Also included in Table 6 are the Deae 
ships between two Primarily interpersonal Le j 
dimensions (dealing with others, and frien e 
ship Opportunities) and the dependent vario 
measures. The nature of the relationships ° q 
tained are strikingly different from those 23 
tained for the four core dimensions. The tW° 


F i i ry 
Interpersonal dimensions do not relate very 
consistent] 


affective 


and any measure 
ance, or absenteeism. i 
involving the dimen? iy 
ene Pportunities” also are genera 
negligible, The dimension does not relate © 
the level of intrinsic motivation employ?" 
report, or to any measure of performance 
absenteeism, 

here are Positiy 
e for friendshj 
dealing With oth 
Personal dimensi 


ly 
€ relationships (moderata e 
P opportunities, smal ter- 
ers) between the two miphe 
Ons and job satisfaction. 


aan largest relationship between any of the 
isfaction items and either of the inter- 
ll? jph dimensions is not a very sur- 
ties pA oner The more friendship opportuni- 
SA ee on a job, the more employees 
develop ef a a the chances they have to 
pei ose friendships on the job. This 
Bae suggests that the kinds of con- 
Bath Mick to be expected from having jobs 
Fred = Seager components (as mea- 
AT SAORI his research) are primarily social in 
teens rather than being relevant to the per- 
the saat and motivation of employees as 1s 

or jobs high on the core dimensions. 


Withi . 
ee Relalionships between Job Dimen- 
ts and Employee Reactions 


Aker? proposed earlier that job design 

beens allect employee attitudes and behavior 

“eels their impact on the perceptions 

fine tae have of their jobs. This may be 
sented schematically as follows: 


Objective job 1 Perceived job 2, Work attitudes 
— and behavior 


characteristics aa 
aracteristics characteristics 


Tonle presented in Table 2 show that em- 
tial % RerepueDR of their jobs have substan- 
tthe § ergence with the assessments of ob- 
SAeehe Job characteristics made by the re- 
Tae ers and by company supervisors. If it 
a that the assessments by the 
Fett ere and supervisors do reflect the 
Ps character of the jobs, these data 
are ise art ta that employee job perceptions 
job ci (at least in major part) on objective 
le artery E first link in the chain. 
Medis F reported in Table 6 and discussed im- 
ea au above show that employee job per- 
$ a also are related to work attitudes and 
ehavior—the second link in the chain. How- 
ever, these latter analyses are based on job de- 
Scriptions provided by employees who worked 
on 13 different jobs—jobs which have been 
Shown (see Table 4) to vary substantially on 
the six job dimensions. These large objective 
piTerences among the 13 jobs may, therefore, 
nave been primarily responsible for the rela- 
tionships which were obtained. In other words, 
the results in Table 6 could have come about 
because the objective job characteristics di- 
rectly caused both the perceived job character- 
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istics and the employee reactions—and not 
because they caused the perceived job char- 
acteristics which, in turn, caused the reactions. 

By examining the relationships between em- 
ployee perceptions of their jobs and the 
dependent variables separately for workers on 


each of the 13 jobs, it is possible to determine 
ctors, largely uncontaminated 


ob differences, bear a 
o the dependent vari- 


if perceptual fa 
by objective between-j 
significant relationship t 
ables. Such an analysis, therefore, would 
provide a relatively unconfounded test of the 
second link in the above diagram. Within-job 
relationships should, of course, be lower in 
magnitude than those obtained using all em- 
ployees working on all jobs, since there is 
little meaningful variation in actual job char- 
acteristics when data are obtained only from 
a single job, and as a result the variance 
in employee perceptions should be severely 
attenuated.’ 

Thirteen matrices of relationships were ob- 
tained by correlating job descriptions and 
dependent variables separately for employees 
working on each of the 13 jobs. The median 
of the 13 correlations between each job dimen- 
sion and each dependent variable was selected, 
and the matrix of median correlations is pre- 
sented in Table 7. 

As expected, the median within-job correla- 
- are consistent with the correla- 
d across all employees and jobs 
The order of magnitude of the 
b correlations, also as ex- 
an that of the correlations 
based on data from all employees. These re- 
sults suggest, therefore, that employees’ per- 
ceptions of their jobs are of central importance 
in affecting job attitudes and behaviors, but 
that the major determinant of such percep- 
tions is the objective make-up of the job 
itself. 
a 

8 To the extent that the 
jobs assigned to employ 
objective job, the usefulness of wi 
for testing the impact of strictly perceptual factors on 
the dependent variables is limited. Although some 
differences in actual work assignments were noted by 
the researchers among employees working on the same 
objective job, these differences invariably were of little 
consequence. Therefore, the within-job analyses prob- 
ably do reflect primarily differences in employee job 
perceptions rather than actual differences in job design. 


tions generally 
tions compute 
(see Table 6). 


median within-jo 
pected, is lower th 


re are differences in the actual 
ees who work on the same 
thin-job analyses 
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TABLE 7 
x AND EMPLOYEE REACTIONS 
Mepran WITHIN-JOB CORRELATIONS BETWEEN JOB CHARACTERISTICS 
Interpersonal 
Core dimension dimension 
n Pri hip 
i aling | Friendship 
Dependent variable Dea ante 
iety ;| „Task Feedback with oppor! 
Variety | Autonomy identity bee ribs 
3 06 
ivati 35 .03 s 
Level of intrinsic motivation 31 .23 Ai a 
f motivation = A n JE ; 
7 lgace responsibility 00 i a ig —07 
Doing large quantities of work —.05 = te ‘ae T ‘06 
Doing high quality work .05 x k -05 : 
ied lade .06 2i | = | <0g = = 
ae 06 18 06 03 -410 S 
Overall effectiveness .12 .21 E A n ‘Ot 
General job satisfaction 38 48 k E 2 4 
Job involvement 34 28 3 F ; = on 
Absenteeism (no. of times absent) —.10 —.03 03 0 E 
Specific satisfaction items : B 27 
aiies obtained from job -26 wi 15 44 AS | On 
Personal growth and development aoe 40 18 34 -25 26 
Prestige of job inside company Ad Ad 17 42 24 ‘07 
Amount of close supervision received -19 39 .08 31 17 20 
Independent thought and action 40 55 31 30 3 25 
Security .21 28 28 35 af 18 
Pay 24 33 7 33 19 “26 
Feeling of worthwhile accomplishment 40 Eyi Al 48 25 | oe 
Participation in job-related decisions 34 34 39 34 31 a 
Development of close friendships 21 07 25 35 26 ne 
Promotion 19 33 13 32 26 30 
Respect and fair treatment from boss .20 .29 24 35 09 x 
‘ecause the correlations reported in this table are medians, 


Exploration of Condition Jor Enhan 
tion and Satisfaction 

Results presented in the 
indicate that when jobs 
dimensions, employees w 
for higher order need s 
motivated an 
the followi 
reported whi 
vide addit 


cing Motiva- 


preceding discussion 
are high on the core 
ho have strong desires 
atisfaction will be highly 
job. In 
analyses will þe 
ndings and pro- 
ation of the condi- 
ary to enhance em- 
isfaction through job 


Must Jobs Be High on 


All Four Core 
Dimensions? 


The theory on which 


the present research is 
based specifies that whe 


n jobs are high on the 


no indication of statistical significance is given. 


core dimensions, employees have the oppor 
tunity to find out (feedback) that tie 
personally (autonomy) have accomplishec 
something meaningful (task identity and Par 
riety) when they perform well. The implic“ 
tion of this assertation is th 

motivation, jobs should be 
ously on all four of f 


To test this 
into three gro 
their jobs as bein, 


ee. ae 
our core dimensions; ar x 
S, who typically desc 


$ > cor 
g high on some of the 3 
and lo 


missing-data, uneq 


: rjane 
ual V analysis of varið 
was used to comp 


i 
f en 
are the mean scores 0 


; 
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TABLE 8 


Comparison OF JOBS DESCRIBED AS HICH on Att Core DIMENSIONS, HIGH ON Some Core 
DIMENSIONS, AND Low on ALL CORE DIMENSIONS 


Dependent variable x 
Dependent variable Jobs low on | Jobs high on Jobs high on 
F ratio all core some core all core 
dimensions* | dimensions? | dimensions 
Level of intrinsic motivation 13.72** 5.28 5.99 6.46 
Focus of motivation 
Taking personal responsibility 6.08** 5.90 6.40 6.88 
Doing large quantities of work -24 5.48 5.17 5.29 
Doing high quality work 3.04" 6.00 6.20 671 
Rated performance 
Quantity P 5.19 5.23 5.39 
Quality 3.36" 4.97 5.57 5.89 
Overall effectiveness 3.40* 5.14 5.54 5.94 
General job satisfaction 28.97** 3.78 4.99 6.21 
Job involvement 18.40** 2.30 2.94 4.43 
ne (no. of times absent) 1.62 3.11 2.70 2.00 
pecific satisfaction items a 
Self-esteem obtained from job 27.06** ; i aa 6.00 
Personal growth and development 24.45%" bt 3:88 6.00 
Prestige of job inside company 21.6575 oF po 5.59 
Amount of close supervision received 15.97 a3 462 ae 
Independent thought and action 31.66 Hes 5.32 2 
Security 9.75** ne i o 
Pay 2.95 3 oe 303 aa 
Feeling of worthwhile accomplishment 35.590%* 30; = gal 
Participation in job-related decisions 21.097% 2 ot 477 5.65 
Development of close friendships 9.56 4.1 3.92 347 
Promotion 13.98"" A 5.40 E 
Respect and fair treatment from boss 14.64" 4 Á z 


Ployees in the three groups on each of the 
dependent variables. Means and F ratios are 
Presented in Table 8. 
_ Results are consistent with those reported 
in Table 6 and, in addition, tend to be more 
Substantial in magnitude and more statisti- 
cally reliable.’ Tt was expected that when jobs 
Were described as high on all four core dimen- 
Slons: (a) experienced pressures to take per- 
Sonal responsibility for one’s work and to do 
igh quality work would be high, (b) intrinsic 
Motivation would be high, (e) rated perfor- 
Mance quality and overall performance effec- 
tiveness would be high, (d) job satisfaction 
SS 


es used, unfortunately, do 
ount of variation 


correlational data 


mee The analytic procedur 
on permit computation of the am 
3 ntrolled, for comparison with the 
“ported earlier. 


and involvement would be high, and (e) 
absenteeism would be low. All of these ex- 
pectations were borne out substantially and 
statistically significantly, except for the one 
involving absenteeism. While absenteeism was 
lowest when jobs were seen as high on all 

differences among the 


four core dimensions, ere! mong 
three groups were not statistically significant. 

Although these results support the conten- 
tion that motivation, satisfaction, and per- 
formance are enhanced when jobs are high 
simultaneously on all four core dimensions, 
they do not imply that a disjunctive model 
for combining the core dimensions is neces- 
sarily the optimal one. Therefore, the relative 
usefulness of other ways of combining the 
dimensions was explored. In particular, (a) an 
unweighted sum of the four dimensions was 
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computed for each S and correlated with each 
of the dependent variables, (b) the four dimen- 
sions were used as independent variables in a 
multiple-regression analysis predicting each of 
the dependent variables, and (c) the product 
of the four dimensions (Variety X Autonomy 
X Task Identity X Feedback) was correlated 
with each of the dependent variables. The 
latter score is a continuous measure which is 
congruent with the disjunctive combinatory 
model specified by the theory. That is, for a 
product score to be high, all of the four core 
dimensions must be moderately high, and, 
therefore, the measure reflects the degree to 
which a job is high simultaneously on all four 
core dimensions. Correlations involving the 
product score were compared to those ob- 
tained using the other two combinatory models. 

Correlations between the unweighted sum 
score and the dependent variable: 
—.08 to .61, with a median of .30, Multiple 
correlations ranged from .13 to -65, with a 
median of .37. Correlations using the product 
score ranged from —.05 to -60, and also had a 
median of .30. Since some shrinkage would be 
expected in the magnitude of the multiple 
correlations obtained upon cross-validation, it 
Seems appropriate to conclude that the results 
did not meaningfully differentiate among the 
three means of combining the core dimensions, 
Thus, while the theory-specified disjunctive 
model is not disconfirmed by the data, neither 
has it been shown to represent a more adequate 


means of combining the dimensions than the 
other models tested. 


S ranged from 


Does High Order Need Strength Moderate 
Obtained Relationshi ‘ps? 


ained throughout this 
h are high on the core 
Motivating only to indi- 
rous of the intrinsic re- 


he jobs provide, namely, higher 
order need satisfactions, 


The analy: 
the possibility that Ss differi 


between each oi 
dependent vari 
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for those employees whose higher order oe 
strength scores were in the top one-third of t ; 
distribution of scores for all Ss, and for thong 
employees whose scores were in the borta 
one-third of the same distribution. The nei 
higher order need strength score for the E 
ployees in the top third was 6.78, aa 
mean score for employees in the bottom thi 3 
was 5.09. Since 5.09 is still one full point et 
the midpoint of the scale, it was expected thé 
even for these “low” need strength employes 
relationships between the core dimensions oe 
the dependent variables would be in the ea 
tive direction, but lower in magnitude ie 
the relationships for Ss in the top third of a 
distribution. It also was expected that k 
relationship found for the top third should i 
Stronger than those reported in Table 6 for @ 
Ss. Results are presented in Table 9. a 
For variety, autonomy, and, to a lesa 
extent, feedback, the expectation of a park 
ential relationship was confirmed. rere 
ships between these job dimensions and ba 
dependent variables are larger, and or : 
strikingly so, for the Ss with high higher or¢ a 
need strength than for Ss with modera n 
higher order need strength. Further, the se 
relations for higher need strength aripilor 
are typically larger than those reported in a 
general analysis involving all Ss (see Table i H 
and the correlations based on the lower nee 


Strength Ss are typically lower than the ge” 
eral relationships, 


The moder, 
strength was 
Part of the 


ating effect of higher order need 
not obtained for task identity: 
reason for this may have to do wil a 
the nature of the Particular jobs included !" 
this study. Among the jobs in this stud) 
which were high on task ide 
of telephone o 


Customer comes on the line unt! 
the called p answers and confirms i 
identity, The Operator will do and redo th! 
task asks many times during * 


H 
ay. Observations by the tor 
kly 


=e 0 
mall but “whole” ne 
then it should not be 


f, 


fy 


S 
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TABLE 9 
MODERATING EFFECTS OF HIGHER ORDER NEED STRENGTH 
Variety Autonomy Task identity Feedback 
Dependent variable 
High Low High Low High i 
igh Low 
mo strength strength | strength strength | strength strength PL Pass: 
ae | of intrinsic motivation 15 ao | 
Taki motivation jiy P an aan 
aking personal 00 39* | 
Dolog large quantit —!20 ~ LS 09 
s high quality work 09 
ated performance ee i pi a Be 
aay 
24 04 .23* 
effectiveness 05 ‘bt 106 
b satisfaction .28* Tal $3 
4 130* 15 
k <04 206 
Salt aine .06 30* 
ersonal growth and development 03 47% Soe a 
Arcatige of job inside company -17 324 cae 2 
\mount of close supervision received 355) 43* 5* 38% 
Independent thought and action ae | BS 37t a 
Security o4 38* iis -55 
Pay —10 15 26% Soe 
Feeling of worthwhile accomplishment ‘06 -49% zaar | 40 
Participation in job-related decisions :20* :32* 41% 1258 
Brvceament of close friendships atte aor 36% “13 
n = 3 : : 
omotic ‘ j 235 :40* 234% 
espect and fair treatment from boss 06 45* 33 3° 


Note.—N : 
* .—N = 67 in each group. 

b <.05 (one-tailed test). 

little difference in 
k identity and the 
loyees with high 


preng to find that there is 
a e relationship between tas 
ependent variables for emp 
versus low higher order need strength. 
Indeed, it would be expected from the theory 
that differences between high and low need 
Strength employees should be maximized when 
Jobs are moderately high on all four core 
dimensions. The job of the toll operator 
Clearly does not meet this criterion: while it 
ia reasonably high on task identity and feed- 
back, it is very low on both variety and 
autonomy (see Table 4). 
To examine the validity of 
ment, therefore, the product score (Variety X 
Autonomy X Task Identity X Feedback) de- 


scribed earlier was correlated with the depen- 


dent variables separately for the third of Ss 


highest in higher order need strength and for 
the third of Ss lowest in higher order need 
Strength. Since the product score reflects the 
degree to which a job is seen as being simul- 
taneously high on all four core dimensions, 
Substantial differences would be expected in 
the magnitude of correlations obtained for 
high versus low need strength employees- Re- 
sults are presented in Table 10. 
„Again, obtained relationships 

higher for Ss who were high in re 


this argu- 


were much 
Jevant nee 


strength than for Ss low in need strength for 
nearly all of the measures of motivation, satis- 
faction, and performance. The difference be- 


TABLE 10 


MODERATING EFFECT OF HIGHER ORDER NEED 
STRENGTH FOR JOBS HicH ON ALL 
Four CORE DIMENSIONS 


High Low 

Dependent variable need need 
strength | strength 

Sat .23* 


Doing 
Doing 
Rated performance 
Quantity 
Quality 
Overall 
General job sa 
Job involvement 
Absenteeism (no. of times absent) 
Specific satisfaction items | 
Self-esteem obtained from job 
Personal growth and development 
Prestige of job inside company 
‘Amount of close supervision rece! 
Independent thought and action 
Security 


Pay : = 
Feeling of worthwhile accomplishment 


Participation in job-related decisions 
Development of close friendships 


Promotion. 
Respect and fair treatment from boss 


effectiveness 
isfaction 


ived 


Note.—N, = 67 in each group. 
a The difference between the cor 
strength Ss and for low need strengi 


< 05. 
kp < .05 (one-tailed test). 


high need 


relations for 
ficant at $ 


th Ss is signi 
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tween the correlations for high and low need 
strength Ss was tested for statistical signifi- 
cance, and these results are included in the 
table. Even though most of the correlations 
were positive (because of the relatively high 
mean need strength for even the low need 
strength employees), a number of pairs of cor- 
relations were significantly different from one 
another. All in all, the data make a strong case 
for the moderating effect of individual higher 
order need strength in determining the effects 
of job characteristics on employee behavior 
and attitudes at work. 


DISCUSSION AND IMPLICATIONS 


Job Design and Individual Differences 


The results of this study suggest that there 
are important interdependencies among the 
characteristics of individuals and the char- 
acteristics of jobs which must be taken ac- 
count of in the development of any full under- 
standing of the impact of various kinds of job 
designs. Both the advocates of a “scientific 
management” approach to job design (make 
the work routine, simple, and standardized) 
and the more recent Supporters of “job en- 
largement” (make the work complex, ¢ 
ing, and demanding of individual respo 
and decision making) appear to have 
insufficient importance to individual 
actions in determining affective and 
reactions to jobs. Those of the scie 
agement persuasion, f 


halleng- 
nsibility 
attached 
-job inter- 
behavioral 
ntific man- 
or example, have tended 
to assume that the typical employee will be 
content, if paid judiciously for his 
to work on jobs which 


d will work 
y have a job 
them. The present research 
he character- 
» both points of 
Practices which 


are appropriate some of the time and inappro- 


priate other times, 

The present findings 
well with the previous res 
Lawrence (1965) and Hul 


and conclusions fit 
earch of Turner and 
in and Blood (1968). 
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In both of these studies a sociological level 
variable (urban vs. rural background) hes. 
shown to moderate the relationsiip bahri 
job characteristics and employee Toe. 
The relationship between job level (i.e. e 
which would be high on the four core eee i 
sions) and satisfaction was found to be l = 
for rural workers and low for urban Leas 
The present study demonstrates that TEA 
ual higher order need strength also moden y 
the relationship between job level and eat 
faction (as well as the relationship ce i 
job level and other dependent ee ae 
particular, individuals with strong —— 
higher order need satisfaction respond n da 
more positively to high level jobs tan 
individuals who have weaker higher or 
needs. , nT. 
It may be that urban workers desire mgin 
order need satisfaction less than do md 
workers—in which case the findings ad F 
earlier studies and the present study woul os 
highly congruent. This possibility ge ane 
with the present data and a small differe a 
in the expected direction was in fact cam 
workers with rural backgrounds were De 
on higher order need strength than w 
workers with urban backgrounds (t = e 
$ < .10). Thus, the present research has ta lif- 
the previous finding regarding urban-rural eo 
ociological level to, ily 
fined it, and successit > 
resent results also ea 
Previous results in Bee: 
of individual differen 


F r ariot 
Spond to jobs with vari 
characteristics, e 
abov' 
Tt has been assumed throughout the ab 
discussion that 


ally 
i E x 1a 

: Job characteristics act! 
cause the differences in e 
Motivation, 

which were 
tions which were made (and confirmed i- 
the data) were based on a conceptual fra pe 
work which includes Causal propositions, oint 
study design was correlational and at no P 


by 
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Fa causal links in the theory directly 
that th here is, however, reason to believe 
eor A causal directions specified in the 
ee on be correct. While the simple cor- 
EFTS etween job characteristics and the 
aed variables (see Table 6) are open to 
to be : sie causal interpretations, this appears 
Eitri ot h less the case for those data which 
Hikers oher more specific, predictions of the 
the i n particular, it is difficult even after 
hicks to imagine an alternative causal model 
all hane predict simultaneously that (a) 
ey of the core dimensions need to be 
and bet fo realize the most positive affective 
ee reactions on the part of em- 
higher , en that (b) the strength of desire for 
tially 3 a er need satisfaction will substan- 
a, = erate the relationships between the 
5 imensos and the dependent variables. 
be ce therefore, that a cautious interpre- 
— at the results in terms of the causal 
able, an job characteristics may be reason- 
ae his does not deny the fact that any 
aliena can best be tested by experimental 
AN ation of the independent variables, and 
anipulation of the core dimensions 1n an ex- 
Enea setting would seem to be a clear 

xt step in research on job effects. 


Use of Perceived Job Characteristics 


lo one of the major conceptual and methodo- 
T problems which pervades studies of 
with FAR job effects on behavior has to do 
they p differences between task materials as 

y exist in objective reality and as they are 
Perceived by individual performers. Tasks and 
Be are invariably redefined by the individuals 
S 10 perform them, sometimes deliberately and 
ometimes without full awareness by the per- 


formers of the changes or re-emphases that 
are being made. Further, it is the redefined 
hich the 


task rather than the objective task W 
dividual tries to perform, and thus only 
those aspects of tasks or jobs which are 
els perceived or experienced by @ per- 
ormer can have an impact on his performance 
and attitudes. This would seem to argue for 
the use of task and job characteristics 2S 
described by the performers themselves 1n re- 
Search aimed at ascertaining the effects of 
these characteristics on performance. Yet when 
Such subjective assessments are used, many of 
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the important conceptual and methodological 
advantages associated with the use of inde- 
pendently and objectively described indepen- 
dent variables are lost.” 

There are at least two strategies for dealing 
with this problem in research on task and job 
effects on behavior. One, which was used in 
the present research, is to employ subjective 
assessments of the tasks or jobs by the per- 
formers themselves, but simultaneously to 
develop means of determining the relationship 
between these assessments and others, includ- 
ing objective measures when possible. Thus, 
in addition to employee descriptions of the 
characteristics of their jobs in the present 
study, jobs were rated by three other methods: 
by company managers, by the researchers, and 
by an adaptation of the Turner and Lawrence 
(1965) procedures for describing jobs in opera- 
tional terms. As was shown in Table 2, there 
was high convergence among the four methods 
for all job dimensions except feedback. Thus, 
while the job descriptions used in the analyses 
are those actually experienced by the em- 
ployees (and thus presumably are more di- 
rectly causal of their behavior than would be 
the case for objectively described character- 
istics), they also have been shown to have 
substantial convergent validity and to be 
based upon the objective job characteristics. 

A second strategy for dealing with the re- 
definition problem has been proposed by 
Hackman (1970) and, while methodologically 
more difficult to implement, probably has more 

otential for the development of a general 
understanding of the effects of tasks and jobs. 
In essence, this strategy suggests that the 
redefinition process should be viewed as the 
first stage of the performance process itself, 
and the redefined task should be conceived 
of as a potentially measurable intervening 
variable in the causal chain between the ob- 
jective task input and the dependent variables 
of interest (egy performance, satisfaction). 
Just as individual differences in need strength 
have been shown in this research to moderate 
the effects of job characteristics on employee 
behavior and attitudes, so would individual 
needs, values, and goals be expected to inter- 


enid EES 
10 The conceptual and methodological implications of 


the task redefinition phenomenon are discussed in more 


detail elsewhere (Hackman, 1969b; Weick, 1965). 
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act with the objective task or job in influencing 
task redefinition. It might be, for example, 
that individuals with a particular pattern of 
needs tend to redefine the tasks they perform 
to be more consistent with those needs than 
is actually the case. These individuals might 
then develop hypotheses about appropriate 
on-the-job behavior which they expect to lead 
to need-satisfying outcomes, but which, be- 
Cause of the lack of congruence between the 
objective and the redefined task, might in fact 
serve only to set the stage for disillusionment 
and frustration. In any case, it appears that 
research on the ways different individuals 
redefine tasks and jobs may have considerable 
potential for furthering general understanding 
of the effects of tasks and jobs on the behavior 
and attitudes of performers. 


Nature of the Four Core Dimensions 


The results of the Present research show 
that, in general, employees with 


addition, it was shown that fo 
favorable outcomes, jobs need to 
moderately high on all four of the 

Further refinement of the core dimensions 
and exploration of their impact on individuals 
clearly is called for by the research. For ex- 


r the most 
be at least 
dimensions, 


means), it 
g at it is not necessary for 
, that one 

€ other). The finding of 
hat jobs which were 

\ and low in task identity 
are Not associated with high quality perfor- 
hers, would tend toa 


É ty, however, er 
tive possibility which S oo 


to the Hall and Lawler i 
moderate level of varie 
to keep the employee f 
his work (Scott, 1966), 
achieved, experienced 
work will vary direct] 


be contrary 
is that some 
ial simply 
rom being bored with 
and that once this is 


Meaningfulness of the 
y with th 
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task identity present. F inally, it may be pe 
increases in variety can serve two ainera 
functions, depending on the amount initia a 
present on a given job. For low initial le 
of variety an increase may serve mainly a 
decrease the monotony of the work and make 
it possible for perceived meaningfulness a 
vary as a function of the amount of pe 
identity present, as suggested above. At so 
point, however, variety may assume a ditey 
function, namely, introducing challenge a 
the work which, when successfully dealt m 1 
can be satisfying to many individuals. Clear n 
additional research on the psychological impa! 
of these dimensions is called for. 

Additional work also needs to be done on 
the Operationalization of the feedback dimeny 
sion, since no Convergence was obtained a 
the several methods for m easuring this dime! 
sion. The fact th 
predicted in the 
the employee 


: ‘ “ee on 
groups who rated the jobs did not ate ct 
back present, the shit 
mension remains high 


jety an 

-67) between variety 4 
y should be tested. The 
of the scales used 


ilar, and it may be t! 3 
h correlation Was obtained simply a 
high variety in the Pa 
‘ation where the study ae 
Place also tended to be jobs which provide 
considerable autonomy. ‘Phe fact that mos 
telephone tra (i » various kinds r 
variety and = 
tainly facilitate obtaining | 
on between the dimension’ 
gest that the high correlation i 
t organization specific. The ve 
€r, to believe that the posi y 


n 
£ 3 otwe® 
acy high: correlation be 
‘decision time” (which bas ss 7 
conceptual Similarity to autonomy) act 


~- 


pe 


Sample of 30 jobs which included management 

Jobs. It appears well worth examining the 

pe ey that the psychological meaning of 

E wo dimensions is similar in ways that 

Br not been attended to in the present 
ptualization. 


I, ‘ 
nler personal Com ponents of Jobs 


~~ two job dimensions reflective of the 
€rpersonal components of jobs (dealing with 
alg and friendship opportunities) did not 
on o employee work motivation or perfor- 
F aby The dimensions did relate positively 
oe Same kinds of satisfaction, but the rela- 
ae PS were not as substantial as those in- 
ving the four core dimensions. 
a onding to the conceptualization on which 
a oe study is based, the degree to which 
int require interpersonal activities should re- 
ate to work motivation only when (a) workers 
a high desires for the satisfaction of social 
ica and (b) working hard on the job can 
ewe the satisfaction of these needs. Even 
ie i Jobs which scored relatively high on the 
die interpersonal dimensions (e.g, some op- 

‘ators jobs) fail to meet the latter criterion. 
oors: reported in fact that they could 

Aun social satisfactions best when they were 
Not “working hard’? by company standards 
(i.e, completing a large number of calls), 
Since when the load of calls was heavy they 
had little or no time for meaningful inter- 
Personal activities with either customers or 
fellow employees. 

Even if jobs were designed so that relating 
Meaningfully to others was an integral part 
of doing the job well, there is reason to doubt 
whether such jobs would have long-term mo- 
tivational payoffs. The reason for skepticism 
is the arguments of Maslow (1953) and of 
Alderfer (1971) (presented earlier in this 
article) that when individuals have had ample 
Opportunities to satisfy their social needs, the 
level of desire for additional social satisfaction 


will decrease and the level of desire for higher 
order need satisfaction will increase. When 
should be 


this occurs, of course, then jobs 
high on the four core dimensions (rather than 
high on an interpersonal dimension) to pro- 
vide conditions for internal work motivation. 

All of the above is not to suggest that the 
interpersonal aspects of work are unimportant. 
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They do affect satisfaction and it is also pos- 
sible that they affect other variables which 
were not considered in this study (e.g., prob- 
lem-solving effectiveness, turnover). It should 
also be noted that all of the jobs in the present 
study were at least moderately high on the 
interpersonal dimensions; it is very likely that 
important negative reactions might be ob- 
tained from employees on jobs which were 
very low or totally lacking in interpersonal 
aspects. Further, the present data address 
only the degree to which the existing structure 
of jobs requires or provides opportunities for 
interpersonal activities. Alderfer (1967) has 
shown that when jobs are changed, interper- 
sonal relationships (especially between the 
worker and his supervisor) are markedly af- 
fected, and that “relationship problems” clearly 
have the potential of negating or reversing 
increases in motivation and satisfaction which 
are anticipated as a result of job enlargement. 


Nature of the Impact of Jobs on Behavior 


Until relatively recently, research attention 
given to jobs has focused mainly on the ways 
jobs can be designed for most eficient pro- 
duction, or on the analysis of jobs into com- 
ponent parts to facilitate employee selection, 
, and compensation. Since the mid- 


placement 
attention gradually 


1950s, however, research 
has shifted toward examination of the effects 
jobs can have on the people who do them, 
and how jobs can be designed so that these 
effects are desirable for both employees and 


organizations. 


Many studies, including this one, have shown 


that jobs can and often do have a substantial 
impact on employee behavior and attitudes; 
this finding is by now rather well documented. 
It appears that additional understanding of 
the nature and implications of job effects will 
require more direct investigation of the process 
by which these effects take place. 

The present study represents some progress 
in this direction, by providing data which 
show that certain job characteristics and 
certain individual differences interact in deter- 
mining behavior and satisfaction. Apparently, 
individuals who are desirous of higher order 
need satisfactions will, when working on a job 
high on the core dimensions, gradually develop 
and/or verify the hypothesis that personally 
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valued rewards can be obtained by working 
hard and effectively on the job. When this 
occurs, the job can be said to have influenced 
the behavior of these employees by affecting 
their personal hypotheses about what kinds 
of behaviors will lead to favorable outcomes, 
Thus, one general way that jobs can influence 
behavior is by creating conditions whereby 
employees are likely to develop and validate 
specific behavioral hypotheses about what they 
can do on the job to obtain work outcomes 


favorable to themselves (or to avoid unfavor- 
able outcomes). 


on the job. For example. 
behavior by influencing t 
of employees at work 

arousing (or depressing) 


discussed in some detail by Hackman (1969; 
1970). The Point of emp 


effects can and do come about via a number 
of different (albeit i 


s these processes will operate 


and will be mutually reinforce. 
Ing; other tim 


interact with individual differences would seem 
to be warranted, 


Implications for Organizational P 


Standard organization 
ment Procedures attem 
ilities of a pros 


ractice 


and °Pportunities of 
the present results 
als who desire higher 
n will be likely to con- 


Suggest that individu 
order need satisfactio 
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tribute most effectively to organizational ee, 
(and simultaneously to satisfy their own ae A 
if they are placed on jobs which are high 2 
the four core dimensions. Other employs 
of course, who may be neither desiro of 
higher order need satisfactions nor eat 
dealing with complex jobs requiring consi “a 
able autonomy, would be ineffective on st 
jobs and dissatisfied with them. sas te tHE 
It appears from research cited earlier in oat 
article that many organizations err rather be 
sistently by designing jobs which are too ay 
on the core dimensions. The present a 
Supports this conclusion. It suggests that t ae 
are many workers who want to obtain ie i 
higher order need satisfactions from their w we i, 
but few who are overwhelmed by the eet 
chological demands of their jobs. The imp a 
tion of this argument, of course, is that ee 
nizations might be well advised to cons! 
redesigning many of their jobs. the 
When job enlargement is carried out, m 
question often arises whether the chang 
should be toward horizontal enlargement (i-€» 


S 3 3 hings an 
increasing the number of different thing 
employee does) o 
ment (i.e., incr 


awler 
about his work) or both. Lawl 


d arding 
he literature regarding 
the effects of vertical 


largement 


study provide some su 
a job is en] 
ployee likely to feel 


a ? me 
Is work outcomes; and only if a job has so! 
amount of horizont 


A ikely 
d al enlargement is he lik il 
to experience his Work as meanin gful—althous, 
it should be i 


. rr 

kept in mind that too much E i 

apparently can cause a ae 
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ie potential of jobs can be realized only 
aa the psychological demands and oppor- 
ee bi of jobs mesh well with the personal 
fem a goals of employees who work on 
T is kind of matching can be developed 
ies : selection and placement of employees, 
a wi Job redesign, or (perhaps optimally) 
a = sa to fit people to jobs and jobs 
sth pa simultaneously and continuously as 
of it e organization and the characteristics 
its employees change over time. 
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EFFECT OF JOB ATTITUDES, TRAINING, AND ORGANIZATION 
CLIMATE ON PERFORMANCE OF THE 
HARD-CORE U {EMPLOYED ' 


ER? anp STUART GREENBERG 


FRANK FRIEDLAND 


School of Management, Case Western Reserve University 


The following potential contributors toward the job performance and retention 
of 478 hard-core unemployed (HCU) were explored: the HCU’s biographic and 
demographic background, his attitudes toward work, the organizational climate 
d the effect of a 2-wk. training/orientation program in 


in which he is placed, an 
None of the HCU’s biographic/demographic background 


which he participated. 

data or his attitudinal characteristics were related to his job performance Or re- 
k. orientation program have any effect on his attitudes 
toward work, job performance, Or retention. The sole correlate of the HCU’s work 


effectiveness and behavior was the degree of supportiveness of the organizational 
slaced. In addition, the HCU saw this climate as far less 
s geared 


tention nor did the 2-w 


climate in which he was p 


J supportive than did his supervisor. An important inference is that program: 


primarily toward adapting the HCU’s work attitudes to the predominant social 
structure in the organization are far less potent than those that also incorporate 
the adaptation of the organizational climate. 

educational and economic level ; it values work 
jmarily as instrumental to getting money, 
han for the sake of involvement an! 


With the sharpened focus directed today 


toward the training and employment of the pr 
hard-core unemployed (HCU), we nee a rather t 

better understanding of the objectives of the having a job (Goodwin, 1969) ; and its members 
aining and the relative frequently have a police (or other deviant) 
phasis of most training pro- 
the HCU so that he can 


methods selected for tr 
impact of these methods. One of the implicit record. The em 
objectives of such training is to provide the grams is on changing 
‘ithin the organization, 


minority culture? with adapt to the culture W 

rather than attempting to effect change or 

in the organizational culture to 

t culture is white, adapt to the HCU. These cross-cultural issues 
get acted out by both the predominant and 

minority cultures within the arena of training 

T o placement and within the 


rganization subsequent to 


participant from a 
values and attitudes that facilitate his adapta- 
tion to the predominant social structure an adaptation 


its culture. The predominan 
ucated, of a middle- 


and one that values 
t culture in programs prior t 


employed, more highly ed 
class socioeconomic level, 


highly the ethic of work. The targe 
is primarily black arena of the o 


many training programs 
and unemployed; it has a relatively lower placement. ? 
The study that follows dwells primarily on 


1 Thi z was ed by Jnited States De- $ A 
Tbs sti we supported py tae G Policy eaha evaluating a program that was intended to 
a A i 5 f 
der Contract No. 41-7-002-37. recruit and train HCU persons, to provide 
ent the Depart- them with permane 


the authors arè pusiness organizations, 
racy of all ma- entry and retention in these organizations 
after placement. The program in its entirety 
was directed toward alleviating two interre- 
issues: unemployment, and atti- 
king world of 


partment of Labor, 


tion and Research un D 
t necessarily represi 


This article does n° ae employment ia private 
ment’s official opinion or policy, and aed to support re, 
solely responsible for the factual accu 
terial in the article. 

The authors are indebted to 
rviews from W 


Lester D. Jaffe who con- 
ducted the inte hich the climate instru- 
lated social 


b ment was derived. N ien 
2 Requests for reprints should be sent to Fran! tudes inapp 
Friedlander, School of Managemen 
Reserve University, 
3 The term “culture” 
in that it provides a 
dy-made set © solu- 2 J str 
a significant changes in the HCU 


2 


ropriate to the wor! 


Cleveland, Ohio te Westem fhe predominant social structure. 

is used in the traditional an- The specific purposes of our research were 

thropological sens? human beings twofold: first, to evaluate the impact on 

with a design for living, yale Fee s work attitudes (a) of 

tions for human problems, and so serves 
(Lewis, 1966). 


adaptive function 
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a 2-wk. orientation/ training program in social 


es 
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skills, and (b) of Job placement; and second, 
to examine the predictive power of the follow- 
ing variables as to whether the HCU will be- 
come successfully employed : (a) his biographic/ 
demographic background, (6) his attitudes 
toward work, (c) changes in his attitudes to- 
ward work that might result from the training / 
orientation program, and (d) the nature of the 
job climate in which he was eventually placed 
for employment. In a larger sense, the authors 
were interested in exploring the feasibility of 
increasing the adaptiveness of the HCU to his 
organizational and social climate; an alter- 
native might have been to attempt to effect 
change in the climate itself. The variables 
used in this study to explore these several 
issues included psychological (biographical 
data and initial attitudes), social (demo- 
graphic data), learning (changes in work 
attitudes as a function of the training program 
or of having a job), and Organizational (the 
social climate in which the HCU was placed). 


Program and Ts Participants 


The training and job placement efforts ex- 
plored in this study were primary components 
of the AIM-JOBS Program in Cleveland, 
which was a federally funded Concentrated 
Employment Program (CEP). Th 


formulated in 1966 by the Presi 


Problems of the are: 


, the first funded CEP, 
Was established in Cleveland in 1967, with the 
ng securely in perm, 


i anent 
disadvantaged res 


» Job orienta- 

2 -velopment and Placement, 

and supportive Services to Participants, 
The training/orientation 

a 2-wk. period during 


Was assigned toa counsel 


l gram covered 
which the Participant 
Or/teacher who worked 
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with the participant individually and in ae 
The following subject areas were Oye s- 
these sessions: (a) human relations—discus 
sions with the broad goal of strengthen 
respect and understanding for oneself p 
others through recognition of human a 
versity; (b) job ag Ale rg ae 
getting a job, including how to fill ou ae 
application, how to conduct oneself at T yer 
terview, acceptable work attitudes, emp eh 
expectations, attendance and pace 
fringe benefits, routes to advancement, iis 
(c) money Management and budgeting— oo 
cussions of the importance of sound ene fe 
management related to personal goals, biti 
ing, investment and savings, installment Pa 
chasing, credit ratings, etc.; (d) grooming A o 
personal hygiene—discussions of proper a 
for interviews, Personal grooming, ete.; D 
physical fitness—including exercise, nutrition, 
recreation, weight control, etc. ; (f) transporta 
tion and urban development—map readin 
and use of public transportation systems 
reach job sites, 

Recruitment into t 
was restricted to th 
inner city Poverty ar 
age range of 18-35. 
cluded who had 
from the Progr: 


he AIM-JOBS Program 
ose who resided in the 
eas and who were in the 
Applicants were also m 
obvious inability to benes 
am because of grave physica’ 
or mental impairment, Participants in the 


Study were drawn from the 839 HCUs who 
entered the AIM-JOBS 


was 26; 91% 
of unemploy? 
cement was 15 wk. ; 2070 
tds (exclusive of trafic 


and misdemeanor arrests) ; average education 


Was completion of ten 


l th grade; average reading 
skill was approximately. at the eighth-grade 
level; and 


i average arithmetj 
Proximately at th 


Mary, the Popul. 
largely male, black o 


» and obvio 
to the Program, 


© skill was ap- 
© sixth-grade level, In sulle 

as 
for the research wa 


Variables 


jor sets of variables included: 
/demographic data, (b) mea 


The four ma 
(a) biographic 


a 
lo 
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TABLE 1 


Listinc OF NINE SCALES OF ATTITUD! 


ES TOWARD WORK UseEp IN STUDY 


Source 


Attitude scale ry | Item Measure 
1. Motivation to v ive i 
i work | .66 9 | Relative importa vorki 
esti coal ive importance of reasons for working 
work 52 7 | Degree hi j BERE 
a ten oaii | 7 egree to which HCU finds work disagreeable 
unemployment | 49 4 | Discomfort in bei { 
4. KERS ad n being unemploy ed 
: vious job 71 6 | Recollection of previous job 
5. Importance of job 
6P characteristics 64 18 | Relative importance of various job characteristics 
J Pried self-image 67 6 | How HCU would like to be seen by his friends 
. Powerlessness 70 6 | Degree to which HCU feels he cannot determine 
Troia his future 
- Cynicism .73 6 | Degree to which HCU is cynical about life and its 
Orvik opportunities 
+ Vigor 60 4 | Perception of his own pace and vigor 


ee 
Indik (undated) 
Indik (undated) 


Indik (undated) 


Friedlander (1965) 
Wilensky (1964) 


Rotter (1966) 


Srole (1956) 
Guion (1966) 


Note.—HCU = hard-core unemployed. 
sures of the HCU’s attitude toward work, (c) 
measures of the organizational climate in 
which the HCU was placed and employed, 
and (d) criteria of the HCU’s job performance. 


Biograph ic/Demograph ic Variables 


A variety of demographic and biographic 
data was collected from the records of all par- 
ticipants in the Program. Demographic data 
included length of time at present residence, 
geographic area of birth, and duration of prior 
unemployment. Biographic information in- 
cluded number of dependents, number of pre- 


vious jobs held, education, marital status, prior 
police record, and Jevel of reading and arith- 


metic skills. 


Altitudes toward Work 


A number of measures were administered 
that were either based on past research or Were 
developed specifically for this study (see 
Table 1). We hypothesized that scores on tne 
first four scales listed in Table 1 would be 
predictive of job success and that some of these 
might be afiected by the orientation/ training 
program. For example, the HICU’s perception 
of his previous job experience mi ll be 


related to his expectation 
experience and, therefore, 
and retention. We further hypothesize 
the job success of the HCU would b 


of the extent to which job charac 


portant to him (Scale 5) were found in his 
eventual job and the degree to which the role 
demands of that job were congruent with his 
prized self-image (Scale 6). Finally, we hy- 
pothesized that the HCU’s powerlessness, 
cynacism, and vigor (Scales 7,8, & 9) would 
predict attitude changes resulting from the 
orientation/ training program. 

The reliability coefiicients of these scales are 
of a minimally acceptable magnitude, although 
the following two factors may account for this. 
First, these reliability measures were computed 
on HCUs in training—that is, the 2-wk. span 
between test and retest encompassed an orien- 
tation program supposedly relevant to these 
attitudes. In @ strict sense, the coefficients are 
therefore not representative of a pure test- 
retest reliability, even though no pefore-aiter 
differences were found to 
Second, HCUs represent 4 culture that may 
well have experienced these questionnaires as 
alien to its values, resulting in suspicion of and 


resistance to the questionnaire. 


Job Climate 

Climate is conceptualized as an interaction 
of personal factors (personality, needs, values, 
etc.) and organizational properties (structure, 
supervisory practices, etc). This relationship 


emphasizes the role of perception of organiza- 
i & yon Gilmer, 
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Preliminary interviews with HCUs indicated 
that one of the components of the organiza- 
tional climate of most concern to the HCU was 
the degree to which he perceived the climate 
as supportive. Specifically, three aspects of a 
Supportive climate seemed most salient: (a) 
new worker treatment, (6) support from peer 
workers, and (c) support from his supervisor, 
Both HCUs and their supervisors completed 
questionnaires concerned with these 
aspects of their work climate, : 

The specific items that comprised each of 
the three climate measures are listed below, 
Response options for each ite 
Š-point multiple-choice Likert 
within each of the three measur 
bined to yield three climate score 
account the arithmetic sj 
item, 


three 


m were on a 
scale. Items 
es were com- 
S, taking into 
gn that follows each 


1. New worker treatment. (How are new workers at 
your plant generally treated?) They are usually made 
—); They are usually given more 

breaks than others (+); They are usually treated like 


all the others (+; They are usually given a hard 
time (—). 


2. Supp 


now how to do Something (+); 
Almost everybody g 


(+); Most of the wi 
It’s not smart to 
tend to take advan 

3. Support trom 
where I work?) To get ahead, you have to “brown 
Nose” (—); Supervisors wi 


ould just as soon get rid of 
you rather than teach you or help you ona job (—). 


like to work 


Criteria of Job Success 


Three different criteria of the HCU’s job 


obtained : job retention, 
nd work behavior, Job re- 


istics, each of which 
items: competence ( 


(is friendly and agreeable), e 


his best and works Carefully), and reliability 


(shows up each day, shows up on time). Ti 
HCU’s supervisor was given the a pee sa 
structions in this rating: “Compared to o ef 
employees doing the same or siriar W 
(or at a similar skill level), how wou y 
rate this employee on each of the R 
The multiple-choice format ranged in eq x: 
percentile intervals from “top 20% 
“bottom 20%.” ; 
The ro A criterion was designed 
to determine the supervisor’s general dech 
tion of the HCU as a person in the jae 
situation. The work-behavior criterion : be 
Composed of three component characteris E 
of the HCU as (a) smart (he knows wian 
going on in life, he does whatever he does a ); 
he knows how to do many things, he is smar ‘ 
(6) friendly (he is a good friend to people, ja 
is a friendly person); (c) conscientious K 
wants to do his best, he does a careful job, H 
wants to do a good job). A 5-point multiple 


choice Likert-type scale followed each © 
these items, 


METHOD 


For questions con 
of the 2 


ientation program 
the orientation program a 
3 (c) after the HCU had ya 
placed on a job and was working (6 mo. after the firs 
- The impact of Participating in e 
gram on changes in work attitudes W2 
ce between b and a above, 
i f job placement and actually holding 
a Job on changes in work attitudes was measured 
veen c and b above, 


Sample sizes for different parts of the study ranged 
from 478 to 22, In general, random sampling for eac 


Successive phase in which HCUs participated was ac 


complished, except that those who dropped out were 
obviously excluded from re 


“aH $ i for 
ceiving questionnaires 
future phases, Of the 839 HCUş Sia initially entere 
the Program duri 2 mo. of the study, 478 
were randomly ‘ompleted the first ques- 
tionnaire on ent; gram. Biographic/demo- 


ng the initial 
selected and ¢ 
Ty into the Pro; 


ae 
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phic on were available on 300 of these HCU. of 
129 “ae h %) who completed the orientation program, 
Metio F randomly selected and completed the second 
Ninet naire (at the end of the orientation program). 
ee ee aan were matched on the first two 
ered aa administrations. of the 428 who com- 
‘cen porentener program, 147 (31%) were working 
110 Ses Oe iced 6 mo. later. ‘And of these 147, 
22 Sata sted a third questionnaire. Among these 110, 
ae a with the second questionnaire ad- 
a ation, and 24 were matched with an evaluation 
y oas HCU’s supervisor. 
Tesi control group, matched by age, educ: 
bilit ce, and work history to the AIM 
Er y requirements, Was obtained fr 
mployment programs. They were 

teal as AI M-J OBS participants under similar 
ee, ave of administration. They were readministered 
of 105 e questionnaire from 3 to 6 wk. later. A total 
complet matched control group questionnaires was 

ed. 
Parany used cither Pearson correlation coef- 
a ate simple differences between means. Coeficients 
Sie tall erences were tested for significance using 2 
REN ed £ test at the .05 level of significance fOr corre- 
ated or uncorrelated means. 


RESULTS 


_ Our findings are divided into two sections: 
3 first, those dealing with changes in HCU’s 

attitudes toward work as a result of the 
orientation/training program and as & result 
of working on @ job and, second, those findings 
dealing with the prediction of HCU job per- 
formance and retention. 


Work Altitudes of the Hard-Core 


Changes in 
Unemployed 
computed 
. toward-work measures betwee 
last day of the orientation program \ 
None of these differences Were i 
Similarly, differences betw 
attitudes On the 


program and his a 
job (a 6-m°- interval) were 
(N = 22). Our conclusion 1$ 


perience oster! 


» attitude towar : 
attitudes as 


on the 
instruments used. 


was predictive f work effectiveness 


behavior, OY job retention 
more, the h ingraphic/demographi dat 


not significantly correlated with HCU work 
the orientation 


attitudes at the outset of 
N = 300) or at a period of 6 mo. 


program ( 
later (N = 83). 

None of the nine measures of the HCU’s 
attitudes toward work at the outset of the 
orientation program was predictive of work 
effectiveness, work behavior, OF job retention 
(N = 83). The only variable that seeme! 
relevant in any way was the degree of vigor Or 
pleted the 2-wk. orienta- 


energy. Those who com 
N = 428) were significantly 


(05) on vigor than those who 


higher (p < 
before completing the program 


dropped out 
(N = 50). 

Our conclusion is that the job performante 
and retention of the HCU in this study can 
neither be understood or accounted for by 
knowing his attitudes and motivation towar! 
work nor by knowing about his previous work 
history, his geographic mobility, his education 
and scholastic skills, his family structure, OF 
his prior police record. In a broad sense, the 
characteristics and perceptions that the HC 
brings into the world of work (in terms of the 
piographic, demographic, and attitudinal di- 
mensions measured in this study) seem to have 
little to do with whether he performs well on 
the job and little to do with whether he stays 
on the job. 


is a finding which emerges from 
a comparison of perceptions by th 

his supervisor k climate in which the 
HCU is placed. The second comes from an 
analysis of the relationships between the 
supervisor’s evaluation of the HCU and job 


retention. 


The dramatic differences between 


‘ed in Table 2, Perceptions bY 
HCU of the Jack of supportiveness 

to new i particularly 
In the case 


the difference in perception js at 


provided 
noticeable. 
variables, 
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i z behavior 
TABLE 2 ratings of work effectiveness /work bel 
PERCEPTIONS Or Work Crostate Herp and job retention. It js apparent that 
o ISON OF 
$ =e THE HARD-CORE UNEMPLOYED AND 


two criteria of job success are com oy 

THEIR SUPERVISORS the most part two separate dimensions a 

haps Supervisors have little control over re i. 

Perceptions of tion of HCU Persons, this decision is ma a 

work climate r primarily by the HCU (to quit), or by Pai 

Components of held by Dite sures on the Supervisor from higher an 

Morcclinate ss within the organization (not to fire the HCU). 

gy Se This does not imply that supervisors E 

in Seneral, critical of the HCUs working un a 

New worker treatment 19 A 2.1 them. In response to the work eee 
ae fon ane Fe A by question in which Supervisors were asked 

uppor i H i 


m y re in 
compare the HCU to other employees during 
Note.—HCU = hard-core unemployed. 
8 


4 er- 
i the same or similar work, the average ee 
2 ke N=24 (matched pairs Of workers and their centile ratings for the HCU were as eet 
eo) 7 + Oe an aes 
=a N = 73): reliability, 46th percentile; co 
least two full scale points (ona 5. 


-Point scale), Petence, 54th Percentile; effort expended, a 
i Percentile; and Sociability, 60th percha 
Sets Sxceeds'8, Tt is appoint that the HCU Tt should be noted that the HCUs were os, 
i i ly less Suppor- higher on competence than reliability (p < .05). 
tive than does his Supervisor, j 


When the HCY iS present, his work is com- 
8 results in Table 3 parable to th 


at of other employees ; the problem 

t= 60) is absence and lateness not his competence. $ 

between the supervisor’s evaluati he main body of findings in Table 3 indi- 

U as reliable and the HCU’s j cates that HCUs who percei z 

(number of weeks worked). Those HCUs who Supportive also tend to pe rated by their 
are rated as most reliable (“he sh 


OWS up each Supervisor more favorably in terms of work 
day,” “he shows UP on time”) by their Super- effectiveness and work behavior, Those HCUS 
i rely Short duration on i i 

the job. Or, conversely, those w 


J y rated as more competent an 
Ss reliable by i 


as 
: cllow workers and ics 
5 : aving the general behavioral characteristi 
In Seneral, zero to negative relationships i 
are indicated in Table 3 h 


s of being Smart, friendly, and conscientious: 
etween Supervisory i r 


TABLE 3 


LD BY THE HARD-Corp UNEMPLOYED 
s Evatvarroy OF Tir 


RExatyo: 


NSHIPS BETWEEN PERCEPTIONS He 
LIMATE ANp THER SUPERVISOR’, 


OF THEIR Work 
R Work PERFORMANCE 


Perception of his work 
climate by the HCu 


New worker treatment 
Support from Peer workers 
Support from Supervisor 
No. weeks worked 


scient 
m A 49** 
40" 


58** 


—.13 


ae tive T 
ir climate as supportive 
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also show some tendency to be rated as ex- 
erting their best effort on the job. The only 
supervisory rating unrelated to work climate 
appears in the area of worker reliability, where 
three negative (but nonsignificant) correla- 
tions appear. 


‘DISCUSSION 


The effectiveness and behavior of the HCU 
does not appear to be a function of the personal 
and social characteristics that he brings into 
the world of work (such as his motivations and 
attitudes toward work, his prior work history, 
geographic mobility, education, scholastic 
skills, family structure, and prior police record), 
nor are these characteristics and their resultant 
behaviors readily susceptible to change through 
training programs focused on adaptation or 
enculturation. And, since positive HCU atti- 
tudes toward work were uncorrelated with 
higher job retention or job performance, there 
is no indication that improved job attitudes 
(through more powerful social skill training 
programs) would result in improved job per- 
formance or retention. 

Rather, the work effectiveness and behavior 
of the HCU depends predominantly on the 
social climate in which he is placed and works. 
Increasing the supportiveness of this climate 
seems to be a major avenue for increasing the 
HCU’s performance and retention. The imple- 
mentation of this change lies less in increasing 
the HCU’s competence than in exploring the 
absence and lateness he shows in response to 
his climate. 

These findings are neither unique to this 
study nor to the Program studied. Research 
on JOBS NOW, a Chicago program similar to 
AIM-JOBS, indicates that participants who 
were successful and unsuccessful on jobs 
could not be differentiated on the basis of age, 
education, IQ, marital status, police record, 
number of previous jobs, or length of previous 
employment; the sole factor that differentiated 
successful from unsuccessful HICU persons at 
statistically significant levels was the degree 


of support within the immediate organizational 


context in which the HCU was placed. The 
retention rate for “high support” organizations 
was 82%, while that for “low support” organ- 
izations was 28% (JOBS NOW Project, 1967). 


The HCU’s prime method of coping with a 
work climate that he perceives as highly un- 
supportive (and his supervisor perceives as 
highly supportive) is through his own un- 
reliable behavior—being late and/or absent 
from work. HCUs who are reliable on the job 
find the situation intolerable after a short 
period of time and leave the organization. 
Others cope with the unfavorable climate by 
being tardy or absent. Furthermore, our find- 
ings indicate that the characteristics that 
supervisors were most critical of in the HCU 
was the HCU’s lack of reliability, a finding 
replicated in a number of other companies 
(Johnson, 1969). Tardiness and absence are 
characteristics frequently ascribed to the black 
HCU culture, but these same characteristics 
can also be viewed as indexes of escape Or 
avoidance behavior from a climate the HCU 
experiences as intolerable. 


The failure of attempts at adaptive training of the 
HCU should come as no surprise to those familiar with 
the International Harvester study (Fleishman, 1953). 
Tn that study, it was found that after training, foremen’s 
attitudes and values reverted to those dominant in their 
back-home organizational climate—particularly those 
of their supervisor. Both studies indicate the impotency 
of training when the organizational climate does not 


support values inherent in the training. As Fleishman 


notes: 
Existing behavior patterns are part of, and are 


moulded by, the culture of the work situation. In order 
effectively to produce changes in the foreman’s [HCU’s] 
behavior some changes in his “back-home-in the plant” 
environment would also seem to be necessary. The 


Jone cannot do it [p. 2157“ 


training course @ 
to which a program 


The greater the degree 
actice and structure, 


is tied in to company pr 
the more effective that program will be. Rosen 
and Turner (1970), for example, found that an 
orientation period conducted by members of 
the hiring organization who were able to in- 
tervene between hard-core and first-line super- 
vision in problem areas was significantly more 
effective than a quasi-therapeutic program 
focused on individual adaptation jssues and 
conducted by a university. 


sie 

4 What is in brackets is obviously not Fleishman’s, 
but rather our addition to his statement. It might also 
be noted that in the Fleishman study, some temporary 
changes were noted in foreman’s attitudes as a result 0} 
training, while no such changes occurred in the current 
study. The assumption is that HCU attitudes are more 
culturally determined and, therefore, more resistant to 


training mechanisms. 
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The supervisor seems to be a key element in 
the concept of social climate as it effects the 
HCU. Goodman (1969) comments on the 
added time demands placed on the supervisor 
and adds that failure of the supervisor to re- 
spond to these demands increases the trainee’s 
dissatisfaction and chances for turnover, 
Lawrie (1969) notes that the white supervisor 
often erroneously assumes that the black 
knows the ropes. When the black’s own ap- 
prehensions around entry into the white world 
of work result in tardiness and absence, the 
supervisor withdraws whatever support he 
initially provided. 

Morgan, Blonsky, and Rosen (1970) found 
a decreasing acceptance and support by super- 
visors of an HCU program as it Progressed. As 
causes for this decrease, they mention lack of 
support of the program by management and a 
consequent uncertainty by supervisors re- 
garding policy and guidelines in dealing with 
the HCUs in the program. Hodgson and 
Brenner (1968) report special supervisory edu- 
cation is desirable where trainees have been 
identified to supervisors as hard-core before 
going on the job. Goodman (1969) reiterates 


-this need, specifically in order to reduce the 


fears and stereotypes held by HCUs and their 
fellow employees toward each other. In a 
review of a number of case examples, Dunlop 
(1969) notes that the attitudes of supervisors, 
fellow-employees, union Tepresentatives, and 
top management may be as important for 


~ managers, supervisors, 
—and not expecting only 
the necessary adaptations 


focus suggests a change p 
mutual adaptation, through 
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attitudes and values of the predominant white 
middle-class culture and the HCU culture. In 
an extensive literature review, Amir (1969) 
found that such an integration process is 
facilitated (a) when there is equal status con- 
tact between members of the two groups, (b) 
when the contact is between members of a 
majority group and higher status members of 
a minority group, and (c) when an “authority” 
and/or the social climate favor and promote 
intergroup contact. These findings reiterate 
the need for programs and interventions that 
focus on the total organizational system rather 
than on only one type of person (the HCU) 
and programs that ascribe equal status to all 
related parties rather than treat one party 
(the HCU) as deviant.* 

Since most HCU programs focus on changing 
the HCU to adapt to the predominant white 
middle-class structure, they would seem to 
proceed on the assumption that the culture 
of the HCU is defective and that the HCU is 
accordingly deviant, Wellman (1968), for ex- 
ample, describes a program similar to AIM- 
JOBS as follows: 


The desire for work 
problem was what 


of no consequence in obtaining a 
ant, that they pretend they, 
responsibility for their heng 
the program] didn’t deman 

much of the men: only that they become white [p. 13]. 

In a strong cri 
Valentine (1968) 
tion back to our 


ticism of such programs, 
Suggests we turn our atten- 
dominant Social structure as 


roblems of the e black, 
rather than Proceed with actio Poor blac 


ograms keep our 
and his “defective 
the overall social 
Ture GF are all a part, 

It is within HCU Programs such ag the one 
reported here that these Cross-cultural issues 
at acted out by both the dominant culture and 
the disadvantaged minority,” The results of 


“core unemployed connotes 
€, chronic type. 


EThe very term hard 
deviance of a very durab] 
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“Sponsored orientation 
ng approach, proved to 


fferentiated from 


University 


c orientation proved 
which, although in- 
havea dynamic free- 
“supervision character, 
al hires in terms of turn- 
teeism. It was concluded 
be superior to supportive 
e hires provided with such 
hired who met normal hiring 


tervention-with. 


effective transition 


for the hard-core (Anony- 
mous, 1968). Unf 


ately, there are few, if 
aluations of the success and/ 
of the many orientation efforts. 
hard-core hirin 


documented statements of 
almost impossible to find. 
Teported is one attempt ® 
evaluation of hard-core 
t should be pointed out, 
blems involved in design- 
8 a well-controlled, quasi- 
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a formal presentation 
’» @ brief discussion © 
and their impact is 


tudy 
untered 
necessary, 


he most critical 
he mat 
anded by 


Problems encount- 
ter of implementing 
the design in an on 
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ered was re 
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y training Personnel. Only after the 
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Program was well underway was there a realiz- 
ation that the informational orientation had 
become far more than intended: Discussion 
was encouraged using the information presen- 
tation as a springboard. Individual counseling 
of problem employees, designated as such by 
line supervision, occurred frequently. And, in 
many cases, training personnel interceded on 
the behalf of the employee in the relationship 
between first-line supervision and the hard-core 
employee. What had been designated as a 
Passive placebo appeared to have become a dy- 
namic, personalized treatment condition. As a 
Consequence, this study became less a test of 
the impact of quasi-therapeutic style orienta- 
tion per se and more a study of how effective a 
Company-run orientation could be, given com- 
mitment, latitude, and innovation on the part 
of the training staff. 

Another critical problem encountered, and 
one characteristic of many research efforts 
where intervention is used (Webb, Campbell, 
Schwartz, & Sechrest, 1966), relates to the fact 
that any modification of an organizational en- 
vironment has the potential for being reactive. 
Bringing hard-core into a plant as special em- 
ployees, requiring extensive off-the-job sup- 
port time for them and their supervisors, well 
may have communicated to first-line super- 
vision the company’s commitment to “make 
the program work at any cost.” Absenteeism 
data (presented in Tables 2, 3, and 4) suggests 
that a leniency occurred regarding absences for 
the hard-core in comparison to the normal hire. 
Consequently, differences between hard-core 
and normal hires probably underrepresents the 
true magnitude of the difference—particularly 
with regard to turnover via termination. 

One final limitation should be noted. This 
study focused on job-stability characteristics 
of hard-core. Any attempt to generalize from 
these findings to results of other studies and 
programs is hazardous. Not only would general- 
ization be limited by lack of comparability in 
intervention strategies imposed, but it also 
would be limited by lack of comparability 
among the focal groups. Unfortunately, in 
many programs, the hard-core designation re- 
fers to urban, minority group members—usu- 
ally black—unemployed at the time of hire, 
that is, persons that company personnel officers 
privately confide are not really different from 
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the general run of new hires in a tight labor 
market. Hard-core in this study refers to per- 
sons having long histories of unemployment for 
a variety of reasons (some of which relate to 
being black), and therefore represents a diffi- 
cult group to retain in employment. To com- 
pensate for this lack of comparability, job 
stability was compared not only between the 
two intervention groups of hard-core but be- 
tween those groups and normal hires employed 
during approximately the same period and as- 
signed to similar, unskilled positions. 


METHOD 


Subjects. Forty-nine Negro Ss (males), who had 
been residents of the urban community in which the 
sponsoring company was located for at least 1 yr., made 
up the hard-core sample. Hard-core unemployment was 
defined essentially in terms of work history. To be per- 
mitted to enter the program, the applicants could not 
have held any single job for more than 3 mo. and had to 
have total employment of less than 6 mo. during the 
2-yr. period prior to hiring. Educational level, selection 
test data, and criminal record (except for crimes of ex- 
treme violence) were ignored. The Ss did, however, have 
to be evaluated as physically able to work by the esti- 
mation of the company medical staff. The Ss were ob- 
tained from welfare roles, supportive agencies, and from 
day-labor lines. Normal hires were defined as walk-in 
applicants who met the typical hiring standards of the 
company. This second group happened to be one-third 
black and, like the hard-core, were assigned to manual, 
unskilled level jobs. 

Setting. The employing company was a public util- 
ity. The company was located in a large, Midwest urban 
center that, during the year prior to the study, had ex- 
perienced severe race riots. A large proportion of com- 
pany employees were technicians rather than unskilled 
workers. Most new nonprofessional or clerical em- 
ployees, however, were assigned to an unskilled labor 
pool. The company, as a matter of policy, viewed the 
labor pool as a resevoir for future technical personnel 
rather than as a stable, unskilled work force. It actively 
encouraged job bidding and training for higher skill 
positions. At the time of the study approximately 10%, 
of the work force was black. 

Procedure. The hard-core sample was divided into 
two groups. Group designation was determined by 
matching in terms of demographic variables obtained 
during the hiring interview.’ Orientation for both groups 
consisted of 21 hr. of exposure in 14 sessions over a 12- 


wk. period. 


3 Critical variables included age, education, number 
of dependents, and criminal record. As far as possible, 
job assignments were balanced so that any given super- 
visor would have an S from each type of orientation 
group assigned to him. 

‘For the first 4 wk., two 14-hr. sessions were held. 
For the next 4 wk., there was one session per week. 


have good potential on the basis 
from their immediate Supervisors, 
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TABLE 1 


TURNOVER RATE AND AVERAGE LENGTH or Esproy- 
MENT BEFORE TERMINATION FOR Nomar Hires 
AND UNIVERSITY- AND COMPANY-TRAINED 


Harp-Core 
Uni- Com- 
ver- 
Nor- | Hard sity | Pany 
Measure mal | core hard- | bard- 
core | Core 
Original N 49 49 24 25 
No. terminations 12 16 11 5 
% Terminations 24.49 | 32.7 45.83 | 20.0 
Average no. days be- 
fore termination (5- 
day week) 59.15 65.45 | 56.25 


University-developed training was conducted with 
three groups of eight Ss each. The training style used a 
quasi-therapeutic approach, Although the direction was 
in terms of job-related problems, 
permitted in group meetings, The groups were led bya 
white, clinical Psychologist who had had considerable 
experience in group psychotherapy with mixed racial 
groups. 

The company-developed training was directed by a 


white member of the personnel division of the company, 


y the training 


is, essentially a 
1es—it was discovered that 


dy design. 
» three types of criterion data were 


m company personnel records and 
-core and the normal-hire 


Resurrg 

At the tenth week after hiring, 44 hard-core 

remained employed. Tp terms of supervisor 
During the final 4 wk., a Sessi 7 very 

iets Ata, a ton was held every other 


conducted on company 
the employees were paid their normal TEN mee we 


ë The company had a Policy th, P 
sige at per 
merit increment for employees ie Permitted a small 


Were Considered to 
of recommendations 
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evaluation for a pay increment, 57% of the 
hard-core were considered worthy of the merit 
raise. The company-trained group had = 
within this category and the university-traine 

group had 48%, Although the advantage wie 
not statistically significant, it was the first ue 
dication in the study of the possible spenar 
of the company-sponsored training over tha 
offered by the university, 

At the end of 6 mo., 34 Ss, or 69% of the 
original hard-core hires, remained on the com- 
pany employment rolls, At this point, woe 
data for the two hard-core training groups an 
the normal-hire sample were compared. From 
Table 1, it can be noted that the hard-core as a 
Composite had a turnover rate, which, although 

igher, was not significantly different from that 
of the normal-hi 
trained group had a significantly higher (/ 
= .01) turnover Tate than did either the ees 
Pany-trained group or the regular hires. Ter- 
mination time, approximately 12 wk., did not 
differentiate among the three groups. 

In comparing the three groups on 6-mo, au- 
thorized and unauthorized absence rate (com- 


total time worked), no Significant differences 
were found in terms of authorized absence (see 
- The university-trained group had the 
ighest rate, In terms 6f Unauthorized absence 
rate, however, significant differences occurred. 
oth hard-core sroups had a significantly 
= .01) than diq normal hires. 

Mor , the university-trained group had a 
Significantly higher (6 = .05) rate than did the 
company-traineq group. 
Breaking down the absenteeism data in terms 
of stays and leaves for the three groups, Table 3 


TABLE 2 
Comparison OF AUTHORIZED 


AND UNAUTHORIZED AB- 
SENTEEISM RATE FOR To 


TAL SAMPLES oF NORMAL 
Hires, Universiry-Tpa. 


INED Harp-Core, AND 
ComPANY-Tramen Harp-Core 


Measure Normal University | Company 


hiresa trainedb trained“ 
% Authorized 3.52 6.67 4.43 
% Unauthorized! 5 '50 11.83 6.48 
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TABLE 3 
COMPARISON OF AUTHORIZED AND UNAUTHORIZED AB- 
SENTEEISM RATE FOR THosE REMAINING 


EMPLOYED AFTER Sıx MonTHS 
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TABLE 4 


COMPARISON OF AUTHORIZED AND UNAUTHORIZED AB- 
SENTEEISM RATE FOR THOSE TERMINATED 
DURING THE FIRST Sıx MONTHS 


Meas Normal University Company Normal University | C 

Se hires* trained? trained? Measure hires® pana od 
7% Authorized Ar À 7.93 4.85 % Authorized 7.88 6.53 3.80 
Jo Unauthorized 1.95 6.79 4.44 % Unauthorized 6.33 16.44 12.99 


Note.—Comparison is in terms of percentage of time lost. 


aN = 37. 
bN = 13. 
eN = 20. 


presents the data for the stays, that is, for those 
remaining employed after 6 mo. The rates of 
authorized and unauthorized absences were 
higher for the university-traine 


for the company-trained group, 
significantly 50. Both hard-core trained groups 
had significantly higher ($ = 01) unauthorized 
absence than did the normal hires, and both 


were higher, but not significantly so in au- 


much the same for the 
although the 


ism rate was considerably higher than for the 
nonterminated employees Table 4). Uni- 
versity-trained and company-trained hard-core 
were not significantly differentiated from one 


another, either in terms of authorized or unau- 


thorized absence- In terms of these variables, 
however, the hard-core groups did have signi- 
ficantly higher (p= 01) unauthorized rates 
than did the normal hires No significant differ- 
ences were found among the three groups Te- 


not absenteeism 
6 the rate of ab- 


that is, exit inter- 


the study, 
it appeared 


psp AE, 
6 From other data in | 
ing session tapes, 


views and foreman train 


CoMPARISON oF UNAUTHORIZED AND Av’ 
rv NORMAL Hires, UNIVERS 


yty-TRAINED, AND 


Note.—Comparison is in terms of percentage of time lost. 


senteeism for stays versus leaves in the three 
groups were analyzed (see Table 5). For all 
groups, unauthorized absenteeism was sig- 
nificantly higher for the leaves than the stays 
p= 01). For the terminated employees in the 
hard-core training groups, the rate of unau- 
thorized absence was significantly higher (p 
= 01) than authorized absence. 


DISCUSSION 


Keeping in mind the variance from the in- 
tended design, particularly with regard to the 
company-sponsored orientation, two impor- 
tant aspects of the results stand out. The first 
relates to the comparability between the hard- 
core (taken as a totality) and regular hires in 
terms of turnover, in spite of the significantly 
higher absenteeism rates for the hard-core. 
The second is the clear advantage in terms of all 
of the criteria used of the company-trained 
university-trained hard-core hires. 
of the hard-core versus the normal- 
summarize the impli- 


versus the 

In terms 
hire findings, one could 
that absenteeism was a significant variable in termina- 
tion. Company records indicating formal reasons for 
termination supported this conclusion. 


TABLE 5 
THORIZED ABSENTEEISM RATE FOR 


Srays VERSUS LEAVES 
Company-TRAINED GROUPS 


a A 
| 
i 


Measure 


Leaves Stays 


Stays 


Total hard-core 


University trained Company trained 


co Unauthorized 


y 7 6.33* 6.44 
% Authorize 95 7.88" 5.62 


Leaves Stays Leaves Stays Leaves 
15.36* 7.93 16.44* 4.85 12.99* 
5.67 6.79 6.53 444 3.80 


wrote: Comparison in terms of time lost. 
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Cations of the results in the following manner: 
If hard-core unemployed are given extensive 
orientation training, particularly if such tram 
ing is job oriented in content and directed by 
company Personnel, and if company Standar s 


The key to developing job stability among 


tended periods of unemployment, Particularly 
i to be extin- 


of time. The Comparisons of the two hard-core 
Orientation @pproaches shed additional] light on 
adaptation might best be brought 


about, 
The on] PParent ady antage the University 
orientat: ad, a compared 


3 Professional Stature and ex 
perience he entation leader jn using 
quasi-ther Peutic, Participative Sroup tech 
Niques, i 


Id not a Priori be evaluated as 

a Superio Cculturation Strategy over the free 

'SCussi 2 Persona] unseling, and Interven 

ton with Ston used in the Company-orj 
€ntation dures, 

On the other hand, the co Pany strategy as 
It was ™plementeg had so; obvious advan 
tages over the Universit Proach. First, the 
Subject matter Cover © Sessions wag job 
oriented and Crucial ț ceptable work $ 

avior. Benefits deri om Working for the 
‘ompany Were Spelled out ai a 


ind penalties for their infractis $ 
ized time and again. 7 € ra 
ules and Penalties also wag isc 
1 Contrast, the University traj 
leit own content, much of wh 
ctly related to their Jobs—o, 
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A second accrual of the company raed 
the direct access the training supervisor ha 2 
the organization and the fact that he repr 


and give him aclear Picture of the ie eee 
breaking, but could and dic 
intercede with the employee's supervisor oR 
the employee’s behalf, In a sense, company 
ining included a liaison function that was 
absent from the universit y-orientation training 
and may have played a critical role, 
i the university-spon- 
Stroup orientation 
raises Several critica] questions, First, is it ad- 
visable to ave orientation initiated by a source 
outside the company organization? Second, is 
there utility in using a clinically-derived ad- 
Justment technique for hard-core employees in 
i -behavior criteria used, 
rnover, and ab- 


ed for special 
ot orientation, the 

a distinct disadvantage regardless 
ure and experience, He is 
in company practices 
ate exposure and 
ain is a set of 


Assuming for the Moment the ne 
ion in t 


differentiateg in terms of 
Importance and orcement, Second, he is not 
1n a Position Mtercede for the employee 
either to clarify y le-eng, 


Orcement Standards of 


ogic, there is also a MELT 
© merit of using a quasi- 
Proach. The goals of such 
make a good deal of 
of common problems 
e SUPortive but should lead 
IC group. €rived solution. In 


Ey 


t 


, 
& peutic group treatment, 
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turn, assuming that the problems are job re- 


; lated—even if indirectly so—the trainee should 


benefit by being able to cope with his adjust- 
ment problems more successfully. Theoretic- 
ally, he should be less likely to quit because of 
the group support and should Jearn to modify 
his behavior and thereby reduce punitive action 
from the organization. 

A critical assumption, however, is that job- 
related problems will be salient ones for him 
and will be the problems he and his group will 


discuss. This assumption did not hold true for 


the group studied. Job-related problems were 
s induced by being 


lost in a mosaic of problem: 
black and living in the ghetto. 

Although, ideally, personal adjustment may 
be desirable for job success, its development 
may be too amorphous and gradual to help in 
coping with ongoing, job-related problems. To 
remain employed, the employee, hard-core or 
not, must conform to avoid dismissal and must 
accept to minimize self-termination. These ad- 
justments must be achieved in a relatively 
short period of time regardless of his overall 
personal adjustment. The quasi-psychothera- 
as implemented, did 


not seem to achieve these ends. 
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SUMMARY AND CONCLUSION 


In terms of the findings of this study and the 
conditions under which they were generated, 
there is little doubt that hard-core orientation 
can be advantageously accomplished, in terms 
of the criteria used, by: (a) introducing a con- 
tent structure in orientation that focuses on 
work-related materials, (b) having the program 
administered and conducted by members of the 
hiring organization who are not only knowl- 
edgeable about company practices and policies 
but who can intervene between hard-core and 
first-line supervision in problem areas, and (c) 
using a part of the orientation period to cope 
with and counsel problem employees. Extra- 
organizational intervention, particularly using 
a quasi-therapeutic group strategy, does not 


seem to be an optimal or viable alternative. 
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EFFECT OF INDIVIDUAL, GROUP, 


AND INTERGROUP RELATIONS 


ON ATTITUDES TOWARD A MANAGEMENT 
DEVELOPMENT PROGRAM 1 


his own career Prospects. ‘The 


the program also depended heavily on his status 
as an employee, trainee, or officer, 


Almost any 
tinually face t € problem of 


Ways prepare people for position 
responsibility in the future, 


interchangeable functions 
© provide Organizations with in- 
o are Properly Prepared to meet 
of Specific roles 
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: the organizat 
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From the Standpoint of scientific paremo i 
there may be no need to consider more co el 
plex human groupings if the individual a 
of analysis provides adequate conceptual le 
methodological tools to account for observab f 
phenomena, But if the individual level k 
analysis is not adequate, then more elabora 
Conceptual schemes are necessary. 


Pts of open systems theory er 
an important approach to thinking about ae 
ip among human units at 55 
ferent levels of analysis (Argyris, 1960; Katz i 
Kahn, 1966; Rice, 1969, Schein, 1965). Al 
daries that Serve to define the!" 
differentiate them from other 
al system. The concept of a 
veloped because n 
ystem is completely snes 
ms outside its boundary. é 

d subsystems exchang 
matter, energy, a, 


m 2 e i 
3 y, information with the 
environments (Emery i 


I Y & Trist, 1965). t 

© existence of a Management recruitme? 

and development rogram may be viewed ase 
Organization Coping with its needs for survive 
and growth by settin UP a subsystem within 
the organization to attract managers and pre- 
pare them for future Tesponsibilities. Upon 
its Creation, such & Program defines a mew 
group within the Sanization and a new wa) 
for the system to relate to its environment. 
'S trainee STUD also has relationships with 
other groups in the rganization. Initially, & 
trainee becomes a Member of the general male 


th 
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or female employee grou! s, but most often he 
is a temporary member, shortly to become an 
officer of the bank. Until, and sometimes after, 
he ‘becomes a member of management, the 
trainee may be viewed as 2 partial or tem- 
porary member of the officer corps (Glickman, 
Hahn, Fleishman, & Baxter, 1968). 

Exclusively individual level analyses of 
management development would not identify 
these various group poundaries that a trainee 
crosses during his preparation for management 
responsibilities. The function 0 
identifying, organizing, and protecting systems 
and subsystems means i 
cannot be taken lightly. Tt nearly always sig- 
nals potential threat (Rice, 1969). One would, 
therefore, hypothesize that the reactions of 
organization members to & management de- 
velopment program wou i 
only of individual level variables 
group level variables. 

The objectives of this stud 
test the foregoing hypothesis, 
ways to enrich and elaborate the general hy- 
pothesis, an to follow UP a study by 
Argyris (1954) in the same bank. 

Fifteen years earlier i 
a tendency for 
develop. and ret 


slightly nervous, Very 
g <e orders, and relatively unwilling 
to give orders. This person tended to have 
+ security; stability, predict- 


ability, isolation, and the avoidance © aggres- 
in others. Since Argy™s 
] that 


Operation of the Program 


The actual administration of the program 
was learned about gradually during the study- 
The procedures were common practice, but 
they were not common knowledge except to 
the few people who had direct responsibility 
for administering the program. There was, for 
example, nO written document outlining the 
procedures. Tnitial screening of applicants for 
the program, often through interviews On 
college campuses, was done primarily by two 
of the first members of the training program 
who were in their early 30s. After initial screen- 
ing, some candidates were invited to visit the 
bank for further interviews. At the main 
office, they faced a series of interviews that 
always included the President, the Head o 
Personnel, the Executive Vice-President, and 
several Senior Vice-Presidents. After the day 
of interviews, the interviewers discussed their i 
impressions of the candidate, sometimes tO- j 
gether and sometimes by being polled by one — 
of the program administrators. Usually the | 
decision of whether an employment offer was 
to be made was communicated by the man who 
had done the initial screening of the applicant. 

Tf a person received and accepted an offer, he 
began 2 33-wk. program containing severa 
components. First, he was to spend some time 
‘or departments of the bank, 


in most of the ma} 
their respective operations. Each 


learning 
department designed its own component of the 
program. During the 33-wk. period, each 


trainee Was assigned & “big prother,” who was 
a graduate of the training program. The big 

3g role was tO aid the smooth function- 
ing of the junior man. He helped schedule 
and when the trainee would be; he 
shared whatever information he had about the 
ctive departments; and he saw key people 
in the various departments to obtain an 
valuation of his trainee’s performance in the 
departments: Tt occasionally happened that 
terminated at the end of the 
training period if their evaluations wer" 
uniformly poor Otherwise, & trainee woul 
then be assigned to @ full-time position. 
usually took at least 18 mo. until a trainee Wi 
made a membet of management after he hi 
completed the orientation phase. 


trainees were 


304 Crayton P. 


METHOD 
Procedure 


Five different approaches to data collection yas 
used to provide information about the eee 
development program: (a) individual ae D 
signed to cover a wide range of work attitu i ) 
a fixed alternative questionnaire covering a number Ke 
satisfaction issues, (c) group interviews using the 
individual interview questions that were appropriate to 
ask in a group setting, (d) a fixed alternative ques- 
tionnaire asking specifically about the management 
training program, and (e) group feedback sessions in 
which the results of the preceding data were pre- 
sented and discussed with interested members of the 
organization. y A 

The individual interviews, general 

ionnaires, and grou interviews wer 
Reap in the ai ie of the Study, Individuals 


people. The groups were composed so that 
and employees would not be in the same s; 
than this Constraint, assignment to intervi 


happened in the interview after it w 
interviews Provide two Opportunities 
found in individual interviews. First, 
Setting in which the participants can stj 


ual and group interviews, questions 
does one i 


understand jf e is to capture the essence of this 
organization?” led 

ment development 
to write in comments on th 
that they would like to ha: 


tl ve an Opportunity to answer 
questions about t 


he program, 
‘actions a number of addi- 
tional methodological tacti 
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learned from the organizational study. After all a 
data had been collected and analyzed, two panera 
Prepared for the organization, One focused p ike 
on the management development program, me con 
other covered the organization as a whole. Af ter the res, 
management group had reviewed these pores i 
gether with the investigator, they decided to me aa 
reports available to any officers who wished 2 aa 
them, provided that these people would then A the 
some group discussion meetings after they had rea hid 
reports. These groups consisted of 6-8 people ded 
shared common organization positions. No one atten is 
a session in which his immediate supervisor W Fs 
Present. These meetings provided an bs sia a 
enlist the members of the organization in explain 
the various findings uncovered by the study. 


Response Rate 


At the time of the study, the bank employed 120 
proximately 700 people of whom approximately 
were officers (the title given t 


2 of 
made a member of the management team). ape tbe 
the employees were women, while all but five o! 
Officers were men, T 


trainee, 7 officers, 27 male me 
ale employees returned the que 
Swering the attitude items. 


Resutts 
Factor Analysis of Altitudes 


The 24 attitude Statements about the pro- 
gram were factor analyzed by the principal 
components method. All 


y about the program, four 
ere included in the ques” 
a © Person’s perception of 
1S career promise and his Pay satisfaction. 
analyzed along with 
e program, 

actor analysis and rota- 
nterpretable factors about 
Wo additional factors about 
pay Satisfaction an 


mm 
l career prospects as show! 
in Table 1. All of the items on the questionnaire 
resulted in loading. 


8S of .40 or higher on a factor. 
might be characterized as 


A 


- 
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TABLE 1 


PRINCIPAL COMPONENTS FACTOR ANALYSIS AND VARIMAX ROTATION OF STATEMENTS 
FROM MANAGEMENT DEVELOPMENT QUESTIONNAIRE 


Statement 


1, The Management Development Program is making an im- 

2T portant contribution to the management of the bank. 

» The bank should give higher pay and responsibility to 
trainees if necessary to keep them from being drawn away 
by more attractive positions with other organizations. 

The Management Development Program has made the 
bank a more attractive place for me to work. 

. On the whole, the trainees are a bright, competent group. 

. The Management Development Program does not provide 


effective training. _ 
. Trainces are being given positions of responsibility 
before they have enough experience to be able 


„ „to handle them effectively. 
7. The Management Development Program has 


outlived its usefulness. 
MW Trainees tend to have less loyality to the bank than others. 
. Trainees are given undeserved preferential treatment. 
. Once a person is selected for the Management 
Development Program, he is almost guarantee 
to move up the ladder rapidly in the bank. 
. In the future, it will be increasingly difficult for 
nontrainees to reach positions of senior 
_ responsibility in the bank. s 
Trainees are more willing to use the bank for their 
own personal gain than other organization members. 
. Some trainees have a great deal of influence with senior 
management—more than their positions justly. 
. There isa tendency for trainees to have closer 
relationships with each other than with others 


in the bank. X 
. Trainees tend to associate mainly with other trainees. 
` The standards for evaluation by the bank are 

higher for trainees than for others. 

. The Management Development Program has 
made it harder for me personally to progress 
in my career with the bank. 

. There should be a clearly specified way for current 
members of the organization to be considered 
for the Management Development Krogan: 

. Cu ly there is no opportunity for pres pi 

3 peren to be considered for the Management 


Development Program. 
ai graduates of Ivy League colleges. 


a trainees are 
21i. ees has graduated from an Ivy League college, 
he doesn’t stand much of a chance of being 
considered for the Management Development Program. 
. Trainees are not given positions of responsibility 
soon enough in their careers. i 
The Management Development Program has directly 
or indirectly made it easier for me personally to 
progress in my career with the bank. 
24, Sometimes trainees are not p 
they might be in order not to | k 
. Ifcel that I have a very promising future with the 
" {feel that my own career in the 
Spout reached its peak, -ouj j 
1 feel that my compensation is quite satisfactory. 
Right now I feel underpaid for my work. 


p nè p 


23. 


‘bank. 


we 
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Factor 
1 2 3 4 5 6 A B 
—.78 | -.09 | .10| —.08 | —.16 | —.01 | 04} —.09 
—6 | -14| 00| 07] -01| -33| —07] 13 
—.56 | —.19 | —.02 | —.16 | —.25| -33| — - 
23| 107| 17| —14 | —.27| -00 “03 | 30 
s5| .26| —13} as] .o7} 01] 15] .18 
53| 44] —.22 | —02 | .o8| -13| .32] 11 
71| .28| —02| 19| .05| .09j—.01| -06 
29| 73| 13| <05] 112] —03| -18| -14 
sol (57| =i] 7| 5] —16] 34] 13 
a| 4] 42] 38] afar 28] as 
os| #9 | —21| as| 02| —10] 17] 10 
as| .68| 18] 19] 13] —.08| —10) 16 
asl 2] —.21| 02] .16] 13] 24] 18 
2} 32| —79| .05| .10] 04] ~.06 | —,05 
32] 31] —.80] i04] <04] —.03 | —.02 08 
-16| 24| 44] -05| —.13] 23| 27| 12 
32| 3] -09} 7| -07| 10) 05] 45 
os| as| —41] 54] —0s| -06| 11] 01 
a| os| a2] 72| -27| -.06/ 
"2 s| -36} <02] 67] D $6 06 
a| 30} or} 24| 68] -05| s| 16 
-14| -o| 24] 13] -.03] 63 | —.15 | —19 
—.42| —.06} .00| -02| -s| -58| —.26) a3 
.10 | —.02 2| st] 14| 01 
= e) 10 | —.02 | —:87 | —:19 
o) aof ao] 01} 00) —13] 85) -1t 
a| -17| os| 03| —:06] 14| —02| —.85 
931 Tiia] oa] ia| 08) <02) 04) 85 


Note. —Each statement was answered on 


rals indicate which items were summed to, 
piste which dealt indirectly with ‘attitudes toward the program. 


an overall evaluation of the pro- 
from this factor dealt with the 
f the program to the 
d with the training 


dealing with 
gram. Items 
overall contribution 0: 
pank’s management an 
effectiveness of the program. The second factor 
had to do with the degree to which the trainees 
received undeserved preferential treatment and 
misused their privileges. There was a close 
association between seeing the trainees as 


receiving undeserved preferential treatment 


a 6-point scale in which 1 = strongly agree an 
form attitude scales. The last four items 


d 6 = strongly disagree, The italicized 
contain the pay and future prospects 


and seeing them as abusing their status. A 
person who saw the trainees as receiving special 
privileges also saw them as misusing their 
influence and vice versa. The third factor con- 
cerned the degree to which the trainees were 
perceived as a closed cliquish group. Also 
associated with this factor was an item dealing 
with the degree to which the trainees were seer 
as having to measure up to higher standards 0 
evaluation than others in the bank. The fourt 
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TABLE 2 


CORRELATIONS BETWEEN INDIVIDUAL LEVEL VARI- 
ABLES AND REACTIONS TO THE PROGRAM 


Individual satisfactions and roles 


Reactions to program 


is-| Pay "3 s 
Pree | sane | Sek | Rank 
career | faction 
Evaluation 20%% iS aa au 
Preferential treatment po . ay Oe 
Closed group Gon ait 2 
Career barrier = o5 a 
Ivy League graduates -0 i a 
Accelerate trainees al y 


Vote.—N = 263. 
te .05 (two-tailed test). 
++p <.01 (two-tailed test). 


factor concerned the degree to which the pro- 
gram was perceived to be a barrier to one’s 
own career development. The fifth factor con- 
cerned the degree to which trainees were seen 
as exclusively graduates of Ivy League col- 
leges. The sixth factor dealt with the degree to 
which the trainees were not moving fast enough 
in the organization and were barriers to each 
other in the organization. The items concerned 
with career promise and pay satisfaction loaded 
on distinctly separate factors, which were 
labeled A and B in order to underline the dif- 
ferent level of conceptualization. 


Individual Level Variables and Reactions to the 
Program 


Two types of individual level variables were 
correlated with the measures of reactions to 
the program in order to test the hypothesis that 
there was a relationship between an in- 
dividual’s state in the system and his reactions 
to the management development program. 
One kind of variable was based on self- 
perceptions and included a person’s assessment 
of his career promise and his pay satisfaction.’ 
The other kind of individual variables were 
based on a person’s actual position in the sys- 
tem and included his seniority and organiza- 
tional rank. Table 2 contains the relevant 
correlations. 

There were significant correlations between 
perceiving one’s self as having a positive future 
and positively evaluating the program, tending 


2 Based on the factor-analytic results, summed scales 
were developed for each of the scales by giving unit 
Weights to the items defining a particular factor. 
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not to see the trainees as receiving preferential 
treatment, tending not to see the program a 
barrier, tending not to see the tramees be 
predominantly Ivy League, and tending to ‘ 

in favor of more rapid mobility of the me 
There were also significant correlations © 3 
tween pay satisfaction and positively Aa 
ing the program, tending not to see the — an 
as receiving preferential treatment, tending a 
to see the program as a barrier, and anaes A 
to see the trainees as predominantly 43 

League graduates. P 

Seniority with the bank and organizations 

rank were also significantly related to some = 
the attitudes about the program. een 
service organization members were more wer) 
to evaluate the program negatively, see ba 
trainees as overly privileged, and sce 2a 
trainees as a clique than newer members of es 

organization. Higher ranking members of t s 
organization tended to see the trainees as mee 
of a clique and as less of a barrier to their own 


F he 
careers than lower ranking members of t 
system. 


Group Level Variables and Reactions t0 the 
Program 


Group level analyses of reactions to the Pt” 
gram were performed in two ways. The mean 
Scores on each of the attitude scales were come 
puted and compared among the officers, 
trainees, male employees, and female em 
ployees. A multiple discriminant analysis W35 
also performed using these same data. 

Table 3 contains the mean values on the 


TABLE 3 


Mean Factor SCORES ACCORDING TO 
EMPLOYMENT GROUPS 


Group* 
X factor scores le 
emal 
Officers | Trainees | Male rnp T 
s s veesjemploy 
(= 80) | u = 22) |employecs enn 200) 
Evaluation = 
Preferential a aa sates 
treatment 14.10 5,59 14.08 
Gareer baner | S10 | zis | 308 
Ivy League na gas pies 
graduates 7.95 5.55 7.82 
‘79 5.16 59 
2.62 3.42 2.44 
2.49 2.40 1.36 
— i 
* Overall significance of i = 001 for Ai 
mean factor scores mean difference was p = 2008 with 
df = 3,359, Sores but pay satisfaction (ns) by F te 


f 


D 


= 


in the way they evaluated the program. More importantly, 


ATTITUDES TOWARD A MANAGEMENT DEVELOPMENT PROGRAM 307 


attitude scales for each of the groups. There measure of empirical validity for the various 
was a significant difference among the groups factors that were structured by factor analysis. 
however, the group dif- 


Trainees themselves gave the program the ferences point up the fact that there were 


highest evaluation, officers were next, female rather strong group level phenomena that, in 


employees next, and male employees gave the part, determined an organization member’s 
program the lowest evaluation. There was aà reaction to the program. In some sense the 
t marked between 


significant difference among the groups in the group differences were MOS 
degree to which they perceived the trainees as the trainees and everyone else. Just being & 
receiving preferred treatment. The officers and participant in the program Jed one to evaluate 
the male employees were approximately the program more positively, see the members 
equally high in seeing the trainees aS having as less privileged, see the program as less of a 
and misusing 2 preferred place in the system; barrier for others, see fewer of the trainees as 
the female employees were slightly lower, and Ivy Leaguers, and make more demands for 
the trainees themselves were substantially rapid mobility of the trainees. The other 
lower. There was & significant difference among groups, however, had their own unique com- 
pination of attitudes about the program. 


the groups in the degree to which the trainees 
losed group. The officers Perhaps the officers had the strongest mixed 


were perceived as a € 
were highest on this perception, the male em- feelings about the program, for, next to the 
he female trainees, they gave the program the highest 


ployees next, the trainees next, and t 
employees were lowest. There was à significant evaluation. At the same time, however, they 
difference among the groups in the degree to held the strongest view that the trainees formed 


which they held this attitude. Male em- @ clique, and they rather strongly felt that the 
ployees were most firmly of this opinion, an trainees received undeserved preferential treat- 
female employees were only slightly lessof the ment. The female employees had a different 
same view. The officers and the trainees combination of attitudes, @ predominant 
themselves were substantially lower. There theme of which was undoubtedly romantic. 
was a significant difference among the groups T heir evaluation of the program was almost 
in the degree to which they saw the trainees as high as that of the officers, and they saw the 

vy League colleges. trainees as less of a closed group than the 


as primarily graduates of Ivy l 
Female employees, officers, and male employees trainees themselves did. At the same ume 


were strongly of this opinion while the trainees however, they saw the | 
were less convinced. There was also a sig- their status and as blocking women’s careers. 
nificant difference among the groups intermsof For male employees, the emerging picture of 
the degree of perceived need to accelerate the the program was almost entirely dismal. They 
mobility of trainees as & function of employ- gave the program & low evaluation, saw the 
ment group. While there was an overall sig- trainees as abusing their preferred status, and 

barrier to their own 


nificant difference among the groups, this dif- saw the program as & { 
i result of the trainees careers. Tn a very real sense the program pitted 


ference Was primarily a . r 
who showed a substantially higher demand for the trainee against the male employee and the 
id advancement than any of the other attitude results reflected this fact. 
groups, who did not appear to differ sub- To further explore the role of group effects 
stantially from each other. in determining reactions to the program, & 
multiple discriminant analysis ‘was performe: 


There was 2 strong significant relationship u l; e 
between the perception of having & promising using scores from the eight scales to dif- 
oup. The trainees ferentiate among the four groups: Multiple 

tatistical tool that 


future and employment Stony < ea as 
viewed their futures most optimistically, the discriminant analysis 1s a § 
the determines how & set of variables should be 
ate maximally among @ 


officers and male employees next, and 

female employees least favorably- There were weighted to discrimina 

no significant differences among the groups set of a priori groups. A discriminant function 

in their pay satisfaction. is a set of weights that defines a vector. Tn the 
S, normalized vectors are re- 


All of these group differences provide @ present analysi 
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TABLE 4 


MULTIPLE DISCRIMINANT ANALYSIS ar THE 
ÅTTITUDE DIMENSIONS FOR THE FOUR 
EMPLOYMENT GROUPS 


Discriminant vectors 
Attitude scales — 
1 2 3 
=, 6 
Evaluation -08 A | cs 
Preferential treatment | —.09 = H 
Closed group = ye E 
Career barrier — 33 =. |=. ? 
Ivy League graduate =~ 15 13 32 
Accelerate trainees 71 — 49 AS 
Promising future A2 82 ~ 67 
Pay satisfaction =22 24 15 
Ratio of between to > a 
3 within variance Ti | art 21* 
= 01, 
=b = 001. 


ported. These vectors are Standardized; they 
do not reflect differences in variation in the 
variables used and, therefore, are an 
beta weights in multiple regression, 
Table 4 presents the norm 
differentiating among the 
The first vector is characterized by substantial 
positive weights on accelerating the progress 
of trainees by people who saw themselves as 
having a promising future with the bank. The 
second vector is characterized by large negative 
weights on the Program as a career barrier and 
on accelerating the Progress of trainees. The 
third vector has Substantial negative weights 


alogous to 


alized vectors for 
employment groups. 


Promising future with the bank. 

he means of the three functions for each of 
the employment Sroups are shown in Table 5. 
Vector 1 serves Primarily to discriminate the 
trainees from the other three groups and 
especially from the female employees. Trainees 
would be readily characterized as wanting to 
See themselves move more rapidly up the 
hierarchy and having a Promising future, 
while the female employees would have 
the opposite views. The second vector 
Primarily to emphasize 
the officers and the tr: 


seems 
the differences between 


officers? careers, providi 
are not accelerated 
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Vector 3 serves mainly to discriminate ie 
male employees from the other groups. ue 
vector emphasized the reactions of those Me 
saw the program as a barrier to their o i 
careers and yet saw themselves as having 
promising future with the organization. k 
The multivariate analysis called arten ian 
to the role of group level effects in a way t a 
univariate analysis alone did not. Al inoue 
global evaluation scale showed substantial ae 
ferences among the four groups when 
analysis was univariate, it had consistently = 
weights on all of the discriminant inenen 
Perhaps this is a way of showing that what va 
at stake for the groups was not whether tl 


arting it 
program was good for the bank. Starting It , 


a, i u 
positively, one might make a case for gon 
self-interest as well as individual self-inter 


Combined Impact of Group and Individual 
Factors 
In addition to considering the sepata 

effects of individual and group effects a 

Person’s reaction to the program, there vid 

also the possibility of finding that there wou 

be an interaction of individual and group 
effects. In other words, a person with scoot 

individual qualities might react differently t 

the program if he was a member of one me 

than if he was a member of another group: r 

perception of a 

promise, at the individual level, and for t 

officer, trainee, and male employee groups there 

were statistically significant interactions oP; 

e 6). Trainees who perceive 

ve promising careers were Jess 

Program as a closed group, g 

bers associated mainly with 

an were either the officers or the 


TABLE 5 


Exproystenr Grours 1N 


THE Discrnnnany Space = 
—— == ne 
Discriminant axes = 
Groups oe —— 
i u $ 
a ee a te 
Officers 3.81 — 9.02 1 -24 
Trainees ~2.12 —5.64 1.55 
Male employe 4.72 —7.32 aa 
Female empl 5.82 | —6.20 dah 


t 
$ 


ge 
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male employees who perceived themselves 
to have a promising career. Trainees who 
perceived themselves to have a promising 
career were less likely to see the program as a 
career barrier or the trainees as predominantly 
Ivy Leaguers than were the male employees. 
‘These interactions underline in yet another 
way the degree to which the program was a 
psychologically different thing for the aspiring 
male employee who perceived that he was suc- 
cessful than for the trainee who perceived that 
he had been successful. 


Discussion 


There can be little doubt from these results 
that reactions to the management develop- 
ment program were a function of not only 
individual but also of group level variables. 
Individual variables such as perceived career 
promise, pay satisfaction, seniority, and 
organizational rank were related to two or more 


different reactions to the program. A person’s 


reaction to the program also turned in a large 
ent group he was 


measure on which employm 
a member of. The various employment groups 
turned out to have their own unique patterns 
of attitude clusters about the program, which 
in part also reflected the interaction of group 
and individual level variables. 

The emphasis on intergroup dynamics could 
dow the existence of intragroup conflict 
pers of the program. But the sixth 
factor among the attitude scales drew atten- 
tion to this phenomena in an unmistakeable 


way. Only trainees were concerned that they 


were not moving fast enough in the organiza- 


tion. For those s0 concerned, it was often other 
trainees who formed their career barriers. This 
point received an interesting independent 


replication jn a report on the program pre- 
dministrators 


pared by one of the program ac 4 
prior to this study and made available to the 


present author after ort had been 


his own report <, 
prepared. In it the program administrator 
concluded: 


oversha' 
among mem 


It is my opinion that our excessive turnover witht 
the middle group was caused - ++ by a frustration. ° 
opportunity within the minds of the trainees compris! g 
the middle section- The surface reasons for leaving W ere 
varied. But beneath the surface; whether realized Or 
unrealized, the middle group of trai 
the older grouP of trainees as direct compe 


similar positions. By virtue of their presences the older 


TABLE 6 


SIGNIFICANTLY DIFFERENT CORRELATIONS BETWEEN 
PERCEPTION OF A PROMISING FUTURE AND 
REACTIONS TO THE Procraw FOR DIF- 
FERENT EMPLOYMENT GROUPS 


Group 


Reactions to the 
program 


Officers | Trainees lee 
(n = 80) (n = 22) (n = 62) 


** +k 
Closed group 13 —.59 30 
* 
poa 
Career barrier —.18 —A8 07 
* 
mA 
Ivy League graduates —.01 —Al 06 


* Difference between correlations significant at P <.05 
(two-tailed test). 

Ye Difference between correlations significant at 2 <.01 
(two-tailed test). 


group, as long as they continued to demonstrate their 
ability within a particular area, pre-empted to a degree 


the movement of the middle group towards respon- 
sibilities and positions they were secking. 


Behavioral Explanations for the A ttitude Effects 


By virtue of being members of the same 
organization, the trainees and the other groups 
had regular opportunities to interact with each 
other both within and between groups. The 
process of gathering data about the program 
and feeding it back to the organization pro- 
vided an opportunity to observe how the 
various organization members behaved toward 
one another and to ask the organization mem- 
bers what they perceived in each other’s be- 
havior. These various behaviors are offered as 
some mediating variables to explain why the 
attitudes about the program came to be what 
they were. There is, of course, no claim of 
exhaustiveness for these behaviors. No ex- 
planatory behavior is reported unless it was 
observed or reported more than once. 

Mechanism 1: The meeting of the “right 
types” and the “new types” result in contrasting 
styles of dealing with aggressiveness without 
having the capacity to discuss openly the con- 
sequences of the different styles. At the time 
when Argyris (1954) studied this organization, 
he reported that the modal employee and 
manager tended to avoid being aggressive him- 
self and to avoid being in situations where 
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aggressiveness would be present. W pg the 
present study was conducted, the oe ks 
was no longer populated as exclusively bis 
nonaggressive types. Most of the a ea 
many of the younger officers and employees 
were much more comfortable with aggression 
in themselves and in others than the old right 
types. But when the two types the oe 
contrasting styles tended to block vali 
communication. : 

Ina group interview, the following exchange 
took place: 


Older “right type”: “Do people find a frustration of 
opportunities because of the training Program? 
(He was interrupted in midsentence by a trainee 
who said, “Absolutely not!) 

Trainee: “I don’t think so! I don’t think so! We have 
always had internally excellent reception of 
trainees, cooperation from people... . They’ve 
accepted them readily, happily. There has been 
no frustration.” 

Right type: “Well, I can’t 
don’t think it’s, . .” 

Right type: “In our particular department . | .” 
(Interrupted by trainee). 


Trainee: “Oh, in your area. I used to get quite a bit 
of flack from some of the lads in the back that were 
not college grads and thou 
as far as the bank was 
limited by trainees, . .”” 


quite agree with that, I 


ght that their potential 
concerned was severely 


This kind of pattern could 


often be observed 
when the contrastin 


g styles encountered one 
about which both had 


» but that would 
l naggressive type more, and he, 
in turn, would withdraw further. 

Mechanism 2: By means of a self-fulfilling 
prophecy the inler personal pallern was almost 
institulionalized, Although the trainees did 
not talk with their antagonists about the prob- 
lems, they did talk with each other. New 
trainees were warned to expect trouble getting 
information in certain areas advised 
to be more aggressive with the 
resistance. 


and were 
1n order to cope y 


Graduate trainee: “I think Possib] 


] ly there may 
few people who are gunning for 2 


f be a 
the trainees 


asa 
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group. There’s a little resentment i pene 
young guys that are coming along eee? i maybe 
going to be officers quickly. . . . So, I ne AH 
it’s a concept they may be gunning a URES 
individuals, it’s going to be up to you whether they 
are gunning or not. . .” ia 
Trainee: “You’ve got to press these people for HE: 
information. You've got to drag it out of them mé 
cause, after all, they know you're going to ae 
an officer . . . a lot quicker than they will e 7 
be... If you go into an area and you sinpli 
keep on pressing these people they’re going tö ied 
you the information as long as you show 
initiative...” , m 
Trainee: “There are people that will give et a 
much as they think you need, but you gotta 
like hell to get any more out of them. : i 
Trainee big brother (telling how he advises ar 
trainee): “Now look, X, this department is eae 
to be a difficult department for you to get i al 
and this is what’s expected of you. You're y re : 
bright guy; you graduated from Yale; I knov 
you're a fighter. You're dynamic and aggressive, 
Show them that you are. Go in there and fight. . « 


One could speculate further that by en- 
couraging the newer trainees to express their 
aggression in the departments the big brothers 
were (intentionally or unintentionally) dis- 
charging some of the intragroup conflict by 
channeling it into intergroup conflict. 

Mechanism 3: Both the most negative and the 
most empathic views of the program tended lo 
be shown without the presence of both parties to 
the conflict. One place where a person was free 
to express his views of the program was on the 
anonymous questionnaire dealing specifically 


with the Program. Some of the most extreme 
views included the following: 


ments, ds, Questions 5, 19 and 24, 
although valid objections perhaps, in some cases are 
not directed at ‘trainees’ but i n 


È als [written by 
a trainee], [written by 


and 24 express my compl isgust 
with the Program. The program is basini : ood 
€ management team, However, it is 
titude anq behavior of the trainees 
ntrainee officer in his mid-30s]. 


nt, however 
at showed that 
Were two sides t 


The same instrume 
reactions th 
that there 


» Produced some 
both sides could see 
© the problem. 


~ equities in terms of non- 
evaluations and opportunities for 
+++. Perhaps ifs wre and complete the pro- 
More were aware of the op- 


| 


j poon = and requirements of the program, and if 
i : e college education requirement was waived if other 
{aptitudes offset it, more would come forward, be 
accepted and boost morale, and ease some ill feeling 
within the bank [written by a trainee]. 

ek being a college graduate and not qualified to enter 
the Training Program and progress more rapidly in the 
bank, my instinct for survival pledges me against the 
program. Deep down inside, I know it means for a 
better bank [written by a male nontrainee in his 
mid-20s]. 

Mechanism 4: Most of the initial recruitment, 
selection, and evaluation of trainees was done 
either by former trainees or by the top most 
members of management, resulting in other mem- 
bers of management feeling as though they had 
almost no influence on the program. In addition, 
there was almost no source of public informa- 
. tion or open discussion about how the program 
operated so that people could check their 
perceptions against some measure of reality. 
As the issue was discussed time and again, it 
became apparent how the operation of the 
preceding mechanisms may have influenced 
the decisions that brought about this structural 


pattern. 


Action I mplementation 
As was men considerable atten- 
tion was given to the results of this study by 
; ank. After the 


members of manag s 
feedback meetings had been completed, a tas 


force to consider changes in the training Pro- 
med. This group was chaired by 
nel and consisted 


ome current 


trainees, SON 
, graduates of the program, a 
officers. One of the first t a 
discovered was that they W 


formation about the program. ‘As a result, the 


ay eat ss 
program administrators prepared an age i 
as: 

tion memor: 


g such facts 
(a) which colleges were 


andum providing s, 
visited for recruiting, 

(b) other methods for making A 

known 


_pank, (c) th 
Ivy League 
who entere 


outside the 
f trainees from 
of trainees 
he bank, (e) the 
number who had been on 
resign from the ) the percents 
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of retention of trainees from Ivy League versus 
other colleges. This memorandum was sent to 
all personnel in the bank, and it was reported 
to this writer that the reaction to it was very 
positive. 

In addition, the task force also made a 
number of recommendations for structural 
changes in the program. First, they recom- 
mended that more people than the Program 
Administrators should be involved in the 
recruiting interviews and that these inter- 
viewers should be assigned by the Personnel 
Department. Second, they recommended that 
the pool of people from whom big brothers 
were assigned to trainees should be broadened 
to include more people than just former 
trainees and that these counselors should be 
assigned by the Personnel Department. Finally, 
they recommended that each trainee should be 
assigned a research paper on some general topic 
in banking (e.g., “Banking by Computer— 
1980”). The opportunity to do a project like 
this, and thereby be counseled by a senior 
officer, should be opened to anyone in the bank. 
At the time of this writing, all of these recom- 
mendations had received the informal approval 
of senior management, and the second one was 
already being implemented. 
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NEED SATISFACTION TO JOB PERFORMANCE? 


JOHN W. SLOCUM, JrR.: 


Department of Management Scien 


ce and Organizational Behavior, 
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During the past decade, 
satisfaction studies of diffe 


atisfac- 
ases with each higher 


ences þe- 
( 1961) and 


» two levels of Management were 
questionnaire designed to tap 
Maslow’s (1943) hierarchical 
The results of this 
Personnel 
hough the 


y f managers, Porter’s 
results Supported his earlier find- 
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needs (esteem, autonomy, and 
self-actualization), Whils:thetower order aee ds 
security and social needs) were almost equa 
Similar results 
reported by the Opinion Research 
and Haire, Ghiselli, and 
herefore, from the evidence 
» it can be stated that job 
at each higher level of 
patterns of need satis- 
across different organiza- 
he Managerial rank. 


have been 


Relationship between Satisfaction and 
Performance 


study reported by Lawler and 
and also jn their book (Porter & 
Lawler, 1968), these researchers posit a theo- 
retical mode] that indicates that performance 
hich may jn turn lead to 
that satisfaction, 
ausing performance, is caused by 
y these researchers 
between two types of rewards. 
: » Such as Pay, promotion, 
and Security are likely to be imperfectly related 
fa Performance, i rewards are often 
Satisfaction of the 
Safety and security dis- 
3). 
` and Porter (1967) Posit the opposite 
Case for Intrinsic rewards, They comment that 
À » for example, the feeling of 
aving accompli he Something worthwhile, 
ated to performance. The 


e 
Y associated With 
‘der needs of 
cussed by Ma 
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model also indice : 
ee pe pat «Spm intrinsic 
related to job sati _Sewares: ate directly 
ship is job satisfaction, since the relation- 
a o by the amount of expected 
wards oe pn Expected equitable re- 
individu i to the amount of rewards that an 
of his job eels he should receive as a result 
eee ae (Porter, 1961; Porter & 
ion fe 3 5 5). Thus, an individual’s satisfac- 
the fre NAERON both of the magnitude and 
ah quency of rewards he receives as well 
T expected equitable rewards. 
Bente predicts a low but positive rela- 
ance “th etween job satisfaction and perform- 
teen nae of the imperfect relationship be- 
wees ormance, rewards, and satisfaction. 
systematically y few studies could be found that 
AE, Heres se i consider satisfaction of needs 
studies rel Gi managerial performance, and 
Egan? et satisfaction of needs at different 
yet b ational levels to performance have not 
A een reported in the literature. 
OF hace by Lawler and Porter (1967) cor- 
pe ae of managers’ needs with 
but sete All correlations were significant 
nail ow, ranging from .16 to 30. The 
Sider seat conclude that satisfaction of higher 
ance den more closely related to perform- 
Bes thev pape of lower order needs, 
betteen a = test for statistical differences 
thelnwe ie ations, The correlations for 
21 and j y er security and social needs were 
AEREE respectively, while the correlations 
and pe me needs of esteem, autonomy, 
respectivel pe ization were .24, .18, and .30, 
telations a . he differences between the cor- 
arian = not appear to be great enough to 
en EN conclusion that higher order 
haa ik ore closely related to performance 
( 1968 ower order needs. Porter and Lawler 
their ) a additional data for support of 
eir model. Higher order needs accounted for 
more significant relationships at the 01 level 


han lower order needs. However, they do not 
eport correlations between these needs and 
s can 


performance. Therefore, no evaluation 

be made regarding the relationship between 

need satisfaction and performance. 

i The purpose of this article is to reexamine 
he earlier findings relating need satisfaction 

to managerial levels and also examine the 

model posited by Porter and Lawler in terms 


TABLE 1 


ha OF PROBABILITIES OF THE TESTS FOR 
‘ONVERGENT AND DISCRIMINANT VALIDITY 


| Peer-superior rating 


Validity aae 
1 | 2 | a4) S76 
Convergent validity OL | 0 5 | .05 
onverge a y -01 | .05 | . 
Discriminant validity | he | malic 
tests h | | 
| i ns | ns | ns | ns | ns 
= .01 | .05 | .01 | .01 | .0: 
Test 3 (Overall)* | | | | | ; | a 
| < 01 


aW = .80, x? = 48.87. 
of performance. This article considers one 
specific prediction : Satisfaction of higher order 
needs is more closely related to top manage- 
ment performance than satisfaction of hi her 
order needs for lower management. s 


METHOD 


Data were collected from 87 first-line supervisors and 
123 top- and middle-level managers employed by a steel 
plant in central Pennsylvania. Two kinds of data were 
collected for each manager by the researcher and his 
assistant. The job performance scale used for this stud 
was a multitrait scale developed by the director of AES 
ing. The traits that the manager was rated on were: (a) 
technical knowledge, (b) functional knowledge, ; (e) 
drive/aggressiveness, (d) reliability, (e) cooperation. 
and (f) organizing ability. For each trait, the EA 
chose one of the five responses—superior, fully compe- 
tent, shows improvement, getting by, and poor. The 
raters used were S’s superior and peers. In the choice of 
the raters, the constraint was that they must have been 


familiar with the aspects of the individual’s perform- 
ance that they were asked to rate. To insure the choice 
S was asked to name three 


of knowledgeable raters, 
‘ormance best. The choice of 


pee 


The performance m 


suggested by Campbell 


below. 
Test 1. The corr 


single trait should be greate: ; 
ting of that tral 


rated by the other rater. Failure 
the high intercorre! 
this is a very strin 
by behavioral trait data. 


tween any one ra 


ment was due to 
to Lawler (1967), 
that is seldom met 


Test 2. The correlation between 


single trait should be greate 
ting of that trait an 


tween any one rai 


rs who knew his perfi 
which peer rated S was do 
easures were tes 
and discriminant validity according to the criteria 


posited by Campbell and Fiske 
interpretation of the tests 


lations among two mea: 
r than the correlations be- 


ne randomly. 


ted for convergent 


(1959). The purpose and 


for discriminant validity as 
and Fish are briefly described 


sures of a 


t and any other trait 
to meet this require- 
Jations. According 
gent requirement 


two ratings of a 


r than the correlation be- 


d any other trait 
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TABLE 2 


AVERAGE NEED SATISFACTION SCORES FOR MANAGERS 
AT THREE DIFFERENT LEVELs 


e First-line | Middle- 
Need category and item managers manesi 
1. Security need, i 1.19 
ecurity in job 2.12 
4 ial needs 
á Fona to help people TA ij 
Pportunity for friendships r . 
. Esteem needs J $: 
2% Feeling of self-esteem 1 34 ie 
Prestige inside the company 1.52 30 
{Prestige outside the company 1.44 
s tonomy needs z 
paa Pportunity for independent ? 
thought and action 1.39 119 
Authority in position ve. 1.40 268 
Phortunity to Participate in = 
goal-setting 1.53 1.17 
5. Self-actualization needs 
Pportunity for growth and a 
evelopment 1.35 97 
Feeling of self-fulfillment 211 62** 
eeling of accomplishment 1.38 41 


*p = 05 (two-tailed l test), 
™ > = 01 (two-tailed 1 test), 


aS supported by the 


e coefficients in 
e monorater-multitrait triangles at the .05 level, 


agreed on the Patterns, 

The data for both groups of employees? need satisfac. 
tion was obtained by means of the Porter (1961) ques- 
tionnaire. The questionnaire Contained 12 items that 
Were predicated Upon Maslow’s the 


ory of Motivation, 
Ss were Provided with the following form: 


Connected wit 
b. How much of the characterist; i 
nected with your job? 


Each rating will be ona Seven-point Scale 
look like this: 
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(minimum) 1 2 34567 (maximum) 


r! at repre- 
You are to circle the number on the scale that B 
sents the amount of the characteristic beine Tin 
Low numbers represent low or minimum ar 


- de- 
the amount of priedai a 
z 7 i 
ficiency of each item. Thus, the larger the eater 
between “should be” and “is now,” the grea 
dissatisfaction, 


r a classi- 
e items, Presented in random order, were pre 
fied into the five ty; 


(1961): security, 


ResuLTs AND Discussion 


The data collected from the sample spo 
the previous work done by Porter ( 1962, 1 le 2 
and Rosen (19614, 1961b). The data in Tab 
indicate that satisfaction 


‘creased 
generally increase 
each level of management, 


a 
d 
ont ake 
5 k 5 *tualiza- 
first-line Supervisors’ self-actualizé 


TABLE 3 


CORRELATIONS BETWEEN PERFORMANCE AND SATIS 
FACTION ON ITEM Scores IN Eacn NEED 
Catecory FOR MANAGERS Ar THREE 
DIFFERENT LEVELS 


Middle-, 
First-line manager 
Fi TEAN r managers’ OP as 
Need category ang item DEn perform 
ance ance 
fre 
13 Ne 
Pporty; ity to help People 16 ee 
PPortunity to develop 8 
riendships 18 j 
3. Esteem needs 0 
eeling Of self-es eem A7. 35 
Prestige inside the company AS 39 
Prestige Outside the company 19 h 
4, Autonomy needs 
PPortunity for inde, end 
thought and action Pendent 22 3 
Authority in Position 15 F 
Pportunity © Participate į S s 
goal-setting Pain «20 í 
5. Self-actua ization Needs 
PPortunity OF gri 
devel lopment” 8Wth ang 3 
Keeling of Self-fulfillment +22 
teling of accomplishment +18 = 


Was 


ificant 
. Tequired to be significant 
men sample, ¥ correlation of ‘p manager 
ARE (E castor thee 


-< -05) for 
quired to 
Sample, adie? 

. 5 i ine vs. mi 

b= 05 (two-side, à . ee 
top managers), ded test between rs for first 


„a 
aanas 
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tion category, the need satisfaction profiles 
are similar. For the security and social needs, 
satisfaction was roughly equal for both levels 
of management. The higher levels of manage- 
ment, on the whole, reported greater degrees of 
need satisfaction in their jobs than did lower- 
level managers. In no instances did higher 
levels of management express greater dissatis- 
faction than did the lower level. It should be 
indicated that for the lower order need of 
security, first-line and middle-top managers 
were not getting less fulfillment, but that there 
was a larger difference between their expecta- 
tions and their fulfillment than was found at 
the higher order need level. 

The Lawler and Porter (1967) model posits 
that an individual’s degree of higher order need 
satisfaction is related to his performance. The 
current research supports this general predic- 
tion. As predicted, the higher order needs of 
autonomy and gelf-actualization were more 
closely associated (p < .05) with performance 
than the lower order needs for poth first-line 
supervisors and middle-top level managers. 
This finding did not carry over for the esteem 
need. The data in Table 3 indicate these 
relationships. 

The highest average correlation was found 
between performance and self-actualization 
for both groups of managers. In testing for 
significant differences between the correlations 
in Table 3, only three differences were statisti- 
cally significant at the .05 level. The correla- 
tions for the middle-top level managers 
autonomy needs were statistically different in 
two need items, and one self-actualization need 
item (feeling of accomplishment) was statisti- 
cally different between the two groups in that 
need category. Thus, the prediction that satis- 
faction of higher order needs is more closely 
related to middle-top managerial performance 
than for lower managerial performance is 
supported by these data. Although only three 

ag (see Table 3) were significant, dif- 
„oms in the esteem, autonomy, 


lization categories were m the 


feren 
and self-actua 
predicted direction. 

The data collected from the sample also 
indicate that job satisfaction (the sum of the 
difference score ON all 12 items) correlated 
significantly with performance for both groups 
of managers- Although the correlations were 


not large (r = .26, p < 01, for first-line super- 
vision; 7” = a prs 001 for middle-top 
managers) it is substantially different from 
that originally reported by Lawler and Porter 
(1967; r= 3h Ps 01) in their study of 
middle and lower-middle managers and that 
reported by Vroom (1964). 

Overall, the data from’ the present study are 
in general agreement with the predictions 
forwarded by Lawler and Porter’s (1967) model 
concerning the relationship between higher 
order need satisfaction and performance. 
Similarly, the relationship between need satis- 
faction and managerial level has been sup- 
ported. Patterns of need satisfaction were 
similar across organizational levels, and the 
higher order needs were more satisfied for the 
middle-top level managers than for the first- 
line supervisors. While statistical differences 
between need satisfaction and job performance 
were found in only three need items, these 
differences were found in the higher order 
needs. This indicates that for these needs, 
middle-top level managers’ higher order needs 
are more Closely related to their job perform- 
ance than the satisfaction of higher order 
needs for first-line supervisors. 

The data also indicate that satisfaction of 
lower order needs are not correlated with 
performance for first-line supervisors. While 
statistically significant correlations were noted 
for middle-top Jevel managers in each need 
category except for the security needs, the 
satisfaction of higher order needs are more 
closely associated with performance (p < 05) 
than the satisfaction of lower-order needs. 
Since there has been relatively little evidence 
concerning Maslow’s (1943) need theory and 
managerial performance (Hunt & Hill, 1969), 
future studies using the same measures should 
be undertaken to examine the relationships 
between organizational level and performance. 
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A STUDY Or ENTREPRENEURIAL MOTIVATION: 


COMPARISON oF FAST- AND sLOW-PROGRESSING SMALL-SCALE 
INDUSTRIAL ENTREPRENEURS IN PUNJAB, INDIA ` 


P. S. HUNDAL* 


Punjab University: Chandigarh, India 


The purpose of this study was to determine the level of achievement motivation 
1 and other motivational factors associated with the differential rate of industrial 
growth of the fast- and slow-progressing entrepreneurs. One hundred and eighty- 
four small scale industrial entrepreneurs were administered tests of achievement 
motivation, attitude toward money and attitude toward labor unions, and inter- 
viewed to assess job satisfaction, job prestige, value patterns, aspirations, a0 
irations, achievement motivation, and investing 
tendency are associated with a fast rate of industrial growth, whereas, hoarding 
i ial 


tendency and optimistic tendency seem to be related to the slow rate of industri 


growth. 


‘The economic survey of small-scale industrial inner concern for achievement, 2 disposition to 
| entrepreneurs of Punjab, India (Rangnekar, engage in activities in which doing well in com- 
1966), located two groups of sharply contrast- petetion with others is important. 
ing small-scale industrial entrepreneurs in Studies by McClelland (1961), Bradburn 
Ludhiana, India, @ renowned center of small- (1960), Bradburn and Berlew (1961), Cortes 
scale industries in the country- One of these (1961), and many other investigators have pre- 
groups was engag i 5 sented a wide range of evidence indicating © 
(metal group) and the other in the hosiery and relationship between level of achievement moti- 
textile industries (hosiery group)- The exp? vation and national economic development. 
nential growth rate for th groups, Over a The relationship, however, Was of low magni- 
period of 10 yr» 195 —1961, was 32% and 10%, tude. Nations were used as units of analysis in 
respectively- Rangneka o reported that the these studies. While this type of data 1s of great 
\ difference a orowt he two ce, evidence in which the individual 
an ; 
AD E veconomiC fac t study is 2 step in this direction. 


that there might be some social and psy’ Tts main purpose 15 to determine the level of 
id growth achievement motivation 1n & group of fast- and 


logical variables influencing the rap! y i ; 
of the vntreprencurs engaged in metal-using clow-progressing small-scale industrial entre- 
industries i preneurs of Punjab, India, and also to find 

e i , development other motivational factors that characterize 


e 
d, been stressed the two contrasting categories of entrepreneurs: 
> 


Hagen, 1961; “Rosen METHOD 


\ much earlier (e-8 gi 
1961; W 1930). More recently McClel-  cupjects 
q (1961 1963) $ ht to show that the es- : 
A abe dient © trept eneursh' p js a One hundred and cighty-four small-scale industrial 
sential meree hievemen motivation entrepreneurs comprised the samp e. They were drawn 
social value alled acme from the industrial population of Ludhiana, India 


: ta nn 
nition, to at + wi vere chosen, the textile indust Jusive of hosie 
A Je also € uates it with Y osen, ustries (inclusive © osiery 

onal 8 mplishment- F a accounting for 1,390 and the metal-using industry 8 
— = akon i pn ir S71. The two categori of all 
“hi j vas SUD ported by the Asia F oundation, counting l o categories cover fo O! | 
T a pot we PE units. Once this population was fixed, 200 sample units, 
ew Oe esl reprints should be sent to P- S: Hundal, 100 from each category, were taken at random for ae 


Punjab University, study. Due to one oF the other reason, however, OMY 


«x-chol cain 
psychology: 184 units could be included in the study. 


Department A 
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TABLE 1 
we wee 
Tres 1; “In THE Next Five Years I AM GOING TO 


Point 


Typical response 
score 


7 | Manufacture top-class goods; establish a big 

factory, a big mill, or big business; have more 

than 1,000 workers in my factory; extend my 

work by having a maximum number of 

workers. p x 

6 | Extend my factory or business; improve the 

quality of goods manufactured in my factory; 

increase the ouput; get some new machinery ; 

earn more by making my business flourish. 

5 | Be a good business man (good in dealings) ; be 
a municipal commissioner; be a rich man 

(field of activity not mentioned). 

4 | Do my present business; have a better life; ex- 
tend my business if government will give a 

loan (element of doubt present). 

3 | Increase production if circumstances will per- 
mit; join some other friend in his business 

(somewhat like an inactive partner), 

2 | Be in the same position; earn enough to meet 
my daily needs. 

1 | Leave this business, 


The age of the sam een 
25 and 45 yr. with a mean and standard deviation of 
39 yr. and 10 YT., respectively. With a few exceptions, 


almost all of them belonged to the middle strata of the 


-nine percent 
jab and about 
East Punjab, 
back of the 
of the Ss were agriculturists, 


Test Variables 


Achievement motivation test. The “ 
vation” (AM) was Measur 
Rogers & Neill’s (1 


In selecting and weighing responses for t 
categories, McClelland’s (1961) descri 
with high n Ach was kept in view, 
responses showin. 
business and industry 


stry or business, 
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sented \ 
The actual scoring examples for Item 1 are presented \ 
in Table 1. 7 nea- | 
E- test-retest reliability coefficient of he ae 
sures with a time gap of about 3 mo. for 35 Ss, 
from the sample, was .84. idity coeficients, 
The item homogeneity or item validity determining 
following Adams (1960), were obtained by nd the total 
Phi coefficients between the single item ar computed 
score on the test. The phi coefficients were ception o 
for 27% high and 27% low scores. W Et a nt at the | 
Items 3 and 9, all the phi values were signi ae the test i 
-O1 or .05 level. It was inferred, therefore, aa igh an 
items differentiated well between persons with ot Items 
low degrees of AM. The discriminating are from 
3 and9 was rather low, but they were not exclu 
t ontence 
gone the scores obtained for 60 Ss by ane 
completion test of AM were compared with reClelland’s 
Scores obtained by a modified version BE ees rank 
Thematic Apperception Test (1961 TAT)” og was aS 
order correlation between the two sets of Score much in 
high as .61. The fact suggests that there ain es two 
common between the measurements made by 
tests. , 30 in- 
Attitude toward money. The test consisted ty S 
complete sentences with money as a key wort ‘vay tbe 
Were required to complete in a meaningful q EA the 
Sentence as quickly as possible. The responses invest 
test were scored in terms of (a) tendency ie (Hoa 
money (Inv T); (b) tendency to hoard moa (Soc 
T); (6) tendency to spend money on social i rel and 
W); (d) tendency to spend money on perso! z inde- 
family comforts (P Comf); (e) responses showing a e 
cisiveness and lack of confidence, conflicting > i 
(Conft A); (f) responses showing faith in e gol 
mism (Opt T). In all cases the responses were Ca a 
zed at their face value and no attempt was mí 
infer any underlying unconscious trends. vere tale 
The Tesponses for the total test of 30 items We ach 
lied under the above categories of responses for €? 


t 

Id n° 

Separately. A smal] number of responses that CON ne i 
be grouped under any of these categories were 18", 

The total numbe: 


ua 
«a divid 

zi r of responses made by an ind 

1n a particular 

variable. 


at 
tha! 
2 r r 
category comprised his score fo 
ks 
ank5 
As a check on the scoring procedure, 30 test bl % 
Were rescored by an independent judge. There was 


the 
agreement on the items scored by the writer and 
independent rater, i 


The consisten 


Cy of responses was assessed by Jatio® 
7 a period of about 3 mo. The corte" for 
Coefficient between the initial and the final scote® 3 
ifferent Categories of responses ranged betwee? the 
and 82. The retest coefficients were very low fg for 
categories Soc W and P Comf No reasons are offere 
their instal ility, 7 
The validity 


ollowing a met 


ked 
as onnree 


Ted 
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then asked to rate each of these persons on a 10-point 
scale to determine their standing on the test variables 
of Inv T, Hoa T, Soc W, P Comf, Conft A, and Opt T. 
The final scores for each S on each of the rated variables 
were found by averaging the independent rating of the 
judges. 

The set of scores thus obtained for these 30 Ss was 
then correlated with their respective scores on the sen- 
tence completion with money. These correlations ranged 
between 45 and .81, the lower values being for Conft A 
and Opt T. The low correlations can be attributed to 
error of judgment by the rater or their failure to grasp 
the complex concepts of conflicting attitude and optimis- 
tic outlook, It may also be due to some error in categori- 
zing the responses. However, all the correlations were 
positive and in the expected direction and it seems safe 
to infer that the sentence completion test with money is 
a reasonably satisfactory instrument for assessing the 
strength of the various aforementioned trends. 

A Witudes toward labor unions. Attitude toward labor 
unions (Att L) was measured by a test following Thurs- 
tone’s (1929) technique of equal-appearing intervals. 
It had 17 statements that were selected on the basis of 
the responses of 50 judges. The Q values for the selected 
items ranged between .19 and 1.74 with a mean of 1.21. 
The range of weight scores was between 0 and 10, the 
lower score denoting a favorable attitude, and the higher 
vice versa. ? 

Interview schedule. The data for job prestige, job satis- 
faction, value patterns, and aspirations were obtained 
by interviewing Ss and asking them some set questions 
designed to elicit responses in the specified area. The Ss 
were completely free to state their responses. They could 
thus express their true reactions. i , 

Their responses were then rated on a 5-point scale in 
consultation with some teachers and students of psy- 
chology who had no prior information about the growth 


rate of Ss. 

In case of job sati 
responses that stated 
tendency to stick to it. s 

Regarding job prestige, Ss were required to rate on a 
5-point scale their jobs and the activities related to their 
jobs. The total score for each S was found by adding the 
rating scores. 

The problem of quantifying the value pattern was 
rather complicated, especially because of disagreement 
about the hierarchy of the expressed values. It was, how- 
ever, decided to assign higher scores to responses that 
attached importance to growth and expansion of in- 
dustries. On the other hand, love for tradition, content- 


ment, and an easy way of life were given lower scores. 
While scoring the responses concerning aspirations, 
the highest scores were given to those responses that 
emphasized doing things in a big way, for example, as- 
piring to own a big factory, be a great scientist, a big in- 
dustrialist, a millionaire, etc. At the lower end were the 
that emphasized the routine way of doing 
things, for example, taking up a job, setting up & small 
stall or a shop, etc. Responses showing desire to close 
up business were scored still lower. ' 
Interests. Here, persons showing no interest In any- 
thing other than their work were given a score of one. 
When the interests were confined to family affairs or 


sfaction, higher scores were given to 
d liking for the present job anda 


responses 


circle of close friends, responses received a numerical 
score of two. The next higher category of responses in- 
volved activities without much intimate social con- 
tact (gardening, music, etc.). The highest scores were 
given to activities demanding more and more social 
contact (club, social and political activities, etc.). 
Thus, a person getting the lowest score has no time for 
anything other than his work, whereas a person with 
the highest score takes much active interest in social 
and political activities. 


Administration of Tests 


The entrepreneurs were interviewed by two trained 
September 1964. 


psychologists in the months of May: 

They had no information about the exponential growth 
rate of their Ss. Their work was distributed in such a 
manner that each investigator had about 50% of the Ss 
from each group. In general, two sessions were held for 
each S. For uniformity of procedure, the interview 
schedule and attitude scales were given in the first ses- 
sion, whereas the second session was devoted to sentence 
completion tests. The time required for the two sessions 
averaged about 2 hr. 

In both the sessions, when preferred by Ss, the re- 
sponses were recorded at the appropriate places by the 
investigators. The responses were then converted into 
numerical values as explained earlier. 


RESULTS AND DISCUSSION 


The results have been discussed in terms of 
intergroup comparisons between metal and 
hosiery groups and intrasubgroup comparisons 
after dichotomizing the metal and hosiery 
groups at the median of their respective expo- 
nential growth rates. Both for the metal and 
hosiery groups similar intrasubgroup compari- 
sons were made for those Ss who had been 
administered the TAT. In the third phase the 
two groups were combined and then divided 
into fast- and slow-progress groups. The fast 
group was comprised of 92 Ss, 46 each from the 
metal and hosiery groups with the exponential 
growth rate higher than their respective me- 
dian. The other 92 Ss comprised the slow-pro- 
gressing group. The intercorrelations for the 
two groups were computed and factorized 
separately by the principal axis method. In 
each case, four factors were extracted. The ex- 


4Intercorrelation matrices for the fast and slow pro- 
gressing groups have been deposited with the National 
Auxiliary Publications Service. Order Document No. 
01474 from the National Auxiliary Publications Service 
of the American Society for Information Science, ¢/0 
CCM Information Science, Inc., 909 3rd Avenue, New 
York, New York 10022. Remit in advance $5.00 for 
photocopies or $2.00 for microfiche, and make checks 
payable to Research and Microfilm Publications, Inc. 
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TABLE 2 
Means, STANDARD DEVIATIONS, AND T Ratios FOR INTERGROUP COMPARISONS _ -- 
MEANs, STAND: —=_ 
| 5 
Metal group | Hosiery group ea 
Test variable A = j z | SD | 
| | g.41** 
aci ) 13.74 3.39 9.22 3.36 
Job satisfaction | 18.55 2.70 1869 | aso Jo, Da 
Ppp iee ir 23.85 3.40 18st | 337 ee 
Neen Pa 33.85 468 | 28.55 630 | Say 
Petrie 1.90 TE | ge | a. 
Achievement motivation 77.22 5.65 | 62.30 | 7.25 eee 
Investment tendency 9.04 3.46 7.75 | 3.36 asst 
Hoarding tendency 3.00 2.40 | 4.50 2.10 379% 
Conflicting attitude 4.00 1.71 2.90 2.28 5.47 
Optimistic tendency 2.03 1.52 3.44 2.06 182 
Social work 3.10 1.50 3.50 1.45 125 
Personal comfort 3.00 1.00 2.80 1.20 rei 
Attitude toward labor unions 4.59 1.57 5.60 1.40 | -= 
*p = .05, £ = 1.99. 


++p =.01,! = 2.63. 


traction was done to the criterion of the Scree 
Test (Cattell, 1966). In the application of the 
Scree Test, before proceeding to the judgment 
on communalities and factor number, unities 
were inserted in the R matrix diagonal and 
latent roots were plotted down to eight factors 
separately for the two groups. The scree ap- 
peared at the fourth factor in each case. Com- 
munalities obtained for the four factors in the 


above analyses were used as guessed communal 
ities in the extraction of factors. The a 
were then rotated using the Varimax crite ee 
for simple structure. The Varimax factors a 
reported in Table 5. 


Inler- And Intracomparisons 


pe andard 
Tables 2 and 3 report the means, standar 
deviations, and £ ratios for the metal and th 


TABLE 3 


Means, STANDARD DEVIATIONS, 


AND T Ratios FOR INTRAGROUP 


COMPARISONS 


| 


Metal group 


| Hosiery group 
‘Test variable Above Mdn. Below Mdn. | Above Mdn Ee Mdn 
FEES E R He ae ratio j o o eratio" 
Bwe |) gwl Gs | x | 
| | io | 
Job satisfaction 15.30 | 3.20 | 12.18 | 280 ; | aca 
30 | 3. : i 4.80 
Job prestige | 19.45 | 2.70 | 17.65 | 315 | 290 18.50 
Value patterns | 25.55 | 3.10 | 2245 | 278 | s48 |2030 
Aspirations | 35.94 | 4.28 | 31.74 | 408 | 477 31.82 
Interests 180 | 1.00 | 2.00} 1.01 | ‘or | 9 
Achievement motivation | 80.30 | 5.15 | 74.14 | 425 566 | oreo 
Investment tendency £88 | 334 | 9.20 | 3.55. | ao | 4% 
Hoarding tendency 2.60 | 2.30 | 340] 243 | 160 | 4° 
Conflicting attitude 435 | 1.50] 3.65] 1.83 | 200 | 320 
Optimistic tendency 2.00) 148 | 206] 1.56 | 18 ie 
Social work 3.00 | 1.50 | 3.20] 149 | ‘e4 Pi 
Personal comfort 290) 39% | 3.40) 1.05: | too | Soe 
Attitude toward labor | oc i i 
ions | 4.28 | 4.9 4 3 
unions | 1.20 490 | 1.25 2.38 5.20 
ALP = 05,1 = 201;at b = 01,1 = 2.68, a ee es 


TABLE 4 


MEANS AND MANN-WHITNEY U VALUES FOR THE ABOVE- 


AND BELOW-MEDIAN SUBGROUPS 


| Metal group Hosiery group 
Pest variable xX | Xx xX | i 
above-Mdn. | below-Mdn. U value | above-Mdn. below-Mdn. U value 
E group | group group group 
Job satisfaction 1265 | 10.80 3 | 5 58** 
Iy prestige 23.24 14.66 60** Hs Ee Ger 
Value patterns 26.33 | 15.67 45% 22.50 14.43 53+ 
Aspirations 38.60 23.40 38 35.18 21.64 ae 
Interest 200 | 9% 2.50 235 sot 
Achievement motivation | 54.13 | ages 72.30 52.27 36 
Investment tendency | 6.20 FY asus 11.84 4.53 pe 
Hoarding tendency | 228 6s* | 500 4.30 63** 
Conflict attitude 3.26 735* 4.22 2.30 70* 
Optimistic tendency | 3.00 70* 4.20 2.88 66* 
Social work | 2.00 ore | 3.66 3.52 69* 
Personal comfort | 1.63 54 | 3.38 2.83 74* 
Attitude toward labor | ” 
„Unions | 5.62 58** 6.13 5.21 530% 
Thematic Apperception 
Test 8.20 29 | 6.85 3.15 


* p = .05 (two-tailed test). 
gep = 02 (two-tailed test). 
> = 1002 (two-tailed test). 


hosiery groups and also for the above-median 
and below-median subgroups in the metal and 
hosiery groups. In case of the above-median 
and below-median intrasubgroup comparisons 
(see Table 4) for Ss who were administered the 
TAT, the significance of the differences was 
checked by the Mann-Whitney U test of two 
independent samples (Siegel, 1956). 

A review of Tables 2 and 3 indicates that 
the metal group and the above-median sub- 
groups differed significantly from their counter- 
parts on job satisfaction, value patterns, aspi- 
rations, achievement motivation, and attitudes 
toward labor unions. The picture that emerges 
from the intracomparison of the smaller sub- 
groups (see Table 4) is more or less comparable 
with the one obtained from their respective 
counterparts in the bigger samples. The minor 
differences can easily be attributed to sampling 
error. The differences, in all the cases, are m 
favor of the metal group and above-median 
subgroups. Keeping the scoring procedures in 
view, it implies that the fast-progressing group 
or subgroups tend to be more satisfied with 
their jobs, attach greater value to the import- 
ance of their growth and expansion of indus- 
tries, want to do things in a big way, score 


higher on achievement motivation, and are 
more favorably inclined toward labor unions. 
Further examination of the £ ratios suggests 
that the metal and hosiery groups differ sig- 
nificantly on all the tested variables except 
those of job prestige, money spent on social 
work, and money spent on personal comforts. 
Such differences are inconsistent and much less 
marked in the subgroups. This is particularly 
true for Inv T, Hoa T, Conft A, and Opt AT 
This can be explained by assuming that these 
variables do not contribute much to entrepre- 
neural success. The differences on these vari- 
ables in the metal and hosiery groups are 
special characteristics of the two groups. ` 


Factorial Study 


Table 5 reports rotated factors for the fast- 
and slow-progressing groups, respectively. The 
common variance accounted for in the two 
analyses are 57.5% and 51.2% for the fast- and 
slow-progressing groups, respectively. 

The major components of Factor 1, both in 


the fast- and slow-progressing groups are AM, 
Inv T, aspirations, value patterns, and Att L. 
es 


It would appear from the description of th 


ee Pee A BE, 
e eme 
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TABLE 5 
VariMAx-RotaTep Factor LOADINGS g = 
Fast-progressing group Slow-progressing group 
Test variable : 5 F 7 E i 5 A | j 
| 
5 3 01 
i i 22 -56 | —.02 19 40 | —.02 63 2 i 
pe a 46 65 | —.10 | —.09 65 | —34 -52 | —31 | — 7 
E l Baten 59 .12 AS | —.10 40 | —.60 al | —.10 T 
As a 76 -32 | —.06 | 11 -70 | —.71 -20 | —.01 = 
kevesen motivation .81 .12 .02 10 | 08 | —.72 | —.07 z ~ 
Investment tendency -69 13 | —.08 .01 50 | —.46 | —.08 | 7 AL 
Hoarding tendency 03 -10 16 92 88 | — 44 -24 | —.04 A 
Conflict attitude —.02 | —.01 08 01 .01 05 —.01 —.03 ea 
Optimistic tendency —.07 | 09| 76 14| 61 07 | —01 .57 60 G 
Social work 12 61 56 | —.24 -76 08 21 79 a aa 69 
Personal comfort 10 67 Al 13 64 | —17 47 58 
Attitude toward labor e 
ed 39] 25] 68 02 68 | 23 17 67 13 
7 51.22 
% Contribution 20.84 | 14.88 | 13.31 | 8.48 | 57.51 | 16.66 | 9.72 | 1612 | 8.72 | 31 
70 | Don eae 
—— , 


tests that they measure tende: 
sist, cooperate, and go abou 
way in business and industry, 
characteristics, Factor 1 is n 
structure of this factor in the 
group differs slightly as it has an additional 
significant loading on the variable of Hoa E 
The contribution of Factor 1 in the fast- and 
slow-progressing group, is 21% and 17%, 
respectively, 

Factor 2 has hi 


ncy to strive, per- 
t things in a big 
In view of these 
amed AM. The 
slow-progressing 


ments made by the tests showing high loadings 
on this factor, j 


self-image. Its Contribution to the to 


and slow-progressi 
is 15% and 10%, respectively, 

howing significant loadings on 
WO groups are Opt T, Soc W, 
P Comf, and Att L. It appears difficult to find 


tal vari- 
ng groups 


ive name that might 
ell. The con- 
t- and slow- 


Progressing groups is 13% and 17%, respec 
tively, has 

Factor 4 in the fast-progressing groups T 
significant loadings on the variables of Hoa i 
and it practically accounts for the total yar 
ance attributed to this factor. In the Sip 
Progressing group the significant loadings A 
this factor are on the variables of Conft Tyk a 
T, and P Comf. Apparently, the siren is 
this factor in the two groups is different. ree 
difficult to name this factor and may be ! 
garded as quasi-specific, 
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economy between 13th and 18th centuries. Economic 


rate of industrial 
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in economic develop- 
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l growth. 
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plex, and valuable nature of R&D functions, 
Experimentation with o 
is always subject to diffi 
subject mortality, 
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the organization u 
On the other hand, excellent 
been made favoring the laboratory as a site 
for testing organization hypotheses (see Weick, 
1965; Festinger, 1953). Carzo (1963), Carzo 
ecker and Baloff 
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anization structure 
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vel (individual versus group) on 
valuated, creative task 


. STONE? 


of mode of organization (specialization 
the per- 


groups organized by 
ack had higher mean word 


camine 
of existing Managerial committees to fa 
the effects of individual and group eve e Ta 
back on performance and morale. In ae 
search reported here, the laboratory bnew. 
as a testing ground for examining hypo! atin 
regarding the effects of mode of Organi gee 
(specialization versus project teams) and the 
back level (individual versus group), on aai 
performance and satisfaction of creative t 
groups. rgan- 
Since creative task groups are usually 0 ai g 
ized specifically for the purpose of genere and 
new or unique ideas or products, it is jimpo: aye 
that they be organized and managed in W ro- 
that are most conducive to high group Fad 
7- The problem of how research nin 
ratories should þe organized is one of T by 
i in industrial research poo e 
Shepard (1956), Shepard suggests that vale 
a ic modes of Organization, Spec the 
“organization by task,” In oach 
Specialization mode of organization, a i 
Person in the SToup approaches the task fits 
Specialist or Specializes in only one pits o tly 
task. The Specialization mode is frequen or 
characterized by a clear division of la J. 
among members of the creative task groui” 
Project teams typify the “organization. eJ 
task” mode. Such groups are organi ing 
around a certain task, their raison d’être aer 
to solve a Particular problem or class of Padi 
lems. Group members organized in this O 
dissect the group problem e 
areas of specialty; rather, 1 
lem or task as a group. 


4 


Creative Task VE” 
Literature comparing the two modes of responding to individual task achievement, the N 
organization has tended to favor project teams reward system M y serve to motivate the jn- 
es groups organized by specialization. dividual to further accomplishments. 
hepard (1956), for example, has stated that Performance evaluations are necessary if 


the specialization mode leads to a number of feedback is to be administered to individuals 
i ask groups, 


difficulties such as “resistance tO crossing oF groups- Jn the case of creative t 


specialist poundaries and cooperating with two sources of evaluation are common. Evalu a 


other groups: - cand a lack of creativity and ation of performance may be undertaken either 
responsiveness tO challenge LP- 302].” In his by a single supervisor who is not a member 
study of 11 R&D laboratories, Pelz (1967) of the group OF py task grouP members them- 
found that high-perfort ienti : 
to be those with not one, put two oF three tion,” has pecome increasingly 
areas of specialization. He also concluded that recent years. © 
“specialization lends security, ut diminishes appraisal, it has been used st 
challenge [p- 165].” On the other hand, it has poth military situations Hollander, 1954; 
been said that groups organized into project +; 
teams will often turn out to be “enthusiastic, Weitz, 1958). Peer evaluation was employed 
creative, and dramatically successful [Shepard, py the 24 creative task groups in this study- 
1956, p- 3027” and also more innovative than The following hypotheses were tested: 


those groups EM! hasizing “narrow nondupli- 
k groups EMPEY del A 1. Performance and satisfaction of groups 


cati i iti uties an P A 
RUDE nonoverlappi*t oer of ea x organized into project teams will be higher than 
l responsibilities [Thompson 19G05Be T hose organized by specialization. 
Let us now turn our attention to the secon! 
i i is study— 2. Performance and satisfaction of groups 


variable under investigation in th 2. Perfor 
reward an given individual-level feedback will be higher than 


ing group-level feedback. 


feedback level. The problem © y r 
i 1956) nme those rece 


recognition 1S another 0 Shepard’s ( 


dilemmas M industrial research. 
rovi METHOD 
Eke quality Design. In this experiments mode of organization and 
ance, 4 reward system 38 & feedback level were examined in a 2 X 2 design that 


K Pe aaa on age ol: ; jalization— dividual; and $ ecialization— 
re er an individu al or a | ; specialization in A pecialization: 
tered oR an k may be rected toward each roup- Each cell contained 6 groups, making & total of 
edbac ee 7 24 experimental groups- Each group had three Ss who 


organizational unit (individual worked together on 2 word construction task. The 
retested and minor adjustments in 


z > t a ap Subjects. The Ss were 72 students in elementary 
d individual wersue payeboloey courses at the University of ee 
eedback has tended to favor the Students who volunteered to take part in the 2-bhr. ~ 
not furnish experiment received bonus points for their participation. f 
sE. word-construction task devised by E was 
used. The object of the task was to make as creative 


io 
erformance feedback to the amount of times using a number of letters and employing 
no correct oF preplanned solu- 


alua For each trial, six letters were placed face 
der on 2 table. In addition six letters 


i mportant to “the were placed in a row, face down on the table. The backs: 


bY 3. Py 
tenance of one s self- of these unknown Jetters Were numbered 1, 2, 3, 
£ ree rules, SS could either: 


concept [Sco t Dornbusch, sehing, & and 6. Through the use of th 

Laing 4967, P- 5.” Shepard (195 as (o) set aside permanently one of the face-up Jetters 

y ; ute reward sys am st ould be nd replace it with a letter drawn (in numerica order) 

stated that the rew system * š i from the row of unknown letters Rule No. 1); (b) ad 
ical order 


esponse to in- tothe face-up letters a letter drawn (in numer 


3051.” ‘By tor the unknown letters Rule No. 2); ° (e) chang 
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one of the face-up letters into any letter of their = 
(Rule No, 3). Use of a rule entailed a cost that varie 
according to utility of the rule (i.e., Rule No. 3 cost 
more to use than did Rule No. 1). An 8-min. time limit 
was set on each trial, with warnings given at 6 and 7 
in. 
M Bach trial was scored by counting the number of 
letters in the final word constructed by the group, 
multiplying this number by 20, and then subtracting 
the number of points entailed by use of rules. The 
resulting number for each trial was a group’s final word 
Score. y Se 
Manipulations. Groups were organized by specializa- 
tion or into project teams by having Ss in each mode 
approach the task in a different way. Regardless of 
mode, the same three rules were used to work the task. 
However, in groups organized by Specialization, each 
S was given a different rule to work with (ie., Person 
No. 1 received Rule No. 1, etc.) while Ss organized 
into project teams were each given the same three rules 
to work with (i.e., each S received all three rules). 
Each S in the specialization mode was told that he 
was to be considered an expert on the use of his partic- 
ular rule. These Ss were also instructed: “In working 
on the task you are to advocate the use of your rule and 
actively encourage the other group members to use it, 
Remember, you will be rated on how well you do this,” 
In the project team mode, however, Ss given the same 
three rules to work with were instructed: “In working 
on the task you are to cooperate with one another, Re- 
member, you will be rated on how well you do this.” 
To manipulate level of feedback, Ss were told that 
they would be receiving either individual or group-level 
feedback in the form of perform: 


ance evaluation scores. 
Individual performance evaluation scores informed each 


S of how his performance compared to each of the 
other two members in his group. Group performance 
evaluation scores let each S know how his group's 
performance compared to that of two comparison 
groups.” The Ss in each feedback condition were told 
that “a high performance evaluation score reflects both 


high verbal ability (a part of intelligence) and 
creativity,” 


Since peer evalua 

told that they would be 
or group perfo 
tions, each § 


$ rocedure, After readin 
Sheets, Ss were provided with an exam 

ple of how th 
task was worked and score a 


* Two comparison groups were u 


feedback condition so that the same false feedback 
Scores¥could be used for both individual and group 
feedback conditions, } 


sed in the group 


were collected by Æ as soon as the Ss had Mina 
filling them out. Feedback was delayed one trial ae 
is, performance evaluation scores for Trial 1 wert ee 
ceived only after Ss had finished evaluating their Pi 
formance on Trial 2, etc. Feedback was lagged E 
way both to enhance the credibility of false feed a 
and to simulate the delay of knowledge of oer 
common to R&D groups. A one-page miiderpernnt 
questionnaire was filled out by Ss after camperen, si 
evaluation of Trial 2 but before any feedback fr z 
Trial 1 was given. This questionnaire was cone ae 
to provide a measure of satisfaction with the tas! Pe 
was administered before any feedback was given na i 
attempt to avoid possible contamination due to flied 
feedback. After receiving feedback from Trial 5,58 Ta 
out a short postexperiment questionnaire and W 
debriefed. 


Measures of Performance 


Three measures of task performance were used: 
word score, creativity, and time. PE ENEE 
Word score. High word scores on the task ag cate 
good performance, Word score was calculated for Bee 
of the five trials by multiplying the number of letter 
in the constructed word by 20 and then subtracting 
the number of points accrued through use of cone 
Mean word score, over five trials, for the 24 group 
-79, with a standard deviation of 6.71. -T 
y. High creativity scores indicated good pes 
formance on the task. This performance measure AS 
obtained from 7-point Likert-scale ratings of creativity? 
A panel of three judges was told briefly about the word- 
construction task. Each judge was then asked to rate ns 
to creativity the many different words constructed by 
the 24 groups over all trials, Average interrater re- 
ability for creativity ratings was .71. Creativity scores 
Were assigned to cach word by averaging the three 
creativity ratings given it, Mean creativity, over five 
trials, for the 24 groups was 3.59, with a standar 
deviation of .78, Creativity correlated .42 with word 


score, 
Time. Good performance with respect to time wae 
indicated by fewer seco: 


nds taken to complete each tria 
of the task. The time limit set for each trial was 8 mn» 


or 480 sec. Mean time over five tri oups 
time, als, for the 24 gr 
was 313.96 sec, with a Standard deviation of 72.93 sec- 


Time and Word score co; S ivity 
rrelated 17+ a reativit) 
correlated .12, 7; timeand c 


_ Task Satisfaction Measure. Task satisfaction was 2? 
index of five questionnaire items four of which were 
S-point Likert-scale items, The items touched on hoW 
Ss felt while Working on the task, how much they 
The ne ra! tand fun it held for them: 
an: of the tasi items used were from the 

questionn: administered after Trial * 
performance feedback. T ne 
from the postquestionnaire: 
ernal consistency reliait 
“ngs : Pages 195-106) for the items e 
24 e —— ae “74, Mean task betes for the 
Task ees 18.30, with a standard deviation of 1.6- 
elated — 16 with word score, 


>and — .56 with time. 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF THE FOUR DEPENDENT VARIABLES 
ON MODE OF ORGANIZATION AND FEEDBACK LEVEL 


| a 
| Mode of organization Feedback level 
Dependent | . | 
variable Project team | Specialization Individual Group 
] 
_ a | 

| xX | SD | X | SD X SD x SD 

7 ime | 285.50 57.91 342.42 71.03 322.67 | 58.67 305.25 70.39 
ask satisfaction | 19.13 1.38 17.47 1.24 18.44 1.25 18.17 1.37 

W ord score 92.50 5.63 89.08 5.72 93.17 6.90 88.42 4.09 
Creativity 3.54 | 75 3.63 76 3.49 17 3.68 JA 


RESULTS 


Separate analyses of variance were done on 
each of the four dependent measures in the 
study: word score, creativity, time, and task 
satisfaction. With the exception of creativity, 
a significant effect was found on each de- 
pendent measure. Means and standard devia- 
tions of the four dependent variables on mode 
of organization and feedback level are given 
in Table 1.4 

Regarding mode of organization, it 
hypothesized that performance and satisfac- 
tion of groups organized into pro, 
would be higher than those organized by 
Significant effects of mode on 
lent measures, time and task 


was 


ject teams 


specialization. 
two of the depenc 
satisfaction, supported this hypothesis. 

The effect of mode on time was significant 
at the .06 level. As seen in Table 1, groups 
organized into project teams had a mean time 
of 285.50 sec., while groups organized by 
specialization on the average required more 
time (342.42 sec.). This difference accounted 
for 11% of the total variance for time. 

The effect of mode on task satisfaction was 
significant at less than the .01 level. This effect, 
which accounted for 18% of the variance, 
supported the hypothesis of higher satisfaction 
bs a 


4 ANOVA tables for the variables time, word score, 


and task satisfaction have been deposited with the 
National Auxiliary Publications Service. Order Docu- 
No. 01475 from the National Auxiliary Publi- 
cations Service of the American Society for Information 
Science, c/o CCM Information Sciences, Inc., 909 3rd 
‘Avenue, New York, New York 10022. Remit in advance 
$5.00 for photocopies or $2.00 for microfiche and make 
checks payable to: Research and Microfilm Publi- 


cations, Inc. 


ment 


in groups organized into project teams. Mean 
task satisfaction for groups organized into 
project teams was 19.13 as compared to 17.47 
for groups organized by specialization (see 
Table 1). 

In light of the above findings, it is interesting 
to note the significant correlation (r = — .56) 
between time and task satisfaction (see Table 
2). This correlation shows that low mean 
times are associated with high satisfaction 
scores. 

There were no significant effects of mode of 
organization on either word score or creativity. 

Regarding feedback level, it was hypoth- 
esized that performance and satisfaction of 
groups given individual-level feedback would 
be higher than those receiving group-level feed- 
back. Although there was no significant effect 
of feedback level on satisfaction, findings for 
the variable word score lent support to the 
hypothesis regarding performance. 

The effect of feedback level on word score 
was significant at the .07 level. Groups given 
individual-level feedback had higher mean 
word scores than did groups receiving group- 
level feedback (93.17 as compared to 88.42, 


TABLE 2 
CORRELATIONS BETWEEN THE DEPENDENT VARIABLES 


Dependent variable 1 2 3 


1, Word score — 

2. Time A7 — 

3. Creativity | A2* k. — 
4. Task satisfaction | —.16 —.56* —.08 


Note.—N = 24 groups. 
*p <01. 
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see Table 1). This difference accounted for 9% 
of the variance. ee 

Table 2 shows that word score is highly cor- 
related with creativity, a measure for which no 
significant results emerged. Word Score was 
not significantly correlated with either time 
or task satisfaction. 

There were no significant effects of feedback 
level on time, creativity, or task satisfaction, 


Discussion 


This study is only a preliminary step toward 
an understanding of how organizational struc- 
ture and feedback level may affect the complex 
workings of “real life” creative task groups. 
Further field and laboratory research are cer- 
tainly necessary before R&D group managers 
modify their practices. Admittedly, elements 
in the laboratory situation are not always 
analogous to conditions existing in the larger 
organization, and it is difficult to estimate the 
degree to which the manipulations in this 
experiment accurately reflect crucial properties 
in ongoing R&D groups. In the first place, the 
study made no attempt to build in effects of 
extended contact between group members, 
such as norms, standards, and status differ- 
entiations. The specialization manipulation of 
giving each S a different rule to advocate may 
not have been sufficiently powerful to capture 
the primary characteristics of “real” specialists, 
And, undoubtedly, the personal consequences 
of performance feedback—whether individual 
or group—were not as 
mental Ss as they 
R&D group memi 


bers whose job security may 
be affected. Fur 


thermore, the experimental 


Performance of 
task groups, if of 
multiple criteri ; 


performance in research i 
is illustrated by the resu 
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ent performance criteria were not ae 
uniformly by the two factors under stu 4 A 
mode of organization affected time, but e 
word score; feedback level affected word soog f: 
but not time. This finding may have impi ga 
tions for the organization and mandeme N 
R&D groups. When rapidity of problem Er 
tion is an important criterion, attention ee 
be given to the mode of organization fac a 
feedback level being of little oa ee 
However, if product quality (here ed 
by word score) is of the utmost ee. 
attention might be paid to the feedback K 3 
factor, mode of organization not being ¢ 
major concern. ine 
Specifically, it was found that gr pe 
organized into project teams tended to a 
solutions faster than did groups organa 
along specialist lines, The Ss in the pan as 
zation mode, each approaching the m ane 
an “expert” and advocating the use 0 ie 
particular rule (or approach), required m ad 
time. Since a single task solution was ae 
from each group for each trial, Ss in apee 
zation groups may have needed more kage. 
resolve differences in viewpoint and to we 
grate dissimilar approaches into a eer 
task solution. Groups organized into ee 
teams did not have these problems, since wit 
Stroup member was given all three rules yan 
which to work in a more cooperative ae oi 
Groups organized into project teams et 
tended to express higher satisfaction wiin TR 
task than did specialization groups. Per Er 
Ss organized into project teams enjoyed the 
cooperative nature of their approach af ad 
task. Or maybe they liked the freedom they Da 
with which to work, not being restricte 


. . KOL ing 
advocating one particular rule or app! e 
the problem fro 


situation may ! n 
groups organi : 
Specializ: 
ad ve 
them t 
that the 


groups 
task be 


the per 
tend to 


; eae ms 9 
Cause of their success at it in oe f 
formance criterion time. That Peen 
gan satisfaction from success ha 
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Pointed out by Locke and Breer (1965). This 
explanation would suggest that one’s satis- 
faction with the task itself may not be com- 
pletely separate from how well one does, or 
thinks he does, on the task. Task satisfaction 
and time were correlated — .56: satisfaction 
is high when time is low and vice versa. Due 
to the fact that performance feedback to Ss 
was lagged one trial, Ss perhaps formed their 
Own, more immediate ideas of performance 
quality after completing each trial. Of the three 
Performance criteria, time would have been 
the most obvious for Ss; the 8-min. time limit 
Posed an absolute standard to work against. 
(Creativity is a fairly ambiguous concept, and 
there was no absolute word score for Ss to 
Strive for.) If, indeed, Ss did form their own 
ideas of performance quality, they may have 
felt more successful (and hence more satisfied) 
the less time they took. 

Feedback level affected only word score. 
Individual-level feedback was found to pro- 
duce higher mean word scores than did group- 
level feedback. If one accepts word score as an 
indicator of quality, this finding suggests that 
higher quality solutions or products may result 
when R&D group members’ contributions are 
individually recognized. This supports the 
notions of Scott et al. (1967) and Shepard 
(1956) that individual-level feedback is pref- 
erable to group-level feedback. f 

Feedback level did not affect satisfaction 
with the task. Perhaps the reason for this was 
that three of the five items in the task satis- 
faction index were obtained before Ss had re- 
ceived any feedback whatsoever. Therefore, 
for the most part, they had had no real experi- 
ence with either individual- or group-level 
feedback. at 

The failure of the performance criterion 
creativity to produce any significant effects 
may be due to the nature of the creativity 
measure. The measure itself may not be valid, 
since ratings of constructed words were made 
by persons who had not performed the task. 
An alternative explanation is that the measure 
is valid, but that neither mode of organization 
nor feedback level has a differential effect on 
creativity. 

Further research might focus on the com- 
bined effects of mode of organization and level 
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of feedback in the hope that a combination 
could be found that might maximize all three 
performance criteria—high quality, high crea- 
tivity, and low time. Other measures of crea- 
tivity should be used in such studies. Research 
should also be extended to the field to examine 
the effects of these variables on the perfor- 
mance and satisfaction of ongoing R&D 
groups. Finally, since peer evaluation is still 
relatively rare in R&D groups, further research 
should include evaluation by a superior or 
group supervisor. 

Summary. In brief, groups organized into 
project teams tended to complete the trials 
faster than did groups organized by speciali- 
zation. Groups organized into project teams 
also tended to express higher levels of task 
satisfaction than did those organized by 
specialization. Regarding feedback level, 
groups given individual-level feedback tended 
to construct higher scoring words than did 
groups receiving group-level feedback. No 
significant effects were found on the perfor- 
mance variable creativity. 
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DEVELOPMENT OF THE SURVEY OF WORK VALUES 


The purpose of this research was tO construct 
a set of scales measuring several areas of work cally by indirect methods that have 
of Work Values (SWV) is presumed to index this concept. Amount of 


intended to be an index of a pe 


values, referring to gen : 
the meaning vidual attaches tohis posedly logically related 


work role; 


satisfaction 
A few sca 
purpose of measuring WO 
1959; Supers 
values (Kilpatrick, Cummings, 
1964; Rosenberg, 1957 

sures have been car y 
The SWV differs from tude scales sh 


separate areas of values @ 
construct of secular 
which work values se 

The principa 
described b 


a 
1 Now at the Procte! 


herefore, 
(an attitude toward on 
les have been construc 
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Bowling Green Slate University 


A series of scales measuring attitudes toward work was developed. The Survey of 


Work Values (SWV) was based on a number of dimensions of 
specifically those aspects that deal with the meaning that an 
to his role at work. The content validity of the items was established by the re- 
g. Items were analyzed to determine the best metho 
for the subscales accor’ à 
scales. Unweighted, multipoint scoring was 5 


54 items. SWV scores discriminated among occupational group 
with background characteristics of employed and disadvantaged persons. 


rson’s attitudes industrialization, amount of e 


al, rather than his feelings nomic level, 
cept of work the more frequently 


des regarding Correlations 


used indexes. 


differs from that of job been found to be low. Frequently, 


e’sown job). of considerations inter 
ted for the havioral manifestation: 
rk values (Stefilre, nomic a 
1957, 1962) and occupational alternative pehaviors # 
& Jennings, 
). ‘Although these mea- be naive to €y] 


efully developed, they seem accurately an indivi 


ed toward sures of these concepts. 
dtothe The categories of the 
ized Protestant Ethic with 

em to be closely linked. the apparent relevance of each categor 
] aspects of Protestant E 
Weber (1958) are in 


d jndustriousness: gi 
3, industriousness probably 


he emphasis Although this selection was judgment 


the items, their scale va 
arch purposes ture concerning the Protestant Ethic. 


and permission to use the scales for rese rs eh 

may be obtained by writing Patricia ain Smith, Despite an eff i oe 
Department of Psychology, Bowling Green State i P s ort to choose areas of W 
University, Bowling Green, Ohio 43402. values that provide a broad interpretation © 
ý i ne previous Protestant Ethic, the categories of the 


don Priten are by no means ex 
onsist of two z A y 
some question may exist regarding any 


B a 

each containing four items measuring 

tant Ethic” and “pnon-Protestant mhic.” Ular aspect of Protestant Ethic 
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tant Ethic. The Ethic has been assessed typi- 


ducation, 
and occupation have been among 


between attitudes and sup- 
behaviors have usually 


yene to inhibit the be- 
s of an attitude. Eco- ) 


nd social factors may greatly limit the 
vailable to an individua! 


regardless of his attitudes. It would, therefore, 
pect indirect measures to index 
dual’s work values. Atti- 
ould provide more direct mea- 


SWV were selected on 
is of two considerations. The first was 


nt ic as secularized interpretation of Protestant 
dividualism, specifically those aspects dealing with work. 


narrow considerably the number of dimensions 
ect of Protes- by eliminating the primarily religious aspects 


r & Gamble Company- of the Ethic. A second consideration was that 
le values, the categories should be relevant to the litera- 


haustive- Unavoidably, 


that was n 
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included among the subscales. Practicality, 
however, dictates a relatively brief scale. 

Probably the most widely accepted notion of 
Protestant Ethic deals with the intrinsic aspect 
of work; that is, work as its own reward. Work 
I is to be valued because it represents the best 
use of a man’s time, not merely because it is 
instrumental to the attainment of external 
rewards. The employee high in Protestant 
Ethic is presumed to prefer working to being 
idle, to be involved in his work, and to derive 
considerable satisfaction from doing his job 
well. The authors selected three dimensions of 
j Protestant Ethic that cover the intrinsic 
| aspects of work: 

j Pride in Work: the satisfaction and enjoy- 
| ment a man feels from doing his job well, 

Job Involvement: the degree to which a 
worker takes an active interest in co-workers 
and company functions and desires to contri- 
bute to job-related decisions, 

Activity Preference: a preference 
worker to keep himself active 
-his job. 

Although the traditional E 


a 


by the 
and busy on 


thic stresseq the 
intrinsic rewards of work, considerable value 


| 
was placed on extrinsic rewards as well. The 
following subscales reflect the extrinsic nature 


Altitude toward Earnings: the value an indi- 
Vidual places in making money on the job, 
Social Status of Job: the effect the job alone 
aS on a person’s Standing among his friends, 
telatives, and Co-workers, in his Own eyes 
and/or in the eyes of Others. j 
_, Two dimensions of the Ethic have been 
i that do not Conform well to the 
ntrinsic-extrinsic dichot 
Ore, regarded to b 
Upward Striving: the desire to seek con- 
tinually a higher level job and a better stan- 
dard of living, 
Responsibility lo Work: the recognition that 
a man is obligated to work and that he must 
depend on himself rather than others for 
support. 


METHOD 


WIJTING, AND SMITH 


then checked with an industrial sample. Generality was 
investigated in two undergraduate samples. Scale values 
were obtained on another industrial sample and a a 
undergraduate sample. Endorsements of the items were 
then obtained on still another industrial sample. 


Scaling by Method of Reallocation 


First, the a priori attitude statements and the list Ks 
defined categories were Presented to a group of 23 
employees of a glass-manufacturing company in nie 
Midwest, The workers ranged from the unskilled to Be 
managerial level. In accordance with the method o 
reallocation described by Smith and Kendall (12638), 
the judges were instructed to assign cach of the A 
Statements into the defined category that they believed 
best corresponded with the meaning of the statement. 

iti defined categories, a category 
designated as other was included. The judges were 
i o this category any ere 
correspond poorly with each 0 
. The Purpose of the reallocation 
termine: (a) whether discriminably 
different Categories exist, () whether the items intended 
to measure a particular concept are judged to ba 
relevant to that Concept; and (c) whether the investi- 
the categories being measured 
Corresponds closely with the judgment of persons for 
whom the scale is intended, 

The criteria for the retention of an item were at 
least 70% allocation to a single category and no more 
than 20% allocation to any second category. An item 
Was rejected if it failed to meet cither of these criteria. 
A scale was eliminated if it Was not conceptually clear, 
as Was indicated by allocation of items to other than 
the intended Scale. Retaine items were submitted to 

i y Samples of 56 and 57 under- 
i at failed to meet the criteria on 


any one Teallocation Was deleted. The use of a non- 
industrial Population was intended to establish the 
Stability of th 


ese judgments Over distinctly different 
groups. . 

; Then scale y; ng level of cach statement 
wong the rele m Were obtained from a 
Sample of 46 Workers from the same company repre- 


s range of occy ational levels. The 
Judges were given six - 


(The Responsi- 
j See Results sec- 
€d of five letters with a 
avioral descriptions as end anchors 
cs. The points of the scale were 
7ypothetical Workers, The judges were 
hich of the five hypothetical 

Bie likely to ave made each of the 
of a 5-point oie bac Were computed on the basis 


vocale values, which had been 
Y small group of judges, ihe 

by a group of 45 under 
© two sets of Values were correlated. 
PS of Judges differed markedly ue 
the Correlation between the two 


Values reflects the stability of the relative 


es 


a 
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Positions of these ratings across different populations 
rather than their reliability across two samples drawn 
from the same population. 

The 67 attitude statements were presented in 
Pe order to a group of 495 employees from three 
bee of the glass-manufacturing company mentioned 
Cae Employees were instructed to respond to 
as ious on a 6-point scale of agreement ranging from 
Ma agree to strongly disagree. The sample in- 

seven occupational groups: management 
(N = 35), supervision (N = 79), professional (N = 82), 
technicians (N = 37), clerical (V = 120), skilled trades 
(N = 47), and semiskilled and unskilled (W = 85). 

Also included in the sample were 10 workers for whom 
Job titles were missing. 


Scoring of the Survey of Work Values 


I pocneth of agreement was computed from a 6-point 
i scale, which was also scored dichotomously 
(agree-disagree). Agreement values were reflected in 
order to maximize the number of positive intercorrela- 
tions within each subscale, For unweighted multipoint 
and unweighted dichotomous scoring, agreement re- 
Sponses were simply summed for each subscale. 
Weighted multipoint scoring and weighted dichotomous 
Scoring summed the products of the agreement value 
of each statement with its corresponding scale value. 

Product-moment correlations were computed for 
cach item against total score for the subscale, and all 
of the items were intercorrelated. Also, subscale totals 
were correlated both within and across the four methods 
of scoring. Because each correlation between item and 
subscale total was based on a small number of items, a 
correction for the possible spurious effect of autocorrela- 
tion was made (sce Nunnally, 1967, p. 262). For each 
subscale, items were arranged in order of their correla- 
tion with total subscale score. Coefficient alpha (a mea- 
sure of internal consistency) was computed for the 
items with the highest item-total correlations. Other 
items were added successively to this core of items in 
order of the magnitude of their correlations with total 
score. Items were retained only if their addition re- 
sulted in an increment to alpha. Preference was given 
to items spaced widely along the subscale continuum, 
but with scale values of low variability. 

An item was retained, therefore, if it: (a) met 
reallocation standards, (b) had a high correlation with 
its subscale total score, (c) resulted in an increment to 
coefficient alpha, (d) increased the distribution of scale 
values among items of its subscale, and (e) had a scale 


value of low variability. 


Dimensionalily of the Survey of Work Values 


The dimensions obtained by reallocation need not 
correspond with those obtained by factor analysis, 
especially since reallocation is based on discriminability 
among the scales, rather than orthogonality. Scoring 
by factors might be better than scoring by allocation 
subscales. Therefore, the factor structure of the SWV 
was examined. The items administered to the last 
industrial sample (V = 495) were analyzed by a 


999 


principal components analysis with quartimin rotation 
(Goodale, 1969b). 


RESYLTS 


Of the 91 items in the original pool, 45 were 
retained after allocation. They had been as- 
signed to a particular category on an average 
of 84.1% of the judgments. The Responsibility 
to Work subscale was eliminated because its 
items did not meet allocation standards. The 
Activity Preference items also failed to meet 
the stated criteria because the items were con- 
sistently allocated to the Responsibility to 
Work scale as well as to the intended scale. 
The definition for Activity Preference was, 
therefore, revised and additional items were 
written. 

The 45 items retained from the first alloca- 
tion, plus the 13 statements for Activity Pre- 
ference that had failed to allocate, as well as 
17 new statements, were subjected to two sub- 
sequent allocations. Two samples, consisting 
of 56 and 57 undergraduate students, were 
instructed to allocate the statements into the 
six defined categories and an “other” category. 
Of the 75 statements, a total of 67 were re- 
tained with an average value of 85% allocation 
to a single category. A total of 42 statements 
had survived two reallocations, while 25 state- 
ments had been reallocated once. Of the latter 
25 statements, 12 were from the Activity 
Preference scale, and 13 were from the new 
items that were added to the item pool. 


Scale Values 

The rating of item scale values by the in- 
dustrial sample produced a wide range of 
mean values within each of the six subscales, 
with relatively low standard deviations. The 
correlation between the two sets of scale values 
obtained from the industrial and undergradu- 
ate samples was .94 (V = 67). In addition, no 
systematic displacement of these ratings had 
occurred. 

Total subscale scores based on the 6-point 
ratings of agreement were highly correlated 
(r = .98) with totals based on the summed 
products of the agreement weights and scale 
values. Dichotomous scoring of the unweighted 
values was also strongly related (r = .96) to 
the totals derived from the multiplicative ap- 
proach. The unweighted multipoint scoring 


TABLE 1 


ITEM ENDORSEMENT MEANS AND 
STANDARD DEVIATIONS 


Subscale Range of Xs | Range of SDs 
Status 2.32-3.88 
Activity 317 360 
Striving E 
Earnings 2014. 
Pride 4.86-5.72 
Involvement 4.16-5.42 


Note.—N = 495. 


method was retained because it increased item 
variance over that from dichotomous scoring. 


Scale Characteristics of the Survey of Work 
Values 


The item analysis resulted in six subscales 
each containing nine items, Table 1 summarizes 
the means and standard deviations for endorse- 
ment of the items of the SWV, 

Median intrascale item intercorrelations and 
reliabilities (alpha) are shown in Table 2, 
To substantiate these values, the SWV was 
also administered to a sample of government 
employees (N = 356) and intrascale item 
intercorrelations and alpha coeffcie 
tecomputed. The results of the latte 
(also shown in Table 2) agree closely 
values obtained from the developme 

The subscale reliabilities are co 
be satisfactory in view 
number of items within each subscale. Also, 
alpha tends to be small when item means are 
as extreme as those shown in Table 1, 


nts were 
r analysis 
y with the 
nt sample, 
sidered to 
of the relatively small 
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Test-retest reliabilities with about a a 
interval were obtained from a group ob a 
ployees (V = 66) of a large eastern ee 
company. The coefficients for each on il 
are shown in Table 2. The test-retest re fe 
ties are as high as could be nag a FS 
the level of internal consistency 0 

scales. . 

egies 3 summarizes the relationships pron 
the subscales based on the intercorrelations a 
the subscale totals. Of the intercom 
among subscale totals, 11 were oe, 
greater than zero (p < .01). The aon aoe 
consistent with the hypothesized two-1a ic) 
dichotomy of work-related values en 4 
and reward-related values (extrinsic). Re 
intrinsic values are Pride in Work, aude 
Preference, and Job Involvement. The se i a 
tions among these variables were all one 
tial and positive in direction. The corre race 
between the extrinsic variables of was 
toward Earnings and Social Status of Jo iL of 
Positive and significant, Furthermore, “a 
the correlations across the intrinsic-extrit te 
groupings were either nonsignificant or nae 
tive. Attitude toward Earnings was eer 
related to each of the intrinsic variables. both 
Upward Striving measure, which is p 

intrinsic and extrinsic in its character, shon 
positive correlations with both sets of ihe 

Principal components analysis reveale 36% 
Presence of six factors that accounted for 3 D 
of the total variance in the matrix (see Table in 
The six factors were rotated by the guene 
method (Harman, 1967), which was selec a 
because it places no restriction on the obliqt 


: P ong 
ness of the solution, The correlations amore 
factors were relativel 


- ranging 
y low, however, ra! = ol 
TABLE 2 from — .20 to .33 with a median valu te 
epresen 
MEDIAN INTRASCALE Irem INTERCORRELATIONS, — 01. The first factor extracted repres 
Corrricrenr ALPHA RELIABILITIES, AND 
TEST-RETEST RELIABILITIES TABLE 3 
o FOR 
; N 7 Torats F 
Industrial Government Insurance INTERCORRELATIONS OF SUBSCALE eke 
g workersa workersb employeese THE Survey or Work VALUES 
Subscale 
oe ide 
Mdn.r| Alpha | Mdn.» Alpha | Test-retest Subscale | Status | Activ- | Striv- Ta i 
sai a j 163 | a3 55 71 = we se 
Activ i Be +15 <61 71 Status 
aig ae = J -55 176 Activity .00 
16 63 ie a m Striving .18* 11 
Involvement | 1117 53 Ne i ý Earnings ait = 948 £ 
5 16 163 ‘68 ride 06 eae eee = | (dt 
‘Ga SL  . Involvement 105 .36* .23* =i mE 
aN = 495, 
SN = 356. 
oN = 66, 


Note.—N = 495, 
*b <.01, 


SURVEY © 


SampLe ITEMS AND Tuer FACTOR 
OF THE SURVEY O 


Work VALUES 


r Work VALUES 


TABLE 4 


Loapincs FOR HE Sıx FACTORS 


Factor loading 


Sample item 


Intrinsic values 
y to do a decen 


A worker shou 
around. 


Jd feel some responsibilit 


If the person can get away with it, he should try to w! 
expects him to. 


The best job that a worker can get i 


working day. 


Organization-Man Ethic 
A man should feel a sense of pride in his work- 
self busy 0n his job, the working 


If a worker keeps him: 
loafing. 


A worker who does a sloppy job ought 


Upward Striving 
Jf a man likes his job, he should be satisfie 
other job. 


fheis satisfied wit 


A worker is better off i 


to another job. 
A promotion toa higher-level job usually 
reason. 
Social Status of Job 
‘The man who holds down & good job is the most resp 
id not 


My friends would not think much of meifI di 
Having 2 good job makes a person worthy of pra 


Conven tional Ethic 


Doing a good job should mean as muc! 
d always be thinking about p 
d to a 


f being promote 
s doing the best 


Aman shoul 
with the hope ° 


There is nothing aS satisfying @ 


Attitude toward Earnings 


an should choos 


Am e the job which p 
Aman should take the job which 
the same. 


Aman should choose one job 


s one which permits hi 


to feel alittle ashami 


d withitand should n 
h his job and is not con 


means more worrie: 


htoa worker as a 8! 


julling hi 
higher level 


ays the most. 


offers the most ov 


over another mos 


t job whether or not his supervisor is 


ork just a little slower than the boss 


m to do almost nothing during the 


day passes more quickly than if he were 


ed of himself. 


ot push for a promotion to an- 


cerned about being promoted 


s and should be avoided for that 


nected man in the neighborhood. 


havea good job. 


ise from his friends and family. 


ood paycheck. 


mself up in the world and should work hard 


job. 


job possible. 


ertime if the regular pay on the job is about 


64 


.62 


tly because of higher wages. 
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an intrinsic dimension in that it was pre- 
dominated by significant loadings for es 
from the Pride in Work, Job Involvement, ani 
Activity Preference subscales. Separate factors 
were extracted for Attitude toward Earnings, 
Social Status of the Job, and Upward Striving. 
The items that loaded on a factor named 
Conventional Ethic characterized a high level 
of the work value dimension that they repre- 
sented and were strongly endorsed by most Ss, 
(Mean endorsement of the items was 5.16 
on a 6-point scale and the mean scale value was 
4.36 on a 5-point scale.) One other factor 
accounting for 21% of the common variance 
was tentatively named an Organization-Man 
Ethic. 
Discussion 


Reallocation resulted in six discriminably 
different sets of items for which a considerable 
degree of rater agreement was obtained. De- 
spite the apparent success of the reallocation 
procedure, the internal consistencies of the 
SWV subscales were not as high as one would 
prefer. The coefficient alpha reliabilities, which 
were mainly in the .60s, are about what one 
would expect, considering the range of scale 
values within each subscale and the small num- 
ber of items within each subscale. Since ap- 
proximately three times this number of items 
is usually required to achieve a reliability 
Coefficient of .80, extreme dissatisfaction with 
the results of the analysis is not warranted. 
Yet, in light of the extensive prescreening of 
items by allocation, one may wonder why the 
homogeneity of the subscales was not greater. 

Two possible explanations for the generally 


low item intercorrelations are worth mention- 


ing. One such Possibility is that the marginals 
for computation of interitem correlations are 
hot comparable, due to the deliberate spread 
of items along the continuum, 
close to the 50% 
cided on the s 


correlations. 
Another possible reason fi 


n ; or the low internal 
consistencies may be that t 


he subscales of the 
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SWV are factorially complex. In view of Re 
magnitude of the item-total correlations, bori 
ever, more than just a few strong factors wou! ‘ 
have to be demonstrated if the internal con 
sistencies are to be ascribed to factorial A 
plexity (Nunnally, 1967). The factor ana oe 
did indicate, in part, a lack of corresponden i 
with the results of the reallocation procedure 
in terms of discrimination of the urtena 
subscales. It did not reveal the ee 
many strong factors to account for mera 
tively low internal consistencies and sae 
total correlations. The question raised by ‘a 
factor analysis was whether to use the su é 
scales determined by reallocation or ae 
determined by the factor analysis. The nE 
of correspondence between the two procedur : 
was not surprising; for although both et 
analysis and reallocation are suitable BE, : 
for obtaining subscales, they are not ee. 
methods for achieving the same ends. Fac A 
analysis clusters items in terms of — 
response patterns, whereas reallocation agers 
items in terms of whether they have ae 
judged to be relevant to a particular du 
category. The relative quality of aes pee 
and meaningfulness of the relationships of mi 1 
scale scores with relevant variables shou fe 
indicate which method of defining subscales 5 
preferable. The authors, therefore, elected is 
use the subscales determined by both method a 
in subsequent studies, and to delay deciding 
in favor of one over the other set of subscales. 
A problem often associated with scales © 
this type is that of social desirability. While 5 
social desirability effect is usually regarded z 
objectionable and as a source of bias, there ! 
reason to doubt whether this criticism 15 
appropriate for the measurement of va 
If one is concerned about the extent to whic? 
a group of individuals shares a common value 
pattern, then in a very real sense it is soo 
desirability that is being measured. Of cours : 
measured differences jn work values between 
diverse cultural groups are necessary if = 
SWV is to have value as a research instrumen 


s 
Some A pplications of the Survey of Work me 
Occupational Level 


The SWV was used to determine eet 
the subscales could discriminate between a a 
pational groups. Wollack (1968) perfor™ 


p” 
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TABLE 5 TABLE 6 
D UNSCALED VECTORS Scarep MEAN SCORES ON DISCRIMINANT FUNCTIONS 
x | 
Subscale I | Il Group É IL 
Status 281 516 Professional 5 
tus 2 e 1.54 —6.T 
Activity | .726 | —.341 Supervision | 1.34 EI 
Striving N 331 Skilled-technical 88 —6.80 
Pernin | —.528 | —.335 Clerical \ 1.33 2739 
fide — 298 —.140 Semiskilled-unskilled 5 | 
Involvement —.144 | —.613 hea s me 


| 


<= 


discriminant analysis using five occupational 
groups (N = 449) ranging from unskilled em- 
ployees through professionals. 

Two statistically significant discriminant 
functions (p < .05) were extracted with latent 
roots of .14 and .06, respectively (chi-square 
test for the dimensionality of discriminant 
space). Latent vectors for Functions I and II 
are given in Table 5. Discriminant scores were 
computed for each person in the sample. The 
procedure for obtaining & person’s scores On 
the two discriminant functions consisted © 
weighting each of his subscale totals by the 
corresponding scaled vectors an i 
these products over subscales. GrouP means are 
presented in Table 6. 

The first discriminant function, which ac- 
counted for approximately 61% O the be- 
tween-grouP variance, contrasted Activity 
Preference (positively weighted) with Attitude 
toward Earnings (negatively weighted). Table 
6 suggests & plue-collar—white-collar con- 
tinuum. 

One word of caution should be raised regard- 


ing these data. One might be tempted to con- 
clude without reservation that white-collar 
ple view of work- 


workers have 4 more favora! 
related activity than do blue-collar workers. 


However, activity probably means different 


things to different groups of workers. There is 
nothing W SWwV that 


specifies the P 
which this subscale 
activity may imply phy 


interpretation. 
The secon 


discriminant fun 
tt 
accounted for 


about 4 


group variance, was characterized primarily 
by a high positive weight for Social Status of 
Job anda high negative weight for Job Involve- 
ment. This function seems to describe a dimen- 
sion of clerical versus other workers. Group 
differences appear 
sionals, supervisors, skilled-technical workers, 
and semiskilled-unskilled workers. 
workers, however, stood lowest on this dis- 
criminant function and were relatively distant 
from the other occupational groups. 


Biographical Factors 


The relationship between biographical fac- 
tors and the subscales of the SWV has also 
been examined. Wijting (1969) studied the 
relationship of background variates and work 
values among unemployed persons (VN = 133). 
A canonical regression analysis revealed a 
relationship (Re 
criterion compos! 
(- 71),* Job Involvement (— .58), and Social 
Status of Job (.43), 

< (— 51), race 
urban versus 
= 298), parents’ education (.31), and amount 
ice trouble (.38). The criterion composite 
reflected an emphasis on the social rewards of 
put a devaluation of activity 
and involvement on the job. The corresponding 
predictor composite rural, 
poor male whose family 
trouble and who has had less education than 
his parents. 

Goodale (1969a) performed asimilar analysis 
put with different predictors and civil service 
(N= 342). This study analyzed 


employees 
determined by both methods. For 


subscales 

4 All loadings used to interpret the vectors are cor- 
relations of the variable with its composite rather than 
canonical regression weights. 
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the allocated subscales, Attitude toward 
Earnings (.75), Activity Preference (— .56), 
and Social Status of Job (— .43), were related 
(Re = .51, p < .001) to race (— .65), occupa- 
tional level (— .60), area of the country from 
which the person came (—_55), and super- 
visory level (.42). The combination of predic- 
tors, describing a negro whose parents came 
from the South and who held a low-level, 
nonsupervisory job, was associated with high 
attitude toward earnings, low preference for 
activity at work, and low social status of being 
employed. In the analysis of factored subscales, 
Attitude toward Earnings (.73), Organization- 
Man Ethic (— .54), and Conventional Ethic 
(— .48) were related (R, = .50, p < 001) to 
race (— .65), area of the country from which 
the person came (— .48), occupational level 
(— .37), marital status (— .29), and super- 
visory level (.35). This pair of composites was 
characteristic of a married Negro; raised in the 
South; occupying a low-level, nonsupervisory 
job; who placed great value on earnings but 
did not subscribe to the Organization-Man 
Ethic or the Conventional Ethic. 


SUMMARY 


The SWV has met some of the common 
criteria (Nunnally, 1967) for construct validity. 
First, the success of the reallocation procedure 
demonstrated that the six w 
discriminably different from o 
that the items well represent 
that they were intended to measure. Second, 
the internal consistencies of the subscales are 
relatively high in view of the small number of 
items comprising each subscale. Third, SWV 
Scores have discriminated me 
occupational groups and have correlated sub- 
Stantially with background variables that 
have been associated with other measures of 
work values. The SWY is, in its present format, 


ork values are 
ne another and 
the constructs 


aningfully among 
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considered to be a useful research tool. W a 
i rove a scale 

is currently under way to impr ove the sca! 

and to investigate new scoring techniques. 
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F CONTROL AS 
INTERACTIVE INFLUENCES ON SATISFACTION 


ABRAHAM K. KORMAN?* 
New York University 


ity is negatively correlated to 


This study hypothesized that envir 
zon only for those with low control over their behavior. The hypothesis was 
studies, with a major finding being the unimportance of 
i explanation js offered in that itis proposed that situ- 
ations of high ambiguity with their implied high reliance on people rather than rules 
and regulations are not dissatisfying if they are in balance and congruent with one’s 


perceptions and beliefs about 


onmental ambigu 


employed individual, the satisfactions of the 
al correlates of satisfaction university student, or similar situations. 

found. The first of Unfortunately, while such a solution seems 
am- to be theoretically adequate, it is, increasingly, 
is negatively correlated with satisfac- in our society becoming extremely difficult to 
. Kahn, Wolf, achieve- The reason for this is that ambiguity, 
That is, to change, and the like are becoming an inevitable 
not provide part of organizational life. Thus, we live in a 


hat- dynamic society, where values, norms, CON- 
constantly 


ast two Major threads in the 


Snoel, & Rosenthal, 1964). 


es to behavior and that W 3 
constantly sumer tastes, and the like, are 


s that are provided are (a) 4 
i changing and constantly ambiguous and where 


ever guide 
changing; and (b) may, on occasion, be contra- gal u r 
dictory; then, to that extent, It 1S often found the administrator himself must keep introduc- 
that dissatisfaction is the result. Second, there ing changes into the organization itself if it 
is, perhaps, an even more common finding that (i.e. the organization) is to remain viable and 
satisfaction is positively orrelated with the healthy m this dynamic world. Hence, the 
extent to which that environment provides attainment of role-prescription clarity and 
a self-controlled, lack of ambiguity, while desirable in order to 


a satisfied organizational membership, is 
n environment 1S not one that is easily attainable in practice. 

s self-esteem The question, therefore, arises as to how can 
ion maximize satisfactions among 
its members? In this article we will test the 


with that environment hypothesis that the significance of environ- 
& McCormick, 65). mental ambiguity for satisfaction varies as a 
from the viewpoint of function of the degree to which the environ- 
i i- ment provides @ sense of high self-esteem to the 


ay in which individuals in the environment. To the extent 
ould predict 


anner. In other gain 


t to which @ 


| a] administrator that the W i 1 

gow aa with th ‘organizational environ- that the latter 1$ the case, we W 

pams.: e ized is relatively straight- that people in high self-control environments 
be seen that the would develop a more optimistic opinion of 


fly, it can easily be t ve a 
A imizi the capabilities of both oneself and of different 


forward Ji e 

inimizing of ambiguity and the maiar f 3 3 

B \f-di a tedness should result in the utmost others. Such an increase m the perceived com- 

f faction whether the satisfactions We are petency of oneself and of different others would 

s g ab it are the job satisfaction of the tend to lead, the present author believes, tO (a 

t a greater confidence in one’s abilities to handle 
1 Requests for repay act ior aot to A 3 the change situation where rules for behavior 

z D partment © sychology, * Or = AA j e ali 2 eful; 2 

Korman, Depa ington peyew York, New yom, ViN become less salient and less useful; and (b) 

a a greater tolerance and acceptance of the posst- 
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ble changes in human relationships implicit = 
the abstract concepts of difference, eon a 
and potential difference (or chaga in PETS 
Hence, the growth of such potential- z so : 
situations as change and role ambigo Le. 
when nothing is clear and new guide a! = 
being developed) would not be as likely to lea 
to uneasiness and dissatisfaction as if the per- 
son had been in a low self-control (or high 
external control) situation, where his learning 
about the characteristics of himself and others 
would more likely have been of a negative 
po specific prediction, then, is that the 
relationship between environmental ambiguity 
and satisfaction should be lower, or not exist 
at all, for people in high self-control environ- 
ments as opposed to those situations marked 
by low self-control. Should this hypothesis be 
supported, it would be legitimate to infer that 
the changing, dynamic characteristics of 
modern society and the constantly increasing 
administrator-induced changes do not neces- 
sarily have to be causes of low satisfaction. 
Rather, such effects could be modifiable 
by first developing the high self-control 
environment. 


METHOD AND PROCEDURE 


Instrument. A Likert-type questionnaire was de- 
veloped that was designed to measure college students’ 
perceptions of their university environments as to the 
degree to which they saw it as being (a) ambiguous and 
conflicting in terms of behavioral and attitudinal pre- 
scriptions, and (b) ego-enhancing in terms of the degree 
of self-control of behavior allowed and level of com- 
petency expected. Examples of the types of statements 
that concerned themselves with perceptions of am- 
biguity were “the professors at this school are generally 
pretty similar to one another in basic values”; “the 
information you need to survive around here changes 
from week to week”; and “frequently, students around 


here are expected to achieve objectives and goals which 
conflict with one another and are mutually contra- 


dictory.” In terms of the degree of self-control expected, 
samples of the types of statements used are “the general 
policy here seems to be that you should take as much, if 
not more, responsibility in planning your work as the 
faculty”; “the administration here tends to adopt the 
approach that it is only their opinions which count and 
not yours”; and “the professors at this college believe 
that they do not have to continually pressure you and 
watch you in order to insure that you keep your mind 
on your studies.” There were a total of 19 conflict and 
10 self-control items, all of them answerable by an 
agree-disagree format, interspersed with 14 buffer items 
and 4 general overall “satisfaction with college or uni- 
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versity” items. These last items were highly cn 
with one another (average item inter-r for a EE Be the bY 
50 was .72), and hence were summed to ser : 
measure of satisfaction in this study. ire scale are 
While test-retest reliabilities for the entire aa 
not known, the intrascale reliabilities for cenveniee 
seem to be sufficiently high. In addition iy : parts area, 
item inter-r mentioned above for the satstact oy “prea 
odd-even reliabilities for the conflict and con! i by the 
were .84 and .81, respectively, when megane (eh reli- 
Spearman-Brown formula. In addition = construct 
abilities, some evidence supporting sue during the 
validity of the scales did become available i ° 
course of the study, as will be indicated = “akan a 
Subjects. Three separate studies were und er atropoli- 
three different universities in the New York Pe ese Was 
tan area. However, since the design of each of ie proce- 
the same, varying only in sample, the research Pi two 
dure can best be thought of as an original suasa was 
separate replications. The first university iia t of the 
a large private institution located m the Heat put lo- 
city, the second was also a private institution AE 
cated in a near suburb, while the third was @ rura 
institution located in a far suburb distinctly more first 
in character. In addition to these differences, + that 
and third schools also differed from the second hatte 
they had been the scene of considerable strife and rela 
ing norms and values, whereas the second had Eei ihe 
tively placid, or at least placid enough not to 
newspapers. s T 
The Gale sizes were 202 for Study 1, 90 for a j 
and 91 for Study 3. All questionnaires were distri R 
toward the close of the 1968-1969 acudemie S 
with the samples in all cases consisting of nology 
graduates recruited from dormitories, psyc ee no 
classes, and various other sources. Thus, while pa a 
case could the sample be considered a random © some 
wide variety of sources were utilized, suggesting $ 
representativeness, p zits 
Analysis. For each study, attempted median a or 
on both variables were developed and then ¢ ant 
differences in satisfaction were computed for hig the 
low ambiguity at each level of self-control. While vere 
splits were reasonably good for Studies 2 or 3, they vine 
not as good for Study 1. However, reasonably ee 
cells were obtained there also. In addition ices 
checks were made to be certain that there were n° ach 
nificant differences in ambiguity at each level for one 
level of self-control (i.e, high ambiguity was at ach 
same level for both levels of self-control, etc.)- In reat 
case, there were no significant differences, thus 1" en 
ing sufficient commonality of ambiguity at each 
for different levels of self-control. 


ResuLTs PR 
_ The results of the research are T 
in Table 1, indicating support for the hypo” 
ses only in Study 2, istent 
In the other two studies the most ey 
result was the lack of importance aerate 
ambiguity as a determinant of satisfa af 
More specifically, in the case of the high 
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control environments, where we did not predict 
any satisfaction differences as a function of 
environmental ambiguity, this prediction was 
supported in all three instances. On the other 
hand, the findings for the low self-control 
environment, where variations in satisfaction 
as a function of environmental ambiguity were 
predicted, were not supportive, except for 
Study 2. In the other two studies, ambiguity 
was no more related to satisfaction for the low 
groups than for the high. Briefly, ambiguity 
seems to be far less important as a determinant 
of satisfaction than are the control aspects of 


the environment. 


DISCUSSION 


Looking only at Study 2 for a moment, the 
results can generally be interpreted as sup- 
portive of the hypothesis which guided this re- 
search. This is a finding of considerable signifi- 
cance for the administrator since it is possible 
for him to control the self-control aspects of 
the organization but increasingly not the 
ambiguity, both within and without. Thus, the 
finding that self-control is more important than 
ambiguity means that the administrator need 
not be loath to introduce needed changes into 
the organization because of the need to main- 
tain harmony, etc. Similarly, there is no need 
to demand rigid role prescriptions, as de- 
manded by traditional organization theory, 
because of their stabilizing nature. The key, 
apparently, seems to be to Increase self-confi- 
dence since, by so doing, we can increase satis- 
faction overall and, at the same time, reduce 
fear of the unknown and the change. By so 
doing, changes in organization may be intro- 
duced and the result will not be dissatisfaction 
with its frequent concomitants of turnover and 
withdrawal. a}. ee 

Perhaps an equally significant implication 1s 
raised by the lack of significance overall in 
Studies 1 and 3 for ambiguity as a cause of dis- 
satisfaction in general. This, too, suggests to 
the administrator that he need not be afraid to 
either accept or introduce rule and regulation 
relaxation. We can only conjecture here as to 
possible reasons, put one possibility may be due 
to the activation level of the individual in- 
volved (Fiske & Maddi, 1961). Thus, the 
campuses of both the Study-1 and Study-3 
groups had been involved in ferment for so long 


TABLE 1 


MEAN SCORES ON SATISFACTION As A FUNCTION OF 
ENVIRONMENTAL AMBIGUITY AND SELF-CONTROL 


Group Z |SD| 2N 
Study 1 
High ambiguity & high 
self-control* 10.45 | 2.6 | .79 
Low ambiguity & high 
self-control? 10.91 | 2.4 
High ambiguity & low 
self-control® 9.16 | 2.8 | .01 
Low ambiguity & low y 
self-control 9,20 | 3.0 » 
Study 2 7 
High ambiguity & high S 
self-control® 9.84| 2.3) 1.35 
Low ambiguity & high fe 
self-control! 10.64 | 2.2 š 
High ambiguity & low i 
self-controlé 8.19 | 2.2 | 2.55% pi 
Low ambiguity & low 4 
self-control 9,90 | 2.0 Re 
Study 3 Bi 
High ambiguity & high 4 
self-control! 11:70 | 2.3 | .73 © 
Low ambiguity & high > 
self-control! 10.98 | 2.4 ý 
High ambiguity & low 
self-control! 8.94 | 2.7 
Low ambiguity & low 
self-control* 9.17 | 2.1 


an = 64. en = 26. 


by = 58. hn = 16. 
on =59. tn = 27. 
dn 230. Jn = 18. 
en = 17. kn = 28, 
ty = 31. *p = 01. 


and changes had become so commonplace that a 
perhaps they had become accepted as part of ; 
the landscape, even by the low self-control | 
people. Hence, there was no sense getting very — 
excited or unhappy about such ambiguity and — 
lack of clarity. The implications for the admin- a 
istration here, provided our explanation is al 
valid one, are very similar to the implications 
we drew from the Study 2 findings, in that in — 
both cases, change has become increasingly 
accepted and people are not afraid of it, 
whether such acceptance comes from lack of 
fear of difference or from generalized adapta- 
tion to a normal state of affairs (such normality | 
being increasingly dynamic). 

As support for this interpretation of our 
findings, we may consider some evidence sup- 
porting the construct validity of our scale. We 
have said that the schools involved in Studies 


f 1 and 3 had been involved in considerable 
change and unrest, whereas the school in: n y 
2 had not and, in fact, have built our tapana 
tory hypotheses around such differences in 
order to explain our results. Hence, it 1s she 
portive of the construct validity of our scale 
and our line of reasoning that the students ine 
volved in Studies 1 and 3 had signifcantly 
higher overall scores on perceptions of env a4 
mental ambiguity than those from Study 2. 
(The means and standard deviations for 
Studies 1, 2, and 3, respectively, were 67.8 and 
9.8, 53.2 and 7.9, and 65.5 and 8.6. Studies 1 
and 3 are significantly different from 2, but not 
from each other.) i eae, 

There is, however, a question which might be 
raised regarding our argument. Essentially, we 
are suggesting a homeostatic Interpretation of 
our findings in that we are proposing that situa- 
tions of high ambiguity with its implied high 
reliance on people rather than specified rules 
and regulations are not dissatisfying if they 
are in balance and congruent with one’s per- 
ceptions and beliefs of the world. Thus, such 
perceptions and beliefs might be composed of 
perceptions of the typical nature of things 
around one (i.e., Studies 1 and 3), or one’s 
beliefs concerning the nature of people (i.e., 
Study 2), with the strength of cach effect 
perhaps varying according to the specific situ- 
ation. Thus, in the cases we have studied here, 
the degree of variability in the environment in 
general in Studies 1 and 3 was strong enough 
so that tolerance for ambiguity developed for 
the general sample overall and it was not 
necessary to change the self-control aspects of 
the environment in order to increase tolerance 
for ambiguity. (Although increasing self-con- 
trol would have increased satisfaction as a 
general effect in all universities, this was not 
a matter under specific investigation here.) On 
the other hand, in the Study 2 situation, where 
the environment was in general more stable, 
increasing self-control would seem to be neces- 
sary to increase tolerance and acceptance of 
ambiguity. 

Given this interpretation, the 
tion, then, is if people develop 
ambiguity in either of these t 


major ques- 
acceptance of 
wo ways, why 
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don’t they actually like them even more than 
the stable stituations, as a strict homeostatic 
prediction might imply, rather than there being 
no differences? One answer to this is that suc 
acceptance and satisfaction is probably a = 
velopmental process over time, with the ee 
of preference for ambiguity as opposed to $ z 
bility being a function of the amount of ne 
and experience one has had in self-control an 
variability situations. The longer one has been 
in such situations and the more one has learned 
that this is the normal state of affairs, the more 
we would predict such differential preference 
for variability as opposed to stability. Hoy f 
ever, in this case, we were not dealing Y 
substantive individual differences in belie 
Systems concerning the nature of the world 
which are solidly based on one’s life experi- 
ences. Rather, we were dealing with contem- 
porary environmental influences, influences 
which in many cases might have been dia- 
metrically in opposition to a person’s character- 
istic belief structure. Hence, it would seem 8 
be legitimate to consider the results found here 
to be reasonably supportive of our interpreta- 
tion. , 

To summarize, then, the findings of this anes 
search would seem to point to the conclusion 
that justification of rules and regulations as 
being more Satisfying, due to their fulfilling 
needs for order and predictability, may be 
overstated. Apparently, ambiguity and dime 
are not dissatisfying if the individual’s belie 
system is first altered so that he sees such char- 
acteristics as being consistent with the nature 
of the world as he knows it. Finally, such cogni- 
tive change may be introduced by the adminis- 
trator, but it may also occur as a function of 
factors beyond his control. 
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ergraduate Ss worked on structured and unstructured tasks under 


Sixty male und 

stressful or nonstressful conditions in groups of four. A confederate of E, who was 
one of the members of each four-person group, adopted either a democratic oF 
authoritarian leadership style. Measures of performance, time to completion, an 
satisfaction were used in separate analyses employing & Lindquist Type VI analysis 
of variance. The Ss under stress performed best under authoritarian leadership, 
whereas, those not under stress performed best under democratic leadership. The 
Ss under authoritarian leadership conditions took significantly longer to complete 
their tasks. Tn addition, Ss performed better, worked faster, and most enjoyed the 


structured task. 


of Lewin, dependence on external parental help rather 
tudies have than rely on his own ego resources. Hamblin 
if- (1958) provides some support for this theory 
in that his experimental Ss who were subjecte 

in a review of research in th to a crisis situation in the laboratory were more 
that: “The evidence available fails to demon- subject to influence than controls. Mulder 
strate either authoritarian or democratic leader- (1963) found similar results in a study involv- 

A igher ing independent grocers approximately hal 


ship is consistently associated with 
most situations, however, of whom were told that a supermarket woul 


productivity. In ‘4 
leadership is associated w be opening near them. The: grocers in the high- 
threat condition apparently showed a tendency 


Since the pioneering research 
Lippitt, and White (1939) many s 
further investigated the consequences of 


ferent styles of Jeadership Anderson (1963), 


democratic les r= i higher 
morale LP- 1607.” Bass 1960) derive a set 0 r 
sag : to seek strong leadership- Polis (1964) studied 


iti e nee o directive 

ropositions about the need for more direc i ) 

een reater accept- Australian grade school children and foun 
types of leader- 


leadership under stress @ r 1 
ability of structure under these circumstances. that Ss who chose democratic p 

Earlier, Fleishman, arris, and Burtt (1955) ship during peace time chose omnipotent forms 

had found that as th s for pro- of leadership during an imaginary Crisis 
duction became more demanding, the rated situation. n 

partments These results can also be explained by Mc- 

Jated with leadership Ginnies (1970), who adopts & functional ana- 

ch and advocates the view that 


tion and higher jn structure. lytic approa 
demanding time “a crisis establishes @ deprivation state for a 


the reverse pattern tended to hold. group, 5° that the consequences of effective 
based on psychoanalytic tenets leadership will be more reinforcing than other- 
vested by Dev x (1955) and Alexander wise. The circumstances are roughly analogous 
sug “ie that when an S is confronted to those of depriving an experimental anima 
he would return to of food so that food becomes @ more effective 


schedules, 
A theory 


to James Brewer, peas Korten (1962) developed a model concerned 
aan a yker Wig with some of the situational factors that deter- 
in partial fulfillment mine group preferences for authoritarian OT 
nts of erst of rimena aieo, democratic leadership. He suggests that author- 

i %”  itarian leaders maintain their power in highly 
stressful, highly structured goal situations, 
whereas, democratic leadership is sought under 
less stressful, poorly structured situations. 


s a 
ing of the manuscript. 
reprints should be sent to Leonard L 


partment of Psychology, Montgomery 


College, oc 
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Fiedler, Meuwese, and Oonk (1961) and 
Fiedler (1962) studied leadership behavior n a 
variety of groups working on ara a =< 
Under a pleasant group atmosphere ne = 
found a positive correlation between $ e 
leader’s esteem for Least Preferred Coworker 
(LPC) score and productivity. Under un- 
pleasant group atmosphere, or when groups 
were heterogeneous and stress resulted, the 
relationship between LPC score and produc- 
tivity was negative. Fiedler (1964) developed 
a contingency model that predicts that the 
effectiveness of leadership style depends on 
the extent to which the group’s task and 
organization facilitates or inhibits the leader’s 
ability to exert influence. A low-LPC leader 
would function best in a negative group atmo- 
sphere and on unstructured tasks. He also 
notes that a low-LPC leader would be effective 
if the task were especially structured. In the 
intermediate situation, however, the high-LPC 
leader would be most effective. In a study of 
officers and recruits at a Belgian naval training 
center Fiedler (1966) provides some confirma- 
tion of the curvilinear relations predicted by 
the theory. 

Consequently, it was hypothesized that, 
under stress, groups under authoritarian leader- 
ship would perform better and be more 
satisfied, whereas, democratic leadership groups 
would perform better and be more satisfied 
under the nonstress condition. It was further 
anticipated that groups under authoritarian 
leadership would be more satisfied and perform 
better on the structured task 
cratic groups would show a preference (in 


satisfaction and performance) for the moder- 
ately unstructured tasks, 


, Whereas, demo- 


Subjects 


Sixty male undergraduate introductory psychology 
students at the University of Maryland served as Ss 
for this study. The Psychology Department required 
that all students in the introductory course earn Te- 
search credits by participating as Ss in research projects. 
The Ss who volunteered to participate in this study 
were randomly assigned to 20 experimental groups. 
Each group consisted of three naïve Ss plus a con- 
federate of £. 


Procedure 


Under either stressful or nonstressf 


ul conditions each 
group of Ss solved two structured or 


mechanical tasks 
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and two unstructured or problem-solving tasks una 
both autocratic and democratic leadership en <a 
The independent variables in this study there ae e. 
type of leadership, amount of stress, and ty se per- 
The dependent variables examined were: FERR 
formance, group satisfaction, and time to so! T E 
Leadership manipulation. A confederate of A cois 
pated with each group of three naïve Ss. anad 
federate was an undergraduate enrolled in a iy 
experimental social psychology course at the ae tie 
of Maryland. He arrived at the laboratory: A He 
scheduled hour, as did each of the three naiv E, 
represented himself as a student in the a ai 
psychology course and maintained this decenii epi 
all data for this study were collected. The con eH et 
was trained in the leadership behaviors appropri and 
each of the leadership styles identified by mie E 
Lippitt (1938). Autocratic leaders attempte! e 
determine the group’s policy, (b) go only one s “| ee 
time when giving directions so that the group i 2; ai 
certain as to what to do next, (c) structure n S 
group member would do and with whom, (d) pa (e) 
and praise individual members of the group, & PEE 
remain aloof from the group. On the other hand, d de- 
cratic leaders attempted to: (a) have the gron P ade 
termine their work policies, (b) explain and ene 
perspective of work procedures, (c) allow group pete 
bers to choose between alternative procedures, (d) mee} 
cize and praise the group as a whole, and (e) beco 
group member in spirit. , PAE S 
It is interesting to note that ratings of liking | iaren 
confederate obtained on a questionnaire adminis py 
after each task was completed did not differ ae te 
cantly (£= .73) from authoritarian and demper t 
condition: is questionnaire also asked each i ie 
to rate his satisfaction with his performance an 
group's performance on the just-completed task. truc- 
Task manipulation. Two unstructured and two § nee 
tured tasks were utilized in this research, The pa 
in which these tasks were performed under the ditte ae 
leadership conditions was counterbalanced to minin 
order effects, Aal 
One structured task required S to place in mO ias 
order a deck of 45 randomly arranged cards; to & a 
betize 45 cards, each with a nonsense syllable on it; sa 
then to paper clip one number card and one nonse 


syllable card to each other, Finally, S was to sort the 


resulting stack of cards into odd- and even-numberri 
piles. The second Structured task required S to ae 
six sketches of each of six stimulus objects (¢-8+ # ‘the 
to crayon these with Prescribed colors, and to a per 
36 objects out and paste them on six sheets of Pal 


4 t each 
such that six different objects appeared on 
composite. jassify 
e ass!) 
An unstructured task involved asking Ss to cl 


ela- 
18 Sentences from Bales’ (1952) Standard Human m 
tons Problem, according to Bales’ 12 categorie 


. Z structure! 
interaction process analysis. The other unstruc 


z logisms 
task required S to determine whether 18 syllog 
were true or false, 


A maximum of 10 min. 
of each of these tasks. 1° 
might work on each tas! 
informed that the experi 


ation 
was allowed for ae a 
urther, Ss were tol They were 


k in any sequence. concerned 


meee 
iment was primarily 


GROUP 


with the productivit y of the group, but cach S was en- 


couraged to work quickly and accurately- 
Stress manipulation. At the outset of the experimental 


session, a group Was exposed to either the high- or low- 
stress manipulation. T he Ss in the high-stress condition 


were told that the tasks they would be asked to com- 
plete were from an established test and that each 


group’s performance would be compared to performance 
of other groups at the college. ‘They were also told that 
on all materials 


S should indicate his name and class 
so that a permanent record of his 
and a copy could be provided y 
instructor. 
The Ss in the low-stress condition were told that 
E just designed the tasks and hoped they were in- 
teresting. These Ss were also told that individual re- 
sponses were not important to E, only group produc- 
tivity was. Consequently, it was not necessary tOr 
to sign or identify himself on any of the various answer 
sheets. 
As an additional check on whether 
s manipulations used in 
employed with 42 undergraduate psy- 
York Institute of Tech- 
in groups of three. 
high-stress 
ater stress 


chology 

nology. The students worked 
Twenty-one students were exposed to the 
manipulation. They reported significantly gre 
(t = 2.26; P < .05) than did 
who were exposed 
Stress was measured by responses 
naire consisting 
tension, anxiety, 
perienced becaus' 
the extent to which they agree 


item, using seven st 
\ RESULTS 
erformance, satisfaction, and 


re obtained for all expert- 
each of 


Measures of p 


time to solution We 
group responses for 


mental SS- 0 
the dependen were obtained from 
rime for each 


means were 
in each of the following 
953) Type VI 
a three-way 
two of the 


from two groups of 
f leadership 2M 
y represent repeated 
measures p. the: Same Ss. Consequently, a 

a pe VI analysis of variance was 
measures 


LEADERSHIP STYLE, STRESS, 


AND TYPE OF TASK vs 


TABLE 1 


ANALYSES OF VARIANCE OF Mean GROUP 
PERFORMANCE Scores, TIME TO TASK 
COMPLETION, AND SATISFACTION 

R 


ATINGS 
Performance Time to task Satisfaction 
score completion rating 
Source dj \ 
MS) F 
19 | .01 
11.00] -15 55 
18 | 01 
Within-Ss 60 | .02 
Task (A) 1| .64 67.06" 6.69* 
1|.00| -35 „31 
1.01] 1 146 
1| 06) 443 OL 
1|.o1| 29 09 
1| 00) -t3 28 
54 | .01 
18 | .01 
18 | .01 
Errors (w. 18 | .02 
Total 79 | .02 
*p <05 
++p < -001 


of group performance, group satisfaction, and 


time to solution.” 


Performance Index 


The measure of performance used in this 
the ratio of the number of correct 
ade by Ss to the total number of 
ible. The possible range 
1.00. The higher 


the index, the performance. 
Table 1 presents the results of the analysis of 


variance which was performed on these data. 


"The main effect for type 
001 level. Examination of the means 


2 reveals that the overall 
ce on the unstructured task 
mean performance On 
ed task was .90. Generally, Ss per- 


the structur 
ter on the structured 


formed substantially bet 
task. 

In addition, 
a significant 


this analysis of variance reveals 
(p< 05) interaction between 
leadership style and stress. The cell means 
depicting this interaction are also presented 
in Table 2. Under stress, groups with author- 
itarian Jeadership had a mean performance of 
„83, whereas, democratic leadership groups had 
a mean performance score of .80. This relation- 
eee 


k 3'The computer time for this project was supported 
in full through the facilities of the Computer Science 
Center of the University of Maryland. 
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TABLE 2 


MEAN GROUP PERFORMANCE Scores, Time TO Comp. 


LETION, AND SATISFACTION RATINGS 


Stress Nonstress Combined stress 

i re 

Authori- | Demo- | Com- | Authori-| Demo- | Com- Authori- | Demo- oo 

toss $ tarian cratic bined tarian cratic bined tarian cratic elere 

leader- | leader- | leader- | leader- leader- | leader- leader- leader- stip 

ship ship ship ship ship ship ship ship 

Group X performance index 

= | a we l m 74 .72 

Jnstructured 72 72 72 -68 -76 Red 70 a ‘00 

sy, 93 88 .91 .85 .92 .89 .89 -90 81 

Combined 83 80 81 76 84 80 .80 82 d 

Group X time to completion 

5 5 

pote | Ber.) owa | ows | ons | ga SM | Ste | oos j 988 

Structured 9.31 8.85 9.08 8.63 8.38 8.50 8.97 8.61 9.09 

Combined 9.49 9.24 9.36 9.21 8.42 8.82 9.39 8.79 i 

Group X satisfaction score 

awe 5 

Unstructured 15.80 16.43 16.12 15.30 15.07 15.18 15.55 15.75 fe 

Structured 17.33 17.63 17.48 16.83 L277 17.30 17.08 17.70 652 

Combined 16.57 | 17.03 | 1680 | i607 16.42 | 16.24 | 1632 | 16.73 | 16.52 


ups not exposed to 
For these groups, 
ted in a mean per- 
» autocratic leader- 


mean performance 
d, Ss in the 


pletion data, 
1, resulted in Statistically 


eadership style and type 


Examination of T, 
the average, were a 
tured task significan 


able 2 reveals that Ss, on 
ble to Complete the struc- 
Uy faster (8.83 min.) than 


the unstructured task (9.35 min.). The A 
performed better and faster on the structur 
task; it was a arently easier. i 
Furthermore. ni democratic leadershiP 
conditions Ss needed an average of 8.79 koe 
to complete each task, whereas, under aut ime 
itarian leadership the significantly longer S 
of 9.39 min. was necessary to complete ce 
task. In the previous analysis, pE 
under democratic conditions was sligh 
better than under authoritarian conditions, 
however, this effect was not significant 3 
appears then that the authoritarian B oe 
did substantially increase the amount of in 
needed to Complete the task but did not in 


rk 
fere dramatically with the quality of wo 
performed, 


Satisfaction 
Three 7-point Likert-type rating aero 
signed to measure S satisfaction with per inis- 
“nce on the experimental tasks were “or 
tered after each task was completed. 1 e 0 
Satisfaction scores for each S for each ean 
task were Computed. In turn, these ! 


Group LEADERSHIP STYLE, Srress, AND Tyre or Task 


S-satisfaction scores were used to calculate 
group mean satisfaction scores to each type 
of task under each of the leadership conditions. 
The possible range of these group mean satis- 
faction scores was from 3 to 21. The larger the 
satisfaction score obtained, the greater the 
satisfaction. These scores represent the data 
submitted to the Lindquist Type VI analysis 
of variance. The results of that analysis also 
appear in Table 1. Surprisingly, virtually no 
difference in satisfaction under the different 
leadership styles is evident. The main effect for 
type of task was significant at the .05 level. 
The mean S satisfaction with the structured 
task was 17.39, whereas, for the unstructured 
task it was 15.65—significantly less. Apparently 
then, Ss performed better, worked faster, and 
most enjoyed the structured task. 


DrscussION 


Although there have been conflicting reports 
in the literature about the relative merits of 
the different leadership styles on productivity, 
our results support the various approaches that 
argue that leadership effectiveness is situa- 
tionally determined. Under stress, the present 
authors found greater productivity for auto- 
cratically led groups; whereas, in the absence 
of stress, democratic leadership was more 
productive. : 

In addition, the authors hypothesized that 
satisfaction ratings should parallel the pro- 
ductivity ratings obtained. This hypothesis was 
not confirmed. Previous evidence has, as 


Anderson (1963) indicates, consistently demon- 


strated an association of higher morale with 
our study 


democratic leadership. The Ss in 
would normally be expected to prefer demo- 
cratic leadership. Probably significant m this 
regard is the especially poor performance ob- 
tained from those Ss who were autocratically 
led but were not in the stress condition. 
McGinnies (1970) argues that effective leader- 
ship during crises is reinforcing. Perhaps the 
greater satisfaction generated for autocratically 
led groups under stress, as hypothesized by 
in combination with the greater 


McGinnies, 1 i 
satisfaction of democratically led groups, re- 
gardless of which stress condition, combined to 


eliminate the anticipated differences. 
Real differences existed between the types 
of tasks employed in this study. The task 


S41 


variable, however, did not interact with leader- 
ship style or stress. The tasks identified as un- 
structured probably were not perceived as such 
by Ss. To college students, these tasks may 
have represented an intellectual sorting task 
not sufficiently dissimilar to the structured 
mechanical sorting tasks employed here. 

Shaw (1962) outlined four aspects for classi- 
fying group tasks, namely: (a) decision veri- 
fiability, (b) goal clarity, (c) goal path multi- 
plicity, and (d) solution specificity. When our 
tasks are evaluated along these dimensions, 
it appears that all tasks, in fact, were quite 
structured and differed primarily in the extent 
to which they relied on manual or verbal _ 
solutions. The better performance and shorter 
work time found for the structured or me- 
chanical tasks reflected intrinsic differences 
between the two sets of tasks. The greater 
satisfaction obtained to the structured task 
is probably a reflection of the generalization 
of reinforcement effects resulting from doing 
better on the structured tasks. The finding of 
better performance under autocratic leader- 
ship across all tasks partially confirms our 
initial hypothesis if it is agreed that both sets 
of tasks are rather structured. 
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College of Business Administration, University of Iowa 
. Interrelationships of favorable self-perception as measured by piographical data, 
r perceived supervisory style of the boss, and job satisfaction of 128 administrators 
È and professional employees of a state department of public instruction were com- 
pared using & two-factor analysis of variance design- The Ss with a igh level © 


els of job satisfaction, although the effect was 


moderated by favorable self-perception: Results are discusse! 

relationship between expectations, experience, and attitudes. 

_ There is evidence that within the work situa- plore the relationship of the individual’s back- 

tion, the supervisory style of the boss plays an ground of experience and expectations to his 
i erception of supervisory support an job 


important role in providing 
for the individual to experience satisfaction of satisfaction. In this regard, it was initially 
personal needs. Likert (1961, 1967), for assumed that an individual’s background of 
example, describes & supervisory y experience and appraisal of abilities wow 
gement, combine to form a measure of his self-percep- 
reportedly leads to higher Jevels ation of the types © 
faction and performance t ther, COn- treatment the individual would expect 10 the 
trasting styles- This style is g i guperior-subordinate relationship. Specifically, 
by the “princi th high self- 
ithin an organiza- perception would experience Jess job satis- 
tion all relationships and interactions must be faction from & given supervisory style than 
indivi “1) view these EX- those with low self-perception. 


il 
th). METHOD 
1. The Ss were administra- 
tors and srofessional employees of a state department 
jon, Survey forms consisting of a cover 
ymity, general information 
stionnaires were com- 


f the individual as deter- Subjects and data collection 


ence form an integral Pat r 
“Tf experience alls short of expectations un- questions, and two separate que 
favorable attitudes occur. Where our experi- pleted by 121 Ss during 2 bimonthly meeting of the 
ence 1s better than our expectations we ten department. ‘Additional survey forms and self-addressed 
to have favorable attitudes [Likert, 1961, PP- envelopes were left at the department with the under- 
12 Ry standing that the assistant superintendents would dis- 
3 á Je si ilar to tribute the forms to those absent from the meeting. 
ory style simi In this manner an additional seven surveys were com- 
y Likert (1961) was found by pleted for a net al of 128 Ss. Of these, 21 had been with 
evels of job the department less than 1 yr., 62 from 1 to 5 yr, and 
5 yr. All Ss except eight were male. 


ee | ec- the remaining 45 over 5y 

.ceaction among managers OF < arge © a $ : S 

m A $ did not however jn- Questionnaires. Job satisfaction and supervisory style 
: ne of the boss were measured with a modification © the 


tronics Comp ny ME aiff 
al di erences in Management Attitude Questionnaire developed by 


stations have OF this relationship. The Myers (1966). The revised form, entitled Administra- 
y was designed to replicate Myers tive Attitude Questionnaire (AAQ), Was constructe 
with only minor word changes from the Management 


ina jssimilar organization and to ex- 4 £ } p 

Attitude Questionnaire. The resulting Q6-item ¢ 
consisted of 10 multiple-choice, 62 rating-tyP® 7 
24 forced-choice items. Principal component analysis 


ould be sent to Duane E. 
and factor matching by means O the salient variable 
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Thompso”, sollege of Business Administration, Uni- 
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similarity index (Cattell & Baggaley, 1960) indicated 
that the factor structures of the two forms were 
sufficiently similar to justify the use of subscales 
identified by Myers. y 
e differences information regarding Ss’ ex- 
pectations, backgrounds, and experiences was EEEF 
by means of a specially constructed group of 41 mul- 
tiple-choice items entitled Biographical Information 
Blank (BIB). Items were selected from those pre- 
viously used by Morrison (1962), Gilmer (1963), and 
Thomson (1964). The criteria for inclusion of items 
were: (a) items should load on Morrison’s scale of 
Favorable Self-Perception, or on the Corresponding 
scale identified by Gilmer; and (b) other items from the 
three studies should be used only if they appeared 
to bear a relationship to the problem at hand. 
Preliminary component analysis of the BIB data 
yielded three factors, one of which was found by the 
salient variable similarity index to be statistically re- 
lated to Morrison’s Favorable Self-Perception scale. The 
other two factors could not be given a meaningful in- 
terpretation (Thompson, 1969). 

Specific AAQ and BIB subscales used in this re- 
search were: 


AAQ 


Job Satisfaction: Items with high loadings on this 
subscale (previously labeled Motivation on the Job by 
Myers) dealt with feelings of utilizing potential, being 
highly stimulated, the opportunity for growth, self- 
esteem, and achievement on the job. 

Supervisory Style of the Boss: This subscale was 
comprised of items that “describe the boss? style in 
terms of his ability to stimulate enthusiasm, the level 
and consistency of his expectations, his recognition of 
performance, his accessibility and willingness to listen 
to new ideas, his practices related to dispensing com- 
pany information, his attitude toward risk taking and 
mistakes, and his sensitivity to the feelings of others 
[Myers, 1966, p. 59).” 


BIB 


F Favorable Self-Perception: Items loading on this 
subscale dealt with level of income, type of work pre- 
ferred, assessment of potential, level of education at- 
tained, value gained from education, and membership 
in professional organizations 


TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR THE 


AAQ anD BIB SUBSCALES 
Subscale £ SD 
Supervisory Style of the Boss 9.14 3.50 
Job Satisfaction 13.04 3.81 
Favorable Self-Perception 8.81 1 24 
Note.—Abbreviations: AAO = Administ, ii 
ueationuaire, ‘BIB "= Biographical omni tion E Attitude 


formation Blank, 
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TABLE 2 


Cross CLASSIFICATION OF SS ON THE Basis oF 
Supervisory STYLE OF THE Boss AND 
FAVORABLE SELF-PERCEPTION 


Supervisory Style of the Boss 


nse Total 
Perception Sup- Tra- Nonsup- 
portive | ditional portive 
High 7 14 22 be 
Medium 18 10 5 42 
Low 17 19 6 is 
Total 42 43 43 


Note.—N = 128, 


Procedure. Subscale scores for the two AAQ subscales 
and the BIB subscale were computed by asigne 
value of 1 to the most favorable response ORAR ie 
each item and 0 to the least favorable option, The n the 
assigned intermediate responses depended pes iy an 
number of options. If Ss had failed to respon Means 
item, the mean value of the item was assigned. ted 
and standard deviations for the scales thus compu 
are presented in Table 1. i 

The relationship between Favorable Selt Pareep ht 
Supervisory Style of the Boss, and Job Satis a ssis 
subscale scores was tested with a 3 X 3 factorial a on 
of variance. The Ss were divided into three group the 
the basis of their score on the Supervisory Style 0 the 
Boss subscale to form column classifications coco 
design. The top one-third were considered a8 ven? that 
to supervisors perceived as “supportive, Mt scale 
favorable responses to the items of the presen oe) 
described “supportive” as defined by Likert ane 
The middle third were considered as reporting to d as 
perceived as “tradition 


al” a style characteriza as 
avoiding either extreme, and the bottom one-thit ‘i 
Teporting to bosses perceived as “nonsuppori Ya y 
Row classifications were similarly devised by pe 
Ss on the basis of their Favorable Self-Percep™”,, 
subscale scores, Groups were simply labeled as oe 
“medium,” and “low” to designate the third © tion 
distribution in which their score fell. Job Satisfac ua 
subscale scores were the criterion measures. ye 
cell frequencies associated with this method of alysis 
fication were handled by treating the univariate an? lysis 
of variance as a special case of a multivariate e rs 
of variance procedure that permitted unequ® 
frequencies (Clyde, Cramer, & Sherrin, 1969). 


Resutrs i 
The initial cross classification of Ss wre 
Subscale scores for Supervisory Style = 6- 
Boss and Favorable Self-Perception are 
picted in Table 2. the 
As may be noted in this table, only 7 ne 
43 Ss in the high Favorable Self-PerceP -ig 
Subgroup perceived their boss as display 


SELF-PERCEPTION, SUPERVISORY STYLE, AND JOB Sawiseac.-__ 


what has been labeled as a supportive style of 
Supervision, whereas, 22 of this same group 
perceived their boss as exhibiting a nonsuppor- 
tive style of supervision. This is in distinct 
Contrast with the 42 Ss in the low Favorable 
Self-Perception group. Of these, 17 perceived 
their boss as supportive, but only 6 perceived 
his behavior as nonsupportive. The statistical 
Significance of this relationship is born out 
by an r of —.30 (p < .01) between Supervisory 
Style of the Boss and Favorable Self-Percep- 
tion subscale scores. Job Satisfaction subscale 
Means corresponding to the data of Table 2 
are presented in Table 3. 

_ As indicated in Table 3, mean Job Satisfac- 
tion subscale scores decreased as Favorable 
Self-Perception increased and increased as 
Supervisory Style of the Boss became sup- 
portive. Highest Job Satisfaction cell means 
were associated with a supportive style of 
supervision for those classified medium or low 
on Favorable Self-Perception and with a tra- 
ditional style of supervision for those classified 
as low in Favorable Self-Perception. Low Job 
Satisfaction cell means were associated with 
all levels of Favorable Self-Perception for Ss 
perceiving their boss as nonsupportive. — 

Results of the two-factor analysis of variance 
performed with these data are presented in 
Table 4. From these results it was concluded 
that Supervisory Style of the Boss and Favor- 
able Self-Perception subscale scores jointly 
acted in the determination of Job Satisfaction 


subscale scores. 
Discussion 


By and large, Myers’ (1966) findings that a 
supportive style of supervision was associated 
with higher levels of job satisfaction have been 
supported by this study conducted in a dis- 
similar organization and environment. In 
addition, Favorable Self-Perception, as mea- 
sured by personal history data, was found to 
act on this relationship in two ways. First, 
those scoring high on Favorable Self-Percep- 
tion were less likely to perceive the behavior 
of their supervisor as supportive than those 
scoring lower on the F. ‘avorable Self-Perception 
scale; and second, it was found that those 

h on the Favorable Self-Percep- 


coring hig 
a aele generally scored lower on Job 


Satisfaction. 


TABLE 3 


AVERAGE JOB SATISFACTION SUBSCALE SCORES FOR 
Ss CLASSIFIED ON THE BASIS OF SUPERVISORY 
STYLE OF THE Boss AND FAVORABLE 
SELF-PERCEPTION 


Supervisory Style of the Boss 


Favorable Row 
Self- T 

Perception Sup- Tra- | Nonsup- 
portive | ditional | portive 

High 13.5 12.9 10.2 | 11.6 

Medium 15.3 12.0 10.8 12.9 

Low _| 147 16.0 10.0 | 14.6 

Column X 14.8 14.0 10.4 13.0 


Note.—N = 128. 


The first question to be considered is how 
one can account for these results. The observed 
relationship between Favorable Self-Percep- 
tion, Supervisory Style of the Boss, and Job 
Satisfaction subscale scores may be cast in 
terms of the theoretical framework of Likert 
(1961) if the following assumptions are made: 
(a) Job Satisfaction subscale scores are a 
measure of the subordinate’s attitudes; (b) 
Supervisory Style of the Boss subscale scores 
form an index of the subordinate’s perception 
of at least one category of experience in the 
organization; and (c) Favorable Self-Percep- 
tion subscale scores may be interpreted as an 
indication of the expectations the subordinate 
has of his supervisor (a reasonable assumption 
based on the relationship with perception of 
supervisory style). 

In this context, it would appear that for 
those perceiving the Supervisory Style of the 


TABLE 4 


Suamrary oF Two-Factor ANALYSIS OF VARIANCE: 
SUPERVISORY STYLE OF THE Boss AND FAVORABLE 
SELF-PERCEPTION WITH JOB SATISFACTION 
AS CRITERIA 


Source df | MS F 
Supervisory Style of the Boss 2 | 167.4 | 16.2** 
(Columns) 
Favorable Self-Perception 2| 94.8] 9.2* 
(Rows) 
Rows X Columns 4| 249| 24* 
Within 119| 10.4 
Note.—N = 128. 
*p < 053. 


++p <.001. 
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Boss as nonsupportive, this experience was 
less than their expectation, hence, the uni- 
formly low Job Satisfaction subscale mean. 
On the other hand, the perception of a tradi- 
tional supervisory style appeared to have been 
sufficient to exceed the expectations of those 
classified as low in terms of Favorable Self- 
Perception, whereas, a supportive style was 
required to exceed the expectations of those 
with a medium level of Favorable Self-Per- 
ception. As indicated by the Job Satisfaction 
mean for those classified as low Favorable Self- 
Perception and reporting to a boss perceived 
as supportive, there was no increase in atti- 
tudes once expectations were exceeded. 

Thus far, it has been assumed that Super- 
visory Style of the Boss and Favorable Self- 
Perception subscale scores are causative vari- 
ables. Vroom (1964, pp. 112-113), however, 
has suggested an alternative explanation and 
expressed the need for joint studies to clarify 
the relationship. Specifically, he suggested 
that should job satisfaction be found to be 
jointly influenced by supervisory style and a 
personal variable, then more credence could 
be given to the claim that a supportive style 
causes changes in job satisfaction, as opposed 
to the alternative explanation that satisfied 
subordinates elicit supportive behavior from 
their supervisor. Since Job Satisfaction sub- 
scale scores were found to vary jointly with 
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Supervisory Style of the Boss and Favorable 
Self-Perception subscale scores, there is evi- 
dence in support of the assumption of causation. 
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INFLUENCE OF RACE ON THE MANIFESTATION 


JAMES R. FENELON? AND 
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Inventory 
task in which one had to 
following pairs of college women : (a) high- 
and low-Do blacks—Group 2; 
and (d) high-Do blac! 


assumption by the high-Do white women paired with the low-Do black 
lower than the rate of leadership assumption by 
s. Analysis of the decision-making process suggested that 


Group 3 was significantly 
Do Ss in the other group: 
this stemmed from the reluctance of the 
over the low-Do black gir 
low-Do black girls when pa! 


tests assess 


In applied settings, the usefulness of a 
the first study, 


psychological test is often judged by how well 
it can predict future behavior (Meehl, 1959). 
However, most of the research literature on 
such tests is based on studies of their con- 
current rather than their predictive validity 
(Myers, 1950). For example, the California 
Psychological Inventory Dominance 
(Do) scale is used in secondary schools, 
colleges, and industry to forecast who is most 
likely to rise to a position of leadership and 
authority (Gough, 1968, 1969a, 1969b). How- 
ever, most of the evidence for its validity 
validity studies that have 


work together 


other to follow. 
high-Do Ss (56%) 
difference 
was repeated 


rests on concurrent A f f the 
shown that individuals who are already In & importance © 
position of Jeadership—student leaders, mili- change, he leader role 
tary officers, business executives, and the assumed tht — 
like—obtain significantly higher scores hay pe thems 
nonleaders (Bogard, 4960; Carson & Parker, ence, tee 
Ad ence W 
1966; Gough, 1969a, 1969b; Johnson & also n eed 
Frandsen, 1962; Knapp 1960; Rawls & eu h r 
Rawls 1968) ? 2 program of research 
’ : — + atte i jona ti 
A series of studies on the predictive validity tie e oh al faci 
of the Do scale has illustrated one apike bean peer hig 
. s.r tar fri cho d 
problems in predicting pehavior from psyent : 
predicting s a function CPI Do scale. 


logical test scores: while behavior i 
of both personality jonal factors) The fest of 
2 inve: 


roles 


1 This article is based on & master’s thesis 


to the Psychology Department, University of Te% he 
areni in August 1966 by J: R. Fenelon nder men paired with low- 
irecti E a < 5 i 
ection of E. I. Megargee Air Force high-DO women paired 


2 Now at David Grant Hospital, Travis 


Base, California. sai 
2 Requests for reprints should be sent to Edwin . 
Megargee, Department of Psychology, Florida State 


University, Tallahassee, Florida 32306. 


Group 2; (c) b 
women, Group 
paired with 
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that was n 
with the instructions changed 


so as to stress leadership and 
leader’s role. 


of the 20 high-Do Ss 


stigated the influence 0 
on the assumption o 
Do Ss. Four groups were forme 


igh-Do men paired 
3; and (d) 
low-Do me 


OF LEADERSHIP t 


EDWIN I. MEGARGEE* 
Florida State University 


whether racial differences influence the manifestation of dominance 
(Do) was investigated by pairing black and white women who were 
Do scale and having the! 
lead and the other follow. Groups were constituted of the 
and low-Do whites—Group 
(c) high-Do white and low-Do blacks—Group 3; 
ks and low-Do whites—Group 4. The rate of leadership 


high and low 


m interact in a 


1; (b) high- 


women in 
the high- 


: high-Do white girls to assume leadership 
Js coupled with increased assertiveness on the part of the 


only the personality variables. In 
Megargee, Bogart, and Ander- 
son (1966) paired men with high-Do scores 
with men with low-Do scores and had them 
on a simulated industrial task. 
Although no specific mention of leadership or 
role differentiation was made in the instruc- 
tions, it seemed clear to the investigators that 
the task required one person 
Despite this, 
assumed the leader role, a 


to lead and the 
only 14 of the 25 


ot significant. The study 


emphasize the 
Given this 
tested (90%) 
(p< 001). 


uctions could make such a differ- 
d likely that other factors could 
hether or not high-Do people 

in various situations. A 
was undertaken to identify 
ors that facilitated and 
the assumption of leadership by 
h in dominance as measured by the 


these studies (Megargee, 1969) 


f traditional sex 
f leadership by high- 
d: (a) high-Do 


high-Do women 
n, Group 4, It was 
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expected that the high-Do partner would 
usually assume the leader role in Groups 1, 2, 
and 3, where cultural sex role prescriptions did 
not interfere with the manifestation of domi- 
nance. In Group 4, however, it was predicted 
that the high-Do women who were paired with 
low-Do men would assume Jeasership sig- 
nificantly less often because of the tradition of 
masculine dominance in American society. This 
proved to be the case. In two studies using two 
quite different tasks, no more than 25% of the 
high-Do women paired with low-Do men 
assumed leadership while in the other groups 
the high-Do partner assumed leadership 62%- 
90% of the time. 

The present study further explores how 
cultural role expectations may influence the 
prediction of leadership using the Do scale. 
Whereas Megargee (1969) studied what hap- 
pens when high- and low-Do men and women 
are paired and asked to assume leader or 
follower roles, this study applies the same 
paradigm to two other groups that have 
traditionally occupied dominant and sub- 
missive roles relative to each other in American 
society: whites and blacks. It was expected 
that in the homogeneous racial groups 
(i.e., high-Do white-low-Do white and high-Do 
black-low-Do black) the high-Do partners 
would usually assume the leader role. The 
question was what would happen in the bi- 
racial groups. Would the dominance of white 
over black common in the South result in a 
lower rate of leadership assumption by the 
high-Do blacks? There was reason to believe 
that it might. Megargee’s (1969) study had 
shown that conflict with stereotyped cultural 
roles inhibited the expression of dominance by 
high-Do white women. Prior research on the 
interaction of blacks and whites had shown 
that Negro Ss were inhibited and complained 
and made more errors when working with 
whites (Burstein & McRae, 1962; Katz & 
Benjamin, 1960; Katz & Cohen, 1962; Katz, 
Goldston, & Benjamin, 1958; Katz & 
Greenbaum, 1963; Katz, Roberts, & Robinson, 
1965). While these studies did not deal with 
emergent leadership per se, they did suggest 
a pattern of passivity that would lead one to 


expect that high-Do black Ss would be less 


likely to lead low-Do white partners, Similarly 
? 


it seemed likely that a low-Do white S might 


behave more assertively when paired with a 
high-Do Negro than when paired with a toe 
partner because following a Negro pe 
might well be threatening to southern wi! 
Ta the other hand, no particular conflict 
between personality and cultural role expec- 
tation was anticipated in the high-Do white- 
low-Do black situation; hence it was a? 
pated that the high-Do white Ss wou c 
usually lead the low-Do black Ss. 


METHOD 


s s 

Subjects. The S selection was a major problem Hi ho? 
investigation because of the relatively small nun vs Hy: 
Negro students enrolled at the University of Text rce 
Austin at that time. This meant that the primary T 
of Ss, the introductory psychology classes, Cou‘. the 
be used; instead it was necessary to teepu OS WE 
integrated cooperative housing units at the U ie aan 
The effect of this was threefold: (a) it dictated reg the 
of women rather than men as Ss because most = did 
blacks on campus were female, and since the me al 
not have to live in University-approved quarters naa 
the women; (b) it meant that the sample of W ae 
consisted of those willing to live in integrated w aie 
(c) as a result of the recruiting and the pease be 
probability of a white student “just happening i. 
paired with a black student on the basis of omme as 
likely that most Ss surmised that racial attitudes 
well as leadership were under investigation. > which 

A 113-item test labeled the Gough Inventory (Com- 
consisted of all the items from the CPI Do, rm 4 was 
munality), and Gi (Good Impression) SC%'® dents 
administered to 72 black and 119 white women a 
at the University of Texas at Austin. The a the 
median Do scale T score was 53; Ss at oF ape s low 
median were classified as high Do; those below = h Do 
Do. Pairs were then formed; in each pair oe T at o 
S’s T score was at least 20 points higher than th 
his low-Do partner. Four groups were formed WI 


wer 
Task and apparatus. The task and appara he task 
identical to those used by Megargee (1969 rath 
consisted of a simulated clerical task with ae 
leader and follower roles in which the impor that this 
leader’s role was stressed. Megargee ar use Wi 
task, which seemed more appropriate in masculine 
female Ss, gave results identical to the MOT’ © “Bogart 
simulated industrial task devised by Megats° 
and Anderson (1966). i reen in the 
The setting consisted of a table with & chair W35 
middle and chairs on either side. The lead? fi 


it 

tofi 
; ieee p fron 

an executive-type swivel chair with arms- 
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rested a leather-covered loose-leaf binder containing 

the material to be dictated. A sign on the screen above 

the binder read “leader's side.” The follower’s chair 

was a straight-backed wooden chair. She had no binder 

for her papers and on her side of the screen a sign read 
follower’s side.” 

The leader’s notebook contained a page from the 
Stroop Color-Word Test (Stroop, 1935) which had the 
names of four colors, each printed in ink of a different 
color such as the word “red” printed in blue ink. The 
follower was supplied with a mimeographed form on 


which the initial letters of the four color names were 


printed, once for each word on the leader's sheet. 
e testing room 


Procedure. Each pair of Ss was led to th 
where the following instructions were read with the 
italicized words emphasized: 


‘This is a study of the relation between the Gough 
Inventory and leadership under stress. One important 
aspect of leadership is the ability to concentrate, 
remain calm, and accurately i 
subordinate. T his is what is required by 
The leader will have to absorb information and 
rapidly pass it on to the follower. 
The information to be transmi 
the ink on a series of words: is i 
On the table is a sheet of paper on w. 
f four colors: blue, 


nt colors of ink—blue, greens orange, and 
inti o color names. 


is printed in blue 
r is to tell the follower the 
rinted word. Since the 


red— 


ink. The job of the leade: 

color ink used, ignoring the p! 

first word is printed i 

call “plue”; si 

orange ink, sh 
(The E moves around table.) 


eps side of the table. The follower 
must record the information given her by the leader. 


The follower’s sheet of paper has the letters B, G, 
inted, once for each word on the leader’s 


O, and R prin í 

alls out the colors of the ink, 
the follower must record the information by crossing 
out the letter corresponding to the color, B for blue, 


G for green, 


This is the follow 


leader's sample 
therefore the leader sho 
follower should cross out the letter 2- 


give more examples. 


who will be 


take your sea 


at the leader or followers sides © 
. When the leader is sure the fol 


Hower is ready t9 


table: 
sas! e formation, she should call p pa 
ing out the 
over the sample pase, and start ca ng 
j gifferent-colored inks. When the leader 


vy 


calls out “start,” I shall begin timing you; the leader 
will have 90 sec. to transmit as much information as 
possible. 

Any questions? 

OK. Decide who will be leader and who will be 
follower and take your positions. 


the investigators were not interested in 


Of course, 
the accuracy Or speed of dictation but simply in which 


S ended up in the leader’s chair and the process whereby 
this decision was reached. The purpose of the task was 
simply to provide a setting in which the high-Do S 
could manifest her dominance by assuming the leader 
role if she so chose. á 

After reading the instructions, E turned on a tape 
recorder which recorded the discussions that preceded 
the choice of leader. Later E transcribed these records, 
noting any additional nonverbal behavior such as an 
simply sitting down in one position Or shrugging when: 
asked her opinion. 


RESULTS 


Decision time. It had been expected that 
intra- and possibly interpersonal conflict 
would be engendered by the biracial situation. 
Because of this it was predicted that the 
amount of time taken for the partners to 
arrive at a decision would be significantly 
longer in the biracial than in the uniracial 
groups. This proved to be the case; the mean 
decision-making time in the piracial groups 
was 9.8 sec. which proved to be significantly 
longer than the uniracial groups’ mean of 5.8 
sec. (F = 4.65, df = 1, 56, p< 105). 

Assumption of leader role. The number of 
high-Do Ss assuming the leader and follower 
roles in each of the four groups is presented in 
Table 1. In both of the two uniracial groups, 
80% of the high-Do Ss assumed the leader 
role. This was consistent with the expectation 
that a majority of the high-Do Ss in the uni- 
racial groups would assume leadership, and it 
replicated Megargee’s (1969) finding that 
75% of his high-Do white women Ss assumed 
the leader role when paired with low-Do white 


women SS. 
In the biracial groups, it had been expected 


that most of the high-Do white women m 
Group 3 would become leaders. This did not 
prove to be the case since only 33% of these 
women ended up in the leader’s chair. In 
Group 4, a low rate of leadership assumption 
had been expected on the part of the high-Do 
black women who were paired with low-Do 
whites. This, too, proved to be ın error; 13% 
of the high-Do black women assumed the 
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TABLE 1 


‘UMBER OF HicH-Do Ss ASSUMING LEADER AND 
FOLLOWER ROLES IN EACH GROUP 


Group 
1 2 3 4 
Role ` à A ih 
High | (High | (High | (Big 
ote ae black- white- | black- 
Low Low Low Low 
white) black) black) white) 
di z 
y 2 | 12 s | un 
% 80 80 33 73 
plover y % D 5 
% 20 20 67 27 
N 15 15 15 15 
Note.—x? = 10.2, p < .02. 
leader role. These differences in the rate of 
leadership assumption in the four groups were 


statistically significant (x? = 10.2; p < .02). 

Decision-making process. Finally the records 
of the verbal and nonverbal interactions during 
the decision-making process were analyzed. 
While the cell entries were generally too small 
to allow for meaningful statistical tests, it was 
hoped that this analysis would provide fruit- 
ful hypotheses to help explain the results. The 
methods of analysis followed those described 
by Megargee (1969). It was first noted which 
S made the final decision; this information is 
recorded in the first row of Table 2. Next, the 
behavior of the partner who made the decision 
was studied to determine whether he appointed 
himself or his partner leader, These data are 
presented in the second and third rows of 
Table 2. The behavior of those who allowed 
their partner to decide was also examined and 
a record made of those who actively said in 
effect, “You decide” and those who simply 
said nothing. The proportion of high- and 
low-Do Ss who said nothing is presented in 
the fourth row of Table 2. 


Discussion 


The first objective of the present program 
of research was to explore how situational 
factors can interfere with the prediction of 
behavior on the basis of personality test scores, 
Megargee, Bogart, and Anderson (1966) 


demonstrated that the degree to which the 
instruction emphasizes the relevance of leader- 
ship to the task can be of crucial importance 
in determining whether or not a high-Do S 
assumes the leader role. The next , study 
(Megargee, 1969) showed that while high-Do 
Ss will generally lead low-Do Ss, this is not 
true when it involves a conflict with the cul- 
tural role expectation that males should domi- 
nate females. . 
The present study extended these findings 
by showing that racial differences between the 
partners can also significantly interfere with 
predictions based on test scores alone. More- 
over, as has often occurred in these studies, 
the effect was not that which had been ex- 
pected. It was not the high-Do black women 
in Group 4 but the high-Do white women in 
Group 3 who manifested a low rate of leader- 


ship assumption when paired with a partner 
of a different race, 


TABLE 2 


ANALYSIS OF DECISION-MAKING PROCESS 


Group 
No. Ss a 
il 2|) 3 | 4 
Making decision 
High Do 
N 7| 8| 6| 6 
% 47 | 53 | 40 | 40 
Low Do 
N 8| 7] oj 9 
% 53 | 47 | 60 | 60 
Appointing self leader 
High Do 
; ‘lel ela 
% 71 ó 
Low Do ws ° 
N 1) 1| 4] 2 
Jo 12 | 14 2 
Appointing partner leader 
High Do 
y el ahel e 
40 
Low Do vil es a 
N 
N Ti e| s 7 
Déferrin b i S e os 
g bysayin; i 
High Do peeing 
N 4 
% 100 86 56 Be 
Low Do 
N 3 
v AE 4 
% 100 | ss | 67| 5° 
P S Aine S 
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What happened in these biracial groups? 
As in any experiment, there are a number of 
possible ex post facto explanations. However, 
close scrutiny of the data in Table 2 suggests 
that the most plausible explanation is that 
the high-Do white Ss in Group 3 were re- 
luctant to assume the leader role vis-a-vis 
their low-Do Negro partners. Moreover, the 
low-Do black women in Group 3 tended to be 
more assertive than the low-Do Ss in the other 
groups. Let us examine the empirical basis for 
these speculations. 

The first question that was asked was 
whether the four groups differed in respect to 
whether the high- or low-Do partner made the 
decision. The first row in Table 2 shows that 
no such differences were found. Overall, the 
final decision was made by the low-Do partner 
55% of the time. The individual groups did 
not differ significantly from this average. 

Turning to the behavior of Ss who made the 
decisions, some interesting differences can be 
found. In this experimental situation, when 
the high-Do partner makes the decision, he 
typically appoints himself leader (Megargee, 
1969). This was the pattern in both the uni- 
racial groups and in the high-Do black-low-Do 
white group. However, while 2 or more of the 
high-Do Ss in Groups 1, 2, and 4 appointed 
themselves leaders when they made the de- 
cision, none of the six high-Do white decision- 
makers in Group 3 appointed herself leader. 
Instead, all six of these girls told their low-Do 
black partner to lead. The presence of a black 
partner not only did not increase the asser- 
tiveness of the high-Do white girls, but it 
instead made them more submissive, at least 
overtly. 

This is only part of the story. The other 
part is the behavior of the low-Do partners. 
In most cases the low-Do partner made the 
decision. The typical scenario was for the 
high-Do partner to remain silent and look 
quizzically at her low-Do partner. When the 
decision was thus thrust on her, the low-Do 
partner typically responded by saying, “You 
be the leader.” Not so in Group 3. In Group 
3, 9 of the 15 high-Do white partners deferred 
the decision to their low-Do black partners. 
However, instead of remaining silent, a ploy 
which apparently put some pressure on the 
low-Do partner to nominate the high-Do 
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44% of the high-Do white 
girls who deferred the decision actively en- 
couraged their black partners to decide by 
saying, “Go ahead, you decide” or some such 
statement. Given this encouragement 44% 
of the low-Do black girls in Group 3 appointed 
themselves leader when making the decision, 
while only 12% to 22% of the low-Do partners 
appointed themselves leaders in the other 
groups. There is no way of determining 
whether the greater assertiveness of the low-Do 
black girls in Group 3 was because the high-Do 
white girls actively encouraged them to make 
the decision or whether the biracial situation 
simply elicited more assertive behavior. 

The pattern of the data suggests that in 
Group 3 the assumption of leadership was less 
important to the high-Do white girls than was 
the demonstration of egalitarian attitudes. It 
is questionable how generalizable this finding 
is. In the first place, the situation was @ 
temporary one. If the high-Do white girls had 
been faced with the prospect of having to 
occupy a subordinate position for 6 mo. or a 
year, one wonders if they would have been as 
eager to adopt the follower role. Second, it 
was probably obvious to the girls in the biracial 
groups that this was an experiment on racial 
attitudes as well as leadership. This probably 
made the expression of tolerance by the white 
girls or “black power” by the black girls at 
least as salient as the manifestation of domi- 
nance, Moreover, the method of recruiting Ss 
in the cooperative dormitories ensured that 
the white girls selected were drawn from the 
more liberal elements of the university com- 
munity. By the same token, the fact that there 
were so few blacks enrolled in the university 
at that time suggests that it was the unusual 
black girl, one with greater fortitude and 
emotional stamina, who was able to matricu- 
late successfully. The investigators had, in a 
sense, been expecting the high-Do black girls 
to “play Uncle Tom.” However, by coming to 
UNIV EESILY they had renounced the “Uncle 
eye eee ie 
enee he ne e themselves in the 
Cite giris who ar er Instead, it was the 

opted the submissive role 


partner as leader, 


+TIt may also be noted that i ¢ 

ree: hat in Group 4, 50% of the 
white girls who deferred to their black partners actively 
encouraged their partners to decide. 


=) wea ae 


58 


is-A-vis their black partners, so that it was 
Miss Anne” who played “Uncle Tom.” 

Further research is, of course, necessary to 
est the validity of some of these speculations. 
Jne such study has already been carried out 
yy Rubinroit (1970). Rubinroit set out to 
Jetermine whether the results of the present 
investigation could be attributed to the fact 
that the study was conducted in a university 
setting. Rubinroit went to a lower class high 
school in which approximately 15% of the 
students were white, 25% black, and 60% 
Mexican-American. In this setting there would 
be no reason for members of biracial groups to 
suspect that racial attitudes were being in- 
vestigated. After studying all nine combi- 
nations (high-Do whites paired with low-Do 
whites, low-Do blacks paired with low-Do 
Mexican-Americans, high-Do blacks paired 
with low-Do whites, low-Do blacks, etc.) it 
became apparent that in this thoroughly inte- 
grated setting, differences in dominance as 
assessed by the CPI Do scale were the primary 
predictors of leadership. 
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INFORMATION SEEKI 


NG IN A NATURAL STRESS SITUATION ' 


DAVID T. A. VERNON? 


University of Colorado 


Hospitalized tuberculosis patients 


physicians were compared with respect to their use of b 
differ in the proportion of general book readers. However, 
f prior laboratory experiments on information choice and 


The two groups did not 
in line with the findings o 
information seeking in stress situations 


tended to seek out books about tuberculosis and its treatment more 


under the care of informative and uninformative 


ooks in the patient library. 


the patients of uninformative physicians 
frequently than 


did the patients of informative physicians. 


The findings of several recent studies sug- 
gest: (a) that persons in potentially stressful 
situations in which injury or pain are threat- 
ened generally choose to have information 
about the nature and timing of a stress stimu- 
lus when a choice is offered to them, (b) that 
they actively seek such information if it is not 
offered and, (c) that they profit from obtaining 
it. These tendencies appear to be true even 
when the information does not seem to be 
useful in reducing the magnitude of the stress 
stimulus in an objective way. 

Data on the first of these issues are available 
from the laboratory experiments of Lockhard 
(1963); Perkins, Levis, and Seymann (1963); 
and Lanzetta and Driscoll (1966). Lanzetta 
and Driscoll found that persons prefer to have 
information about the incidence of an un- 
avoidable shock when given a choice. Simi- 
larly, both Lockhard and Perkins et al. found 
that rats preferred that cage compartment in 
which a signal light preceded the advent of 
shock over the compartment without a signal 
light or the compartment in which the light 


followed the shock. 
With respect to the closely related second 


issue—that persons in potentially stressful 


j 3 à g i hen it 
situations actively seek information w 
y available. The 


is not offered—less data are 


1 The author wishes to thank Veterans’ Administra- 
tion Hospital, Hines, Illinois for its Tae i 
support during the conduct of the research. The Ge 
article is an elaboration of a segment (Vernon, d i 
Appendix G, pp- 122-123) of research conducte = 
Partial fulfillment of requirements for the m an T 
at the University of Chicago. Tt was carried out uni a 
the supervision of F. L. Strodtbeck, University © 


Chicago 
d id T. A. 
? Requests for reprints should be sent a ott Fe 


Vernon, Department of Psychology, 
Colorado, Boulder, Colorado 80302. 


field observations of Roth (1963) describe how 
tuberculosis patients on occasion go to con- 
siderable lengths to get information about 
their condition and their treatment. Pervin 
(1963) and, to a somewhat lesser degree, 
Haggard (1943) found that Ss in laboratory 
experiments which involved unavoidable elec- 
tric shock actively tried to determine the 
signals or pattern of signals (lights or tachisto- 
scopically presented words) which cued the 
shocks. 

Experimental data on the third issue—on 
the utility of such information for modifying 
the psychological impact of the stress stimuli, 
not their objective magnitude—appear to be 
reasonably consistent. In research in a closely 
related area, Carlsmith and Aronson (1963) 
found that a disagreeably bitter solution was 
rated as tasting more bitter when it was not 
expected. Pervin’s (1963) data indicated a 
similar phenomenon—that is, persons rated a 
predictable electric shock as being preferable 
and less anxiety provoking than an unpredict- 
able shock. Haggard (1943) found that Ss in 
his experiment who developed “‘cognitive 
clarity” (who were able to identify the specific 
words which cued an electric shock) were able 
to tolerate an electric shock better than persons 
for whom the situation remained ambiguous. 
The former showed generally lower galvanic 
skin responses, more accurate estimates of 
session length, and less disintegration of verbal- 
motor coordination. The data of Lanzetta and 
Driscoll (1966), while agreeing with the gen- 
eral trend of the above studies, must be con- 
sidered less conclusive since supporting inter- 
pretations could be reached only from internal 


analyses and not from the main treatment 
group comparisons. 
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The field research of Janis (1958) with 

rgery patients, although purely correlational 

nature, also tends to support the proposition 
at information of the above sort is generally 
neficial, as do numerous problem-oriented 
Jd experiments on the effectiveness of pre- 
uring children and adults for medical pro- 
dures (e.g, Dumas, Anderson, & Leonard, 
965; Egbert, Battit, Welch, & Bartlett, 1964; 
nd the research reviewed by Vernon, Foley, 
ipowicz, & Schulman, 1965). However, char- 
cteristically these latter studies have been 
lesigned in such a way that the effects of 
nformation could not be separated from the 
fects of other aspects of the experimental 
preparation routines. 

In addition to reducing the impact of po- 
tentially stressful stimuli (e.g., the perceived 
painfulness of an electric shock or medical 
procedure as described above), information 
may also modify the degree of response to the 
threat of such stimuli. This is evident in the 
studies of Lazarus and Alfert (1964) and 
Folkins, Lawson, Opton, and Lazarus (1968), 
both of which provide data suggesting that 
responses to a threatening movie may be modi- 
fied by preparatory information. In the second 
study this interpretation is based on the fact 
that the two most effective treatments in- 
volved a rather complete description of the 
film in addition to procedures designed to 
encourage particular components of desensi- 
tization therapy. The work of Nomikos, Opton, 
Averill, and Lazarus (1968) ostensibly pro- 
vides evidence that information leads to in- 
creased rather than decreased fear. However, 
as far as the present issue is concerned, these 
findings must be considered very carefully, as 
the authors recognized. The average length of 
time between information and the period of 
maximum threat was only 12 sec. in this 
study. 


It was the purpose of the present work to 
investigate the second of the above notions— 
that is, that pertaining to information seeking 
—in a natural field setting, the tuberculosis 
service of a large veterans’ administration 
general hospital. 

In the light of the above findings it seemed 
reasonable that tuberculosis patients under 
the care of relatively uninformative physicians 


would be more inclined to actively seek in- 
formation about the disease and treatment 
from secondary sources than would patients 
under the care of informative physicians. 

The appropriateness of this expectation for 
the present setting was further suggested by 
numerous descriptive emphases on the im- 
portance of information to the emotional well- 
being of tuberculosis patients and by surveys 
which have demonstrated the phenomenologi- 
cal importance of information to persons in 
hospitals (Skipper, 1965; Stewart, 1956). The 
findings of these two studies also suggest that 
patients (quite reasonably) regard their doctors 
as the preferred source of information about 
their disease and treatment. 


METHOD 
High and Low Information Wards 


On the basis of statements about staff doctors made 
by five patients, four registered nurses, and a clinical 
psychologist who worked on the tuberculosis service, 
six of the seven wards of the tuberculosis service were 
classified as either high information wards or low in- 
formation wards. At the time of the study, one ward, 
in serving as a discharge ward, fulfilled a special func- 
tion for the tuberculosis service and was excluded from 
the study. The staff doctors in charge of the three 
high information wards were generally characterized as 
being relatively free in giving information about 
tuberculosis and treatment to their patients. They were 
described as being attentive to their patients’ questions, 
willing to stay after hours to explain test results and 
Kent procedures to their patients, etc. In contrast, 
me _ doctors in charge of the three low information 
wards were described as being relatively uninformative, 
resentful of patients’ requests for information, difficult 
for patients to contact. : 
ery es eee among the 10 respondents 

q Algh. he classification of the six physicians 
was identical in seven cases; each of the remaining 
three (two patients and a nurse) agreed with the con- 
sensus picture in 4 out of 6 of their judgments. 


i Eee of the classification was also supported 
tested on sev : sample of 100 new patients who were 
cases 1 mo pia questions about their own particular 
(Vernon, cop m n to the tuberculosis service 

atient’s di à = information (e.g., whether the 
Pp isease was unilateral or bilateral) was of the 


type that could not bi i 
EE srived from 
secondary sources of infi a Me og 


c ormation; it could be judged 
pen wrong only from each patient’s medical cords: 

M the average, the 44 patients on the high information 
wards had significantly more information (total correct 
wrong) than did the 56 patients on the low information 
wards (2.22 vs. 1.09, ¢ = 3.25, .01 > p > .001)- 
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TABLE 1 
DATA ON THE READERSHIP OF TUBERCULOSIS Books AND GENERAL 
Books in High anp Low INFORMATION WARDS 
Low information wards High information wards 
Books z 
A B c Total D E F Total 
Tuberculosis 
No. readers 22 21 17 60 9 3 12 24 
No. potential readers | 242 205 254 701 169 181 157 507 
Proportion 09 10 07 09 05 02 08 05 
General 
No. readers 18 12 9 39 14 6 16 36 
No. potential readers | 242 205 254 701 169 181 157 | 507 
Proportion .07 06 .04 .06 .08 03 10 | 07 


Use of Secondary Sources of Information 


Thirteen books in the tuberculosis service library 
were designated as potential sources of information 
for tuberculosis patients. These included all autobio- 
graphical accounts by former patients and all books 
written specifically for tuberculosis patients by experts. 
In addition to the 13 tuberculosis books, 30 general 
books (both fiction and nonfiction) were selected 
randomly. From information which was routinely re- 
corded on the circulation cards it was possible to 
determine the number of tuberculosis book readers and 
general book readers on each of the six wards over a 
preceding 2-yr. period during which the medical 
supervision and organization of the wards was stable. 

Comparison of the 1-yr. totals for the six wards 
indicated that the 2-yr. totals were relatively reliable 
measures. The correlation between the two 1-yr. totals, 
corrected by the Spearman-Brown formula, was .86 
for the tuberculosis books and .87 for the general 
books. The measure also possessed the virtue of being 
nonreactive (Campbell, 1957). 

Tt was assumed throughout that the proportion of 
patients who were curious about, or were capable of 
becoming curious about, tuberculosis and treatment 
was equivalent from ward to ward at the time of their 
he hospital. This assumption seemed 
plausible in light of the fact that tae pee were 
assigned to wards on the basis of available bec space, 
a method which seemed unlikely to bias comparisons 


between different types of wards. 


admission to t 


RESULTS 


Table 1 summarizes the essential data of 
the study—the number of tuberculosis book 
readers and the number of general book readers 
on each of the six wards for the 2-yr- period, 
along with the corresponding proportions. 
Data on potential readers were abstracted 
from routine hospital census reports. 

The proportions of tuberculosis book readers 
listed in Table 1 are all quite low, the highest 


being .10. Relatively few patients read books 
about tuberculosis and treatment. 

The extent to which the patients in the two 
types of wards differed in the degree to which 
they read books about tuberculosis was tested 
by comparing high and low information wards 
on: (a) the mean proportion of tuberculosis 
book readers, .05 vs -09, and (b) the mean 
proportion of general book readers, .07 vs .06. 
Statistical tests indicated that the two groups 
differed significantly with respect to tuberculo- 
sis book readers (Z = 2.11, p = 034, two- 


tailed test) but not general book readers 
(Z = — 05, p = -96, two-tailed test).® 
DISCUSSION 


The findings cited above are in general 
agreement with the findings of closely related 
laboratory experiments and anecdotal studies 
on information seeking (Haggard, 1943; Pervin, 
1963; Roth, 1963) and with the implications of 
laboratory experiments on information choice 
in potentially stressful situations (Lanzetta & 
Driscoll, 1966; Lockhard, 1963; Perkins et al., 
1963). 

These studies, in conjunction with the 
studies of whether or not information modifies 
the impact of a potentially stressful stimulus 


The tests used here were those appropriate to 
clustered samples (Scott & Wertheimer, 1962, p. 231). 
Alternative ways of analyzing the data—for example, 
considering the six wards to be the Ss and the difference 
between the number of tuberculosis readers and the 
number of general readers to be scores—lead to differ- 
ences at least as significant by statistical criteria as 
those described here. 
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(Carlsmith & Aronson, 1963; Haggard, 1943; 
Janis, 1958; Pervin, 1963; and others), empha- 
size the importance of information in stress 
situations and create a consistent, intelligible 
picture. F 

Information about the nature and timing of 
a potentially stressful stimulus is important 
to most persons who are in a stress situation 
because, in some undefined way, it increases 
their ability to withstand its impact. For this 
reason persons generally will choose to have 
such information if it is readily available and 
will actively seek it if it is not. A natural 
stress situation such as a hospital is complex 
in that it contains a variety of potential 
sources of information which are probably 
differentially desirable to patients. If informa- 
tion is readily available from preferred sources 
(e.g. physicians) the patients will use it; if 
it is not, they will turn to less preferred 
sources (e.g, books about the disease and 
treatment, and perhaps nurses, aides, other 
patients, etc.). 

It should be emphasized that the present 
findings are from a natural, field experiment 
and as such are subject to alternative inter- 
pretations which place less emphasis on the 
role of information per se. For example, it is 
possible that other aspects of the physician’s 
personalities or treatment philosophies may 
have been crucial here. Differences among 
physicians in strictness, in sympathy, or in 
friendliness toward patients may have in- 
fluenced the patients’ general anxiety and 
morale. This, in turn, may have fostered dif- 
ferences in the readership of tuberculosis books 
which were independent of the effects of 
physician informativeness. While such inter- 
pretations cannot be eliminated in the present 
study, they seem less compatible with the re- 
sults of prior laboratory studies than that 
offered above which emphasizes information. 

The fact that the use of library books about 
tuberculosis was not high in general (only 
about 7% of all patients were tuberculosis 
book readers) was not surprising in retrospect 
and was not taken as an indication of the lack 
of importance of information to these patients 
in general. The absolute level of this statistic 
was assumed to be influenced by the patients’ 
use of other sources, both preferred and less 
preferred or secondary. In addition, several 
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facts about the setting suggested that books 
might be a relatively unpopular secondary 
source of information. The preponderance of 
patients entering the tuberculosis service were 
of low socioeconomic status. Many had little 
formal education, were probably poor readers, 
and perhaps avoided books which they felt 
were difficult to understand. In addition, 
many were elderly and had poor vision with 
or without glasses. Such factors would cer- 
tainly reduce the overall extent to which such 
books were used. With respect to the general 
books, the low proportions presented are 
probably artifacts. The sample of 30 general 
books selected represented only A% of the 
total number of general books in the library- 
A larger sample of general books would have 
increased this proportion considerably. 

Finally, we should emphasize what appeat 
to be the rather clear methodological implica- 
tions of this study for field experiments on the 
effect of information, psychological prepara- 
tion, etc. on human responses to stress. These 
studies have been numerous in the pasts 
particularly in the area of human responses 
to medical procedures. In the future, investi- 
gators attempting to give some persons in a 
natural stress situation (e.g., some hospital 
patients) special information about an im- 
pending stress should take pains either to 
control the use of other sources of information, 
or, alternatively, to assess the effects of these 
outside sources on the experimental and con- 
trol groups. The data presented here suggest 
that the effects of these outside sources cannot 
be assumed to be equivalent among such 
groups. 
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DENTIFICATION OF MODERATOR VARIABLES BY DISCRIMINANT 
ANALYSIS IN A MULTIPREDICTABLE GROUP 
VALIDATION MODEL! 
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Bowling Green State University 


A multivariate discriminant analysis procedure was used to identify moderator vari- 
ables in a multipredictable group validation (MPGV) model. Results from MPGV 


also were compare! 
ence techniques. S 
(N = 209 in the experimental group; 
investigated as potential moderators o! 
the criterion (grade point average, 
ent from each other. In addit 
to the compositi 


d to those of the univariate simple algebraic and absolute differ- 
eventeen variables, determined for 418 undergraduate students 
N = 209 in the cross-validation group), were 
f the relationship between the predictor and 
). The three techniques revealed moderators differ- 
ion, number and types of moderators varied according 
on of the prediction groups in MPGV. Results are discussed with 


respect to problems with moderator variable techniques. 


In an effort to increase predictive efficiency, 
investigators have modified the classic pre- 
diction model. In particular, Dunnette’s (1963) 
model gives considerable attention to inter- 
actions which may occur between several 
factors—predictors, individuals, behaviors in 
the situation, and consequences of these 
behaviors relative to the goals of the group. In 
Dunnette’s model, it is necessary to identify 
relatively homogeneous subsets of predictors, 
individuals, behaviors, and situations in an 
examination of the interactions between pre- 
dictors and criteria. 


The basic approach to the identification of 
homogeneous sets of individuals has involved 
the identification and use of moderator vari- 
ables. Banas (1964) defined a moderator 
variable as a quantitative or qualitative vari- 
able which improves the usefulness of a 
predictor by isolating subgroups of individuals 
for whom a predictor or set of regression 
weights is especially appropriate. The original 
concept of a moderator variable referred to a 
multivariate curvilinear regression equation 
using cross products (moderator and predictor) 
in which the beta weights, instead of being 
constant, are linear functions of the moderator 
variables (Saunders, 1955, 1956). The basic 
notion of moderator variables also is implied in 


1 The author wishes to express his thanks to Patricia 
Cain Smith, C. J. Cranny, Carol A. Vale, and Michael 
Doherty for their guidance throughout this research. 

2 Requests for reprints should be sent to the author 
who is now at the Department of Psychology, Uni- 
versity of California, Berkeley, California 94720. 
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subgrouping variables (Frederiksen & Melville, 
1954), population control variables (Gaylord & 
Carroll, 1948), predictability variables 
(Ghiselli, 1956, 1960b), referrent variables 


(Toops, 1959), and modifier variables (Grooms 
& Endler, 1960). 


A common procedure used to identify 


moderator variables is Ghiselli’s (1956, 1960b) 
technique which involves deviations from the 
regression line. Correlates of absolute dif- 
ference scores (/D/), between standardized 
criterion scores (Z) and standardized pre- 
dictor scores (Z,), are used to identify rela- 


tively predictable (low /D/ scores) from 
unpredictable individuals. 

Another technique, quadrant analysis 
(Hobert & Dunnette, 1967), revealed items 
which distinguished between overpredicted 
(individuals whose Zp scores are below their 
Ze scores) and predicted (Z, and Z, scores are 
approximately equal), and between under- 
predicted (individuals whose Z, scores are 
below their Ze scores) and predicted indi- 
viduals, Identifying differences among un- 
predictable groups provides enhancement of 
the understanding of the psychological com- 
position of groups. However, recent comments 
by Abrahams (1969) and McNemar (1969); 
indicate limitations in the quadrant analysis 


method as used by Hober tte 
(1967). y Hobert and Dunne 


One purpose of this article is to examine the 
results obtained from a combination of the 
two aforementioned approaches. A multi- 
predictable group validation (MPGV) i” 
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volves deviations from the regression line, 
algebraic differences, and identification of pre- 
dicted, underpredicted, and overpredicted indi- 
viduals. Prediction groups are formed by 
taking the algebraic difference (4D) between 
an individual’s Ze score and Zp score, +D = Ze 
= Loo CHE underpredicted group is composed of 
individuals with D greater than some a priori 
value, for example, > + 1, the overpredicted 
group with D < — 1, and the predicted group 
with —1< DSF 4. Size of the predicted 
region and number of levels of underpredicted 
and overpredicted groups could vary depend- 
ing on the situation. 

A second purpose of this research is to 
examine a technique which would identify 
moderator variables. The method of trying 
out specific hypotheses is slow and expensive 
because of the tremendous number of possi- 
bilities that exist. For the same reasons, the 
development of moderator scales through 
item analyses is no more satisfactory. There are 
no statistics or objective methods that mech- 
anically identify moderators. Furthermore, not 
enough evidence has accumulated to make it 
possible to state any general principles about 
the nature of items or traits that function as 
moderators. 

French (1961) attempted to develop a pro- 
gram for the selection of moderator variables 
from a large number of variables. The program 
was designed to identify moderators from 42 

otential moderator variables without taking 
time to compute actual correlations. Indexes, 
based on the joint distribution, in 3 X 3 con- 
tingency tables, of predictor and criterion 
scores which had been trichotomized, were 
used in place of correlation coefficients. The 
technique was unsuccessful because of dis- 
tortion of the indexes. ee: 

Since moderator variables discriminate be- 
tween those close to the line of regression and 


those farther away ; 
is conceivable that the st 
discriminant analys 1 
that contribute to the discrimination of over- 
predicted, predicted, and underpre l 
viduals in the MPGV model. Hence, a dis- 
criminant analysis procedure is used for the 
systematic identification of moderator vari- 
ables in MPGV- : 

The third purpose of this research is to com- 


pare the results of MPGV with those of simple 
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algebraic and absolute difference techniques. 
To achieve these ends, several variables are 
examined to determine their moderator effects 
on prediction of college grade point average. 


METHOD 


Subjects. The Ss were 500 undergraduate students 
fulfilling a research participation requirement of an 
introductory psychology course. Because of incomplete 
predictor and criterion information, sample size was 
reduced to N = 418. The Ss were randomly assigned 
to the experimental group, Sample A (N = 209), 
to the cross-validation group, Sample B W = 209). 

Criterion. The criterion measure was student’s accum- 
ulative grade point average (GPA) compiled prior to 
the winter 1969 quarter. 

Predictor. The predictor was the composite score 
obtained on the A merican College Test (ACT; American. 
College Testing Program, 1965) administered prior to 
admission to the university. 

Potential moderator variables. Six tests, administered 
to Ss in one session, were examined to determine their 
effects on the relationship between ACT and GPA. The 
tests and subtests are presented below. These tests and 
subtests provided scores for 17 potential moderator 
variables. 


Analysis of Relationships (A-R) (Ghiselli, 1960a) 
Bell Adjustment Inventory (Bell, 1962) 

Emotionality (E) 

Health Adjustment (He) 

Home Adjustment (Ho) 

Hostility-Friendliness (H-F) 

Masculinit, Femininity (M-F) 

Submissiveness (S) 
Culture Fair Intelligence Test (“g”) (Cattell, 1963) 
Mandsley Personality Inventory (Eysenck, 1962) 

Neurotocism (N) 

Extroversion-Introversion (E-1) 
Multiple Affect Adjective Check List 

(Zuckerman & Lubin, 1965) 

Anxiety (A) 

Depression (D) 

Hostility (H) 
Survey of Study Habits and Attitudes (Brown & 

Holtzman, 1965) 

Delay Avoidance (DA) 

Work Method (WM) 

Teacher Approval (T A) 

Education Acceptance (EA) 


(MAACL) 


Procedure 


Multipredictable group validation model. All scores 
(predictor, criterion, and potential moderators) were 
converted to standardized Z scores. Overpredicted, 
predicted, and underpredicted groups were formed based 
on the algebraic difference score, + D = Zora — Zycr- 
Sample A was split into overpredicted, predicted, and 
underpredicted groups in seven different ways, thereby 
providing seven sample problems for analysis. The 
seven problems differed in their a priori interval for 
the prediction group (eg, +-2, +.5, +10, etc.) 


TABLE 1 
PREDICTION GROUP INTERVALS 
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and/or the number of levels of underpredicted or over- 
predicted individuals (ie., for certain problems, the 
underpredicted and overpredicted groups were further 
delineated thereby providing several levels of under- 
predicted or overpredicted individuals). The parameters 
differentiating the problems are presented in Table 1. 

For each problem, a one-way multivariate analysis 
of variance (MANOVA), with the 17 potential modera- 
tors used as the dependent vector random variable, 
was performed in Sample A. A significant prediction 
group main effect, tested by Wilks’ lambda criterion, 
indicates that at least one of the discriminant functions 
(with standardized coefficients) is significant. 

To identify potential moderators within each prob- 
lem, a stepwise application of multivariate analysis 0 
covariance (MANOCOVA) was used. The initial 
MANOCOVA treated the variable associated with the 
highest discriminant function coefficient as the covariate 
and the remaining 16 as dependent. A significant main 
effect indicated that some discriminating power still 
remained among the 16 dependent variables. Each 
succeeding MANOCOVA added the variable with the 
next highest weight to the set of covariates and removed 
it from the dependent set. The procedure was repeated 
until the dependent set no longer produced a significant 
main effect. 

Once the moderators were identified, a new dis- 
criminant function using only those moderators was 
found and a discriminant score for each S was com- 
puted; DS; = Lwyxij, where the discriminant score for 
the individual 7 is DS;, wy is the coefficient of the ge 
variable, and x; is the individual i’s score on the y 
variable. 

_ Classification of individuals in Sample B was estab- 
lished in two ways. First, cutoff discriminant scores 
were obtained for each a priori prediction group in 
Sample A based on England’s (1961) Index of Dif- 
ferentiation which minimizes the number of misclassi- 
fications. Thus, in Classification Method I, an S in 
Sample B was placed in a prediction group based on 
cutoff scores established in Sample A. Tetrachoric 
peter oh coefficients were computed between ob- 
eee group classification of Ss. For 
dichotomized as indicated i ese BOS WS 

The second aati a des ENG section, 
chi-square techni = ee ee 
Goe. The prorsns applet to fs ta Secale E, 
places an individual ee EE A a Sample B; 
deviation of his discrimi © a group depending on the 
scounateandtieceale n score from the appropriate 
Again, tetrachoric e determined in Sample A. 

> correlation coefficients were com- 
pas assess the effectiveness of the moderator in 
ik. groups in the cross- 
ane oa effectiveness of the moderators on 
wel 4 y coefficients of the predictor, ACT; 

te computed for each prediction in Sample B 
for both classification techni group an saro; izēd 
that validity coefficients of ACT E hypotes 
the prediction groups ins SIT woplti be ligia al 
coefficient in Sample A. ampie B then: the: ever 
ip ee sae Each potential moderator 

the absolute difference 
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between the criterion, GPA, and the predictor, ACT. 
If a significant correlation was obtained in Sample A, 
then the sample was split into } (and 3) high scorers 
and 3 (and 3) low scorers on the potential moderator 
variable, and validity coefficients were obtained for 
each fractionated group- Significantly different coeffi- 
cients indicated the existence of a moderator. Results 
were cross-validated in Sample B. 

Algebraic difference model. Each potential moderator 
was correlated with the algebraic difference (+D) 
between the criterion, GPA, and the predictor, ACT. 
If a significant correlation was obtained in Sample A, 
then the sample was trichotomized. Differential 
validity coefficients of the predictor for the subgroups 
indicated the existence of a moderator variable. Results 
were cross-validated in Sample B. 


RESULTS ” 


The validity coefficient of ACT as a pre 
dictor of GPA was 52(p < 01) and 
.51(p < 01) for Sample A (VY = 209) and 
Sample B (V = 209), respectively. 


Multipredictable Group Validation Model 


Assessment of potential moderator variables. 
Seven one-way MANOVAs were computed 
for the problems outlined in Table 1. The 
main effect was prediction group, and the 
levels within the factor varied according to 
the problem. Scores for the 17 potential 
moderator variables constituted the dependent 
vector variable in the analyses. A significant 

< 01 in each case) main effect was found 
in Problems 1-6; Problem 7 did not yield a 
significant main effect and thus was excluded 
from further examination. ; 

For each problem, the first discriminant 
function was analyzed by MANOCOVAS to 
determine which variables were contributing 
significantly to the discrimination. , 

"To confirm the significance of the contribut- 
ing variables, one-way M 
ducted for 
variables treated as the 
variable. Results demonstrated that 
effects were significant (p<! i 
The weights of the new discriminant function 
for each problem are presented in Table 3. 

inaling 


Effectiveness of moderators in discrim : 
prediction groups: Method 1. In 


Classification 
Sample A, a dis 


vector 
all main 


tribution of discriminant scores 


3 Tables of intercorrelations and summary < 
of MANOVA and MANOCOVA can be obtained from 
the author. (See address in Footnote 2.) 
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TABLE 2 


DISCRIMINANT WEIGHTS FOR THE SEVENTEEN 
POTENTIAL MODERATOR VARIABLES 
1x EacH PROBLEM 


Problem 


a See text for explanation of abbreviations. 
» Significant weights of the discriminant 
problem as determined by MANOCOVAS. 


function for the 


was obtained for each level within a problem. 
Examination of the distributions resulted in 
two modifications of procedure. First, for 
problems with more than one level of over- 
predicted or underpredicted intervals there 
was considerable overlap between groups. 
Thus, Ss within all levels of underpredicted 
groups were combined into @ single under- 
predicted group and, similarly, all levels of 
overpredicted groups were combined into & 
single overpredicted group. Second, for several 
problems with only three groups, there still 
existed considerable overlap between groups. 
Hence, to use England’s (1961) Index of Dif- 


TABLE 3 
DISCRIMINANT Weicuts FOR SETS OF CONTRIBUTING 
VARIABLES IN Eacn PROBLEM 


Problem 


Lind 


A |= 
H 
S$ | 604 | | 
AR | 593 | 1.004 | 
EA | 599 | —.699 
EI | | | 491 | 

M-F | | | | .708 


i e eene 


aa 


33 | — -892 | 
51 


tn 
a 
a 


a See text for explanation of abbreviations. 


TABLE + 
0} 
CORRELATIONS BETWEEN GPA AND AD FOR 
PREDICTION GROUPS IN SAMPLE 
BASED ON INDEX OF 
DIFFERENTIATION 


| Prediction group* 


Prob- j T 
jem i Predicted Undër- 
Overpredicted | | predicted 
j | 
aD | 49 (N = 173) 63 (N = 36) 
| zi 7 
2 | 51 (N = 147) | 50 (N = 62) 
3 | 49 (N = 147) 53 (N = 62) 
| 


4 55 (N =44) -51 (N = 107) | .55 (N = 58) 


5 | Si(N =114) 57 (N 


i] 
w 
2 


.50 (N = 65) 


6 | .53 (N = 124) 49 (N = 85) 


s Values centered in the overpredicted-underpredicted column 
are common to both groups. 


ferentiation, the predicted group was com- 
bined with one of the unpredicted groups in 
Problems 1, 2, 3, and 6. 

Based on the cutoff discriminant scores ob- 
tained in Sample A, Ss in Sample B were 
placed onto prediction groups. There were 
significant tetrachoric correlations for Problems 
3 and 6 (p < .01 and p < .05, respectively). 

Effectiveness of moderators in discriminating 
prediction groups: Classification Method 2. The 
chi-square technique (Cooley & Lohnes, 1962) 
resulted in a second set of discriminant cutoff 
scores. None of the tetrachoric correlation 
coefficients revealed a significant relationship. 

Effectiveness of moderators on validity: Groups 
based on Index of Differentiation. Table 4 pre- 
sents the correlation coefficients between ACT 
and GPA for the prediction groups within 
each problem, None of the coefficients for the 
prediction groups within any problem differed 
significantly from .52 (overall validity coeffi- 
cient for Sample A). 

Effectiveness of moderators on validity: Groups 
based on chi-square classification. Table 5 pre- 
sents the correlation coefficients between ACT 
and GPA for the prediction groups within 
each problem. Again, none of the coeficients 
for the prediction groups within any problem 
differed significantly from .52. 
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Absolute Difference Technique 


The correlations between /D/ and Spee 
Approval (TA) (r= —-17, p < .05) and p 
and Education Acceptance (EA) (r = — È , 
p < .05) were significant in Sample A. fl 
each of the two potential moderator variab 5 
(TA and EA), two fractionations were — 
(a) } low and 3 high scorers, and (b) 3 low and 3 
high scorers. Table 6 presents the correlations 
between GPA and ACT for each of the frac- 
tionated groups. The difference between vali- 
dity coefficients for the 3 low and 3 high TA 
scorers and for the } low and 3 high EA scorers 
was significant (p < .05 in both cases). 

Neither of the differences in correlations 
between fractionated groups was significant 1m 
the cross-validation sample. 


Algebraic Difference Technique 


The correlations between + D and the 
Culture Fair Intelligence Test (“g ) 
(r = — .15, p < 05), + D and Masculinity- 
Femininity (M-F) (r = — .22, p < .01), and 
+ D and the Analysis of Relationships (A-R) 
(r = —.29, p < .01) were significant. For each 
of the three potential moderators g”, 
M-F, and A-R) trichotomizations were made. 
Table 7 presents the correlations between GPA 
and ACT for each of the prediction groups with 
respect to the three potential moderators. The 
difference between the predicted and both the 
overpredicted and underpredicted groups was 
significant (p < .05) when the trichotomiza- 
tion was based on M-F. Again, the differences 
were not significant in Sample B. 


TABLE 5 
CORRELATIONS BETWEEN GPA Anp ACT FOR 
PREDICTION GROUPS IN SAMPLE B BASED 
ON CHI-SQUARE CLASSIFICATION 


Prediction group 
Prob- é 7 7 e 
lem ver- ünder 
Pre- ‘ 
pre- N | si N pê N 
dicted | | dicted | syed 
= | | 
u 50 | 30| 36 | og) az | 8 
2 | 39 | 85 | 52 48 43 76 
3 | 55 | a1} 3o |a| 45 | 7 
4 | 52 | 101| ‘61 | io} a8 | 8 
te se | wr l2) s: | 90 
a mae E | 2 || 3 107 
| ae 
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TABLE 6 


CORRELATIONS BETWEEN GPA anp ACT FOR 


FRACTIONATED GROUPS IN SAMPLE A 
R | 
Variable Fractionated | N | K 
group | 
| a ae J 
ma | ma | 2 
| 3high | 137 
| 2 low | 138 
| } high | Ti | 
EA blw | 67 
| 3high | 12 | 
| 3 low | 137 | 4 
| } high 72 | 56 


‘A = Teacher Approval. EA = Education 


Acceptance. Correlations consecutively underlined are signifi- 


cantly different from each other at P<: 5. 


DiscussioN 


This article concerns the use of discriminant 
analysis for the systematic identification of 
moderator variables in an MPGV model. 
In addition, results from MPGV are compared 
with those of simple algebraic and absolute 
difference techniques. The results of this 
research did not support the use of any of the 
moderator variable techniques because of the 


problems and inconsistencies within each 
method. 
Multi predictable Group Validation Model 


The variables discriminating the groups in 
Sample A were not effective in Sample B. The 
ineffectiveness of discriminant analysis in this 
situation is due to several problems. First, 
the discriminant weights obtained in the initial 
MANOVAs were not different enough to lend 
themselves to an eyeball analysis for ex- 
tracting significant contributors. Hence, 


MANOCOVA was necessary; the selection of 
the potential moderator variables was based 
he largest weights 


on the a priori notion that t : 

indicate the most important variables. It is 
possible that the selection of a different set of 
variables; for example, those that are signifi- 


cant in @ univariate analysis of variance, 


would also provide @ significant MAN effect. 


The necessity for using MANOCOVA limits 
the facility of using discriminant analysis to 

identify moderators. : 
Second, & significant discriminant function 
hat all levels within a factor are 


does not mean t 
discriminable. It is possible that the means for 


one set of levels are similar as are the means for 
a second set of levels; however, the first set 
of levels is different from the second set. 
Discriminant analysis does not necessarily 
provide a discrimination between all levels, 
but distinguishes between sets of levels. 

Overlap or similarity of levels necessitated 
modification of the proposed procedure. For 
most problems studied, & discrimination be- 
tween all levels of the prediction group Was 
difficult. As a result, levels within a prediction 
group were combined. For some problems, it 
was further necessary to combine to the extent 
that the three basic prediction groups (over- 
predicted, predicted, and underpredicted) could 
not be maintained. The inability to maintain 
all levels and/or groups was due to the small 
amount of between group variance accounted 
for by the moderators. For the six problems, 
the proportion of between group variance 
accounted for by the set of moderators was 
09, .12, .05, .14, 19, and .16, respectively. 

Third, if two variables are highly correlated, 
they provide redundant information for dis- 
criminating among the groups: Thus, if a set 
of subtests are highly and significantly corre- 
lated, only one need be retained in the analysis. 
The procedure and interpretation would thus 
be facilitated. 

Fourth is the problem of misclassification. 
The Index of Differentiation, & relatively crude 
estimate of classification, provided largely non- 


TABLE 7 
CORRELATIONS BETWEEN GPA anp ACT FOR 
PREDICTION GROUPS IN SAMPLE A BASED 
ON +D CORRELATION 
WITH MODERATOR 


Variable* Prediction N & 
group 

Age Overpredicted 62 49 
Predicted 91 42 
Underpredicted 56 59 

M-F Overpredicted 53 35 
Predicted 107 .66* 
Underpredicted 49 49 

A-R Overpredicted 67 A2 
Predicted 84 39 
Underpredicted 58 35 


a Abbreviations: 
M-F = ‘Masculinity-Femininity, A-R = 
ships. i 

* Predicted is significantly different from poth overpredicted 
andzunderpredicted at p <.05. 


“g” = Culture Fair Intelligence Test, 
‘Analysis of Relation- 
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ignificant associations. The chi-square tech- 
ique, derived from a discriminant analysis 
ystem, maximizes the probability of correct 
Jassification. With both techniques, there was 
onsiderable misclassification as evidenced by 
the lack of many significant tetrachoric correla- 
tion coefficients. Examination of the con- 
tingency tables revealed that there was a 
tendency to place less than the actually ob- 
served number of Ss into the predicted group. 
Sample size is a related problem. Sample 
sizes of the prediction groups depend on three 
factors: (a) validity coefficient for the entire 
sample, (b) = D interval for the predicted 
group, and (c) number of levels within the 
overpredicted and underpredicted groups. The 
higher the validity coefficient for the total 
sample and the greater the + D interval for 
the predicted group, the more unequal are the 
prediction group sample sizes. In addition, if 
more than one level of underpredicted or over- 
predicted is used, the sample sizes of the levels 
within the unpredicted groups, especially the 
more extreme levels, become smaller. With 
relatively small and unequal sample sizes 


among groups, the test for significance between 
coefficients loses power. 


Absolute Difference Technique 


EA and TA were identified as moderators in 
Sample A. Results indicated that Ss were more 
predictable if they had relatively higher 
acceptance of educational objectives and 
higher approval of their teachers’ methods and 
classroom behaviors. The results did not hold 
up in cross-validation in Sample B, thus sup- 
porting the need for verifying moderator 


results prior to making decisions (Banas & 
Moore, 1968). 


Algebraic Difference Technique 
ugn 


M-F, “g”, and A-R were identified through 
their correlation with + D as possible modera- 
tors in Sample A. The only variable that pro- 
vided differential validities was M-F. The 
failure of “g” and A-R to yield differential 
patterns of validity may be due to restriction 
of range. Moderator variables are usually 
uncorrelated with predictors or criteria. In this 


study, M-F was uncorrelated with GPA and 


ACT, whereas “g”? was correlated with GPA 
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(r = .23, p < .01) and ACT (r = 38, p < 01), 
and A-R was correlated with GPA (r = 42, 
p < 01) and ACT (r = .71, p < 01). Conse- 
quently, subgrouping on the latter two poten- 
tial moderator variables was equivalent to 
restricting the range on both the predictor and 
the criterion, resulting in lower validity 
coefficients. . 

With regard to M-F, differential validities 
appeared in Sample A. For those Ss with 
normal masculine-feminine tendencies the va- 
lidity coefficient was highest. The problem of 
cross-validation is evident in this technique 
the /D/ technique. 


as 


Implications 


The data of this research do not confirm the 
use of discriminant analysis to identify modera- 
tors in an MPGV model. Likewise, the data 
do not support the use of the algebraic dif- 
ference or absolute difference (Ghiselli, 1956, 
1960b) techniques. However, a consideration 
of the problems of discriminant analysis and 


those of cross-validation suggests 


further 
research. 


First, future research should be conducted on 

larger sample sizes, thus limiting the effect of 
sampling error. Second, to facilitate procedure 
and interpretation, no more than one level 
within a prediction group is necessary. 
: Third, when potential moderator variables 
in a matrix are significantly and highly inter- 
correlated, only one variable of the set needs 
to be employed in the discriminant analysis. 
If the number of variables is extremely large, 
factor scores derived from a factor analysis 
can be used as the dependent variables in the 
discriminant analysis. 

Fourth, since some evidence does exist that 
effective moderator variables may be un- 
correlated with predictors and criteria, all 
ng emp variables highly correlated 
a a ictor and/or criterion can be 
z Geant n to subsequent investigation Mm 
sci nt analysis. Also, the effect of a 
ae al moderator as an independent pre- 

or should be examined. 
Pon once the moderators are determined, 
ering the variables inde- 
pendently, one at a time. Thus, a simple 
analysis would be possible without interaction 
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effects. Joint moderators were not found in 
academic situations in studies by Zedeck, 
Cranny, Vale, and Smith (1969) and Stricker 
(1966). In addition, the procedure would be 
easier since raw scores would be used in classi- 
fication as opposed to discriminant scores. 
Sixth, different classification procedures from 
the ones in this study might be tried. In addi- 
tion, using more than one discriminant func- 
tion for classification may account for more 
of the variance and result in better outcomes. 
The use of a technique such as discriminant 
analysis in conjunction with MPGV does have 
some advantages. First, if variables could be 
found to identify the prediction groups, then 
greater emphasis in the future will be placed 
on reliability of instruments. Second, main- 
taining differences among unpredictable indi- 
viduals enhances psychological understanding 
and leads to different priorities of selection. 
For example, suppose two individuals had the 
same acceptable predictor score but different 
moderator scores. On the basis of moderator 
scores, if one individual was placed in the 
underpredicted group and the other in the 


predicted group, then the MPGV concept 
suggests hiring the former before the latter. 


In essence, MPGV reduces the amount of 
misses more than does the absolute difference 
technique. 

Finally, & systematic technique for selection 
of moderator variables would allow us to 
examine larger masses of data and select those 
variables which serve as moderators. A 
systematic technique is more efficient than 
intuition for discovering or developing modera- 
tor variables. 
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TABLE 2 


MEANS AND STANDARD DEVIATIONS OF GENERAL APTITUDE TEST BATTERY 
p i APTITUDES BY AGE AND OCCUPATION 


Age group Warner iii 
Warner 7 level | | | 3] a 
level | 5-35 | 36-40 | 41-45 | 46-50 | 51-55 | 56-83 | 25-35 | 36-40 | 41-45 | 46-50 | 51- 
| hace | | ze 
Intelligence Spatial Aptitude (continued) 
ao tothe. r | 
B | | 120.6 | 118.0 | 111.9 š | 108.7 | 110.8 | 109.6 | 107.5 | 
¥ 5.9 | 123.9 | 123.3 | 120. o| u Š 108.7 | 110.8 | 109. 5| 
a wal 173 | 15.2) 142| 161} 15. SD) 174| 175 | 15.3 | 149 | 
| F | 
Pe 118.2 | 119.3 | 115.8 | 115.3 | 111.4 | 109.1 X | 105.7 | 105.5 | 98.5 | 99.2 
sp | 167 | 13.7| 140| 145) 15.5| 18.6 SD | 145 | 181 | 181] 16.4 
4 T | 
X | 107.5 | 108.5 | 104.8 | 105.2 | 108.1 | 99.4 X | 106.3 | 99.2 | 103.9 | 100.6 
sp | 150} 151| 14.3 | 163] 15.2] 20.1 SD | 154 | 15.7 | 14.1 | 15.6 
| 
5-7 : y 
X | 104.4 | 102.8 | 105.2 | 101.8 | 100.7 | 98.0 Porn Perception 
sp | 16.0 | 146| 14.2) 13.7 | 161) 12.0 i 
[i 1 | 
z F 1-2 | 
Verhal Aptitude X | 1177 | 115.5 | 113.4 | 105.6 
= ; SD | 15.5} 176| 18.0) 15.4 
1-2 | | g. | | 
X | 121.5 | 120.8 | 120.8 | 116.0 | 116.2 | 112.7 X | 112.7 | 109.8 | 107.1 | 104.3 
sp| 143| 171} 157| 15.1) 15.2| 165 sp | 190} 200| 193| 198 
3 


4 
X | 116.6 | 115.2 | 112.3 | 114.5 | 111.6 | 109.8 X | 107.8 | 109.4 | 101.0 
SD | 169) 14.1 | 13.5 | 14.6) 16.4 


94.7 
4| 161 SD | 18.0 | 16.9 | 163) 17.5 
4 be, fm 5-7 
Š | 104.2 | 107.2 | 104.6 | 107.4 | 110.9 | 100.1 Š | 1045) 98.7 | 104.0 | 96.6 
SD | 134) 14.3| 12.2 | 16.6] 11.9] 19.4 SD | 207) 141| 16.2| 15.7 
5-7 | | arse | 
X | 102.5 | 101.3 | 104.3 | 103.4 | 101.8 | 102.6 Cridi oe 
SD | 15.4 | 145] 12.7| 128| 154| 13.8 enbel Becca tion 
\ | al | Pa | aie |e 
| ea RE =i a A a o N] 
Numerical Aptitude X |1249 | 125.8 | 124.5 120.1 
| DAE SD | 16.2 | 1641 | 152| 160 
1-2 | 3 
X | 126.4 | 122.2 | 123.2 | 102.3 | 117.6 | 114.9 X | 123.8 | 123.6 | 117 
= 15.4| 16.1 | 161 | 15.0 1| 161 SD | 179 | 168 i ee 
a 4 i i 
X |118.5 | 118.7 | 115.7 | 112.6 | 110.9 | 108.5 X | 114.4 | 14. | 
8 . 3 j 8 | 
SD | 16.3 | 15.5 | 16.0) 19.1] 16.2| 20.2 SD | 138 | 12.7 | T a 
4 | | 5-7 ia ie i iad 
X |1049 | 106.5 | 105.0 | 103.8 | 108.5 | 98.1 x | | 114.5 | 
SD | 146) 143| 134] 164) 184) 188 SD tee Pm "a | ee 
. 5-7 | ` ` 


13.1 | 12.8 


X | 102.8 | 102.5 | 104.3 | 98.2 | 


sakasa 5| 162| 16.7) 


99.2 | 94.7 re aie > = 

| SD | 16.0 | 14.3 | 12.8 | 13.2 | 153| 141| _ Motor Coordination 

' a sae Tans a g mon 1-2 “hor 7 T 

l Spatial Aptitude A | 

E 7 an ‘a a 118.2 | 117.8 | 112.8 | 115.2 | 111.3 | 107.8 
is | | F | 14.7 | 17.8 | 16.6 | 16.9 | 17.6 | 20.5 
b X |1146 | 113.7 | 111.3 | 107.9 | 105.2 | 96.3 Yoav | | 5 
l fei Si 5 2 117.9 | | Be 
t SD | 172| 189| 164| 163| 162| 144| Sp) agg | 166 | 1119 | 1130 | 108.5 | 100:5 
#¥ pea > 


Hal i54) 19° 
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"TABLE 2— (Continued) the others were significant at the .001 level. In 
j 4 all cases, mean scores declined with age. The 
Age group effect of SES was statistically significant for 
| PoR N eight of the nine aptitudes. Manual Dexterity 
| 25-35 | 36-40 | 41-45 | 46-50 | 51-55 | 56-83 did not change significantly with SES. Finger 
- = ——— Dexterity was significant at the .01 level, while 
Motor Coordination (continued) the other seven aptitudes were all significantly 
a = i affected by SES at the .001 level. In all cases 
g | 105.6 | 106.4 | 107.5 | 107.2 | 105.1 | 98.2 ihe Tie from lower SES levels had. lower 
Sp | 185) 163| 184| isel 20:6] 224 scores for the eight aptitudes significantly 
| atiected by SES. 
| | | 2 For none of the nine aptitude scores was 
p| 199| 13.0) 127) 127 168 | 17.5 there a statistically significant interaction be- 
—— eee - tween age and SES. 
Finger Dexterity The percentage of variance accounted for by 
the main effects of age and SES in the nine 
analyses of variance was computed based on 
the procedure described by Hays (1963). The 


Warner 
level | 


104.9 | 104.0 | 107.7 | 100.1 | 98.8 | 95. 
ri 


3 98.8 | 89.3 | 89.0 85.6 | 77.1 70.9 


SD 20.7 17.8 19.4 18.9 19.0 20.0 

3- results are shown in Figure 1. The percentage 
Z ays eke ae Be {he oe accounted for by SES in each aptitude is on 
sp | 21.0| 19.0} 203 | 205 20.7 | 23.1 the horizontal axis, that for age on the vertical 


axis. The magnitudes of the values in Figure 1 
248 reflect the fact that differences in SES are 


d 95.6 | 89.8 | 86.8 79.5 | 75.8 
1 


sp | 16.6 | 16.1 | 17.3 | 174) 19.9 
a wea | sal gala re reflected most in Numerical Aptitude, Intelli- 
o| 168 Tho | isa ee i20 rat gence, Verbal Aptitude, and Clerical Percep- 
p L ige a ui p : tion. In contrast to the results of the “four 
Manual Dexterity state” study (United States Department of 
ie! gee i Labor, 1967, p. 225), the effect of age on Ver- 
1-2 
Š 107.2 | 102.5 98.4 | 93.6 | 92.4 | 80.1 
sp | 23.1 18.4 | 19.5 | 17.0 18.9 | 18.4 u 
3 
g | 104.8 | 108.3 | 99:2 97.6| 964| 824 X 
sp | 20.1 | 20.1) 218 22.0 | 20.1 | 19.2 W 
4 | | 
g | 106.3 | 103.5 | 99 os| 883| 993 & 
ep | 208) 13 2 209 | 228| 228 & 
5-7 i m N 
x |1039) 98.9) 1017 99.6 | 86.2 | 764 
smi 199) 184) 17 | 170 | 157| 24 $ 
ee mal Q 
——— = $ 
& 
ù 
a 


7 i 0, 

Warner Levels 6 and 7 are 21% and e of the 
group, respectively, in the present study. 

Nine two-way analyses of variance were con- 
ducted on the aptitude scores with age and SES PERCENT VARIANCE : SES 
as factors. (A discussion of similar mere Fic. 1. Percentage of variance accounted for in nine 
performed on the 12 GATB subtests are de- GATB audes by differences in four i in 
scribed elsewhere [Fozard & Nuttall, 1970; (SES) and six age classifications, (Percentages of 


; ” weighted cell variance were calculated assuming a fixed factor analysis 
Nuttall & Fozard, 1970) The unWelb of variance. Abbreviations: F = Finger Dexterity, 
means pro 


or each. Re- M = Manual Dexterity, P T Form Perception, S 
sults of the analyses 


cedure was employed fi ch. ; 
showed statistically = Spatial Aptitude, K = Motor Coordination, 

significant age effects for all apt 

Verbal Aptitude was significant att 


tude scores. = Clerical Perception, V = Verbal Aptitude, G = In- 
he .05 level; telligence, and N = Numerical Aptitude.) 
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Fic. 2. Percentage of 36 Occupational Aptitude 
Patterns for which the average of men in each of the 
combinations of age and SES qualify. 


bal Aptitude was statistically significant in the 
present study. 

The hypothesis of curvilinearity in the effect 
of age on Intelligence, Verbal Aptitude, 
Numerical Aptitude, and Spatial Aptitude 
(Hirt, 1959) was examined. Trend analyses, 
using Gaito’s (1965) procedure for unequal 
numbers of cases and unequal intervals, were 
performed for the age differences within each 
Warner Level separately. The results of the 
36 trend analyses are summarized elsewhere 
(see Footnote 3). 

In the present sample Warner Levels 5-7 are 
most similar to the level of occupations repre- 
sented in Hirt’s (1959) study. In contrast to 
Hirt’s results, in the present study none of the 
quadratic trends in that group was significant 
at the .05 level. In Warner Level 4, however, 
the quadratic trends were significant for all 
aptitudes except for Clerical Perception. For 
the Warner Level 3 group only Spatial Apti- 
tude and Manual Dexterity had significant 
quadratic trends. At Warner Level 1-2 Intelli- 
gence, Spatial Aptitude, and Form Perception 
had significant quadratic trends. 

Examination of Table 2 shows that most of 
the significant quadratic trends occurred be- 
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cause the differences between the two or three 
oldest groups were larger than those between 
the younger groups. The exceptions are the 
quadratic trends for Intelligence, Verbal Apti- 
tude, and Numerical Aptitude in W arner 
Level 4, in which the means for the 51-55-yr. 
age group were higher or almost as high as in 
any of the younger groups. 

The linear trends were statistically signifi- 
cant in 27 out of 36 cases. Most consistent 
linear trends in all SES groups were those for 
the psychomotor aptitudes, Finger Dexterity 
and Manual Dexterity. 

The procedure used by Heron and Chown 
(1967) to assess differences in the age trends of 
the two extreme SES classifications was applied 
to the age differences for Warner Levels 1-2 
and 5-7. The differences between the means of 
these two Warner level groups in each of the 
six age groups were found for the nine apti- 
tudes, and the rank order of correlation be- 
tween these differences and age were computed. 
Only for Intelligence and Verbal Aptitude were 
the rank order coefficients significant beyond 
the .05 level. The significant correlations oc- 
curred because the differences between the two 
extreme groups grew smaller as age increased. 
The age decline was greater in the Warner 
Level 1-2 groups (see Table 2). These results 
are the opposite of those found by Heron and 
Chown (1967). 

Occupational Aptitude Patterns. The mean 
score on the aptitudes for each of the 24 groups 


was compared to the cutoff scores for each of 
the 36 OAPs (United Sta: 


tes Department of 
Labor, p 


D 1962). The percentage of OAPs for 
which these mean scores qualify is indicated 
in Figure 2. Clear differences are observed 
between the SES levels, especially between 
Warner Levels 1-2 and 3, as contrasted to 
Warner Levels 4 and 5-7. Age, however, 
seemed to have a stronger effect ‘than SES, 
with all four SES groups dropping precip- 
itously with advancing age. The young Warner 
Level 1-2 group qualified, on the average, for 
all 36 OAPs, but the oldest group for this SES 
level qualified on the average for only 17 
OAPs, Similarly, the lowest SES group quali- 
fied for 28 OAPs on the basis of their average 
Scores at the youngest age, but only for 9 
OAPs at the oldest age. Clearly the older 
worker in all SES groups is severely handi- 
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capped when compared to younger men on the 
basis of OAP cutoff scores. 


DISCUSSION 


The major results of the study may be sum- 
marized as follows: (a) Both age and SES 
affect average performance on the GATB 
aptitude scores and the average percentage of 
OAPs passed. (b) There were no statistically 
significant interactions between the effects of 
the two variables when all Warner Levels 
were considered. When the data of only the 
two extreme SES groups were used, however, 
in GATB aptitude of Intelligence and Verbal 
Aptitude, there was a statistically significant 
tendency for the differences in the scores of 
the high and low SES groups to become 
smaller with age. (c) In general, aptitude scores 
which were most influenced by age were least 
influenced by SES and vice versa. 

There was little evidence in the present data 
that the curvilinear trend in age differences for 
Intelligence, Verbal Aptitude, Spatial Apti- 
tude, and Numerical Aptitude found by Hirt 
(1959) generalizes across SES classifications. 
The occupational levels of his sample were 
only poorly represented in the present group 
so that a direct comparison was not possible. 

There were many statistically significant 
curvilinear trends for age differences in the 
GATB aptitude scores. Except for Warner 
Level 4, these trends could be attributed to 
differences in the amount of decline between 
older and younger age groups, that is, a 
threshold effect. 


The exceptions in the Warner Level 4 groups 
occurred because performance 1m the 51-55-yr.- 
old group was high in the aptitude scores most 
sensitive to SES. One hypothesis to explain this 
effect is that the initial occupation of Ss in this 
group may have been of a lower status on 
that suggested by their aptitudes. Men 2 ihis 
age group typically held their first jobs mae 
the depression years of the 1930s, when en 
were fewer jobs i higher ~ : 

categories. Ina depressed job market, a given 
pattern of abilities would lead to a pang 
occupational attainment, hence lower BS, 
than would be the case in an expanding job 
market. Many men of the depression ere would 
then be “frozen” in an occupation below what 


vee 


their abilities would have allowed them to 
achieve in better economic times. Some sup- 
port for this hypothesis comes from Table 1 
which shows that the educational level in the 
group is somewhat higher than 
Level 4. 

analyses of the present 
data show unequivocally that there is no 
statistically significant interaction between thel 
main effects of age and SES. Thus, there is no 
evidence in the present data analysis to support 
the idea that age decrements in cognitive 
functioning decline less in high- than in low- 
SES groups. 

Application of Heron and Chown’s (1967, 
pp- 104-106) procedure for assessing differential 
declines in extreme SES groups showed that 
the amount of decline in Intelligence and Ver- 
bal Aptitude was different for Warner Levels 
1-2 and 5-7. In contrast to Heron and Chown’s 
results, however, which showed the largest 
differences between the two SES groups at the 
oldest ages, the present study found the two 
SES groups becoming more similar in Intelli- 
gence and Numerical Aptitude with increasing 
age. 

Even allowing for differences in tests and 
systems of SES classifications in the two 
samples, it would not be expected that the age 
trends would be directly opposite, particularly 
in Intelligence, which is the best unitary mea- 
sure of general intelligence available in the 
GATB (United States Department of Labor, 
1967, Ch. 14). One explanation of the dis- 
crepancy is that the educational differences in 
the two extreme SES groups were greater 
among the older men than among the younger 
men in the British sample. Alternatively, one 
could argue that the lowest SES groups were 
not well represented in the present study, 
particularly in the oldest age groups. As a re- 
sult, the observed age differences between 
extreme SES groups would not be as large in 
the present sample as in the general population. 

An additional possibility is that the differ- 
ential age effect in high and low SES groups 
observed by Heron and Chown (1967) is 
peculiar to the Progressive Matrices Test. The 
GATB Aptitude of Intelligence, and also 
Numerical Aptitude, measure more the “crys 
tallized” intelligence that the “fluid” intelli- 
gence as indicated by the Progressive Matrices 


51-55-yr.-old 
average for Warner 
The results of all the 
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METHOD FOR DETECTING RANDOM ANSWERS 
ON PERSONALITY QUESTIONNAIRES 


JERRY W. O’DELL' 


Eastern Michigan Universily 


i i y for the Sixteen 

| strategies for detection of randomly answered protocols 
pala factor Questionnaire, Form A, 1967 edition were developed and evalu- 
ated. Cattell’s Sabotage index predicted poorly, indeed in the wrong direction, be- 
tween normal and randomly answered sheets (t = 2.10, p = .05). Random and 


Carelessness indexes were derived by the author for the same purpose and predicted 
very well, providing és of 15.02 and 12.67 (significant at well beyond the 001 level), 
respectively, between random and normal sheets. More importantly, both indexes, 
while correctly selecting 86% and 88% of the random answer sheets, respectively, 
incorrectly picked out only about 6% of the normal group. Thus, while detection 


of random protocols is rather accurate, few normal people would be offended by 
having their accuracy questionned. Since the design of the study included a form of 


use. 


Randomly filled-in answer sheets pose night- 
mare-like problems for any person putting 
psychological tests into practical use. Recent 
rapid development of automated scoring and 
interpretation systems for tests (see Butcher, 
1969) is compounding the problem by leaps and 
bounds. The emphasis of these systems is more 
and more that of producing a report indis- 
tinguishable from that produced by a human 
psychologist, and there seems little doubt that 
really elegant (although not necessarily valid) 
systems will be available in the very near 
future. These will be capable of describing 
everything from probable work habits to clini- 
cal syndromes. This development will make it 
possible for a personnel worker or counselor to 
obtain a report looking almost exactly like the 
routine production of an expert test interpreter 
with little more cost or effort than is required 
for simple scoring, and on tens of thousands of 
persons. These elegant programs unfortunately 

have no way of distinguishing a randomly 
answered test from one completed with care. 
They can and will generate beautifully be- 
lievable reports on a meaningless protocol. It 
should be evident that if such great numbers of 
persons are tested, a discouraging number of 
absurd reports is going to be developed on 
randomly answered sheets. The author, in 
attempting to develop an automated interpre- 


1 Requests for reprints should be sent to Jerry W, 
O’Dell, Eastern Michigan University, Ypsilanti, Michi- 
gan 48197. 
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cross-validation, it is suggested that these indexes might well be of great practical 


tation system for the 1967 Sixteen Personality 
Factor Questionnaire, Form A, (16PF; Cattell, 
Eber, & Tatsuoka, 1970) ran up against the 
problem of random answering so frequently 
that it became clear that a means would have 
to be developed to solve it. 

Although the basic problem is that of picking 
out as many of the random profiles as possible, 
one error in particular cannot be tolerated, 
namely, calling a sheet randomly answered 
when it has in fact been filled out in good faith. 
Such an error leads to the difficult ethical 
problem of having to incorrectly question 
someone’s honesty. A high proportion of errors 
of this sort could lead to chaos in any organiza- 
tion, and any method used should make as few 
errors as possible in this regard. 

Over the years there have been many at- 
tempts at developing indexes of careless answer- 
ing, none of them apparently having been 
demonstrated to be successful. For the 16PF, 
Cattell himself (Cattell, Eber, & Tatsuoka, 
1970) has considered the problem of random 
answering, which he colorfully calls ‘“sabo- 
tage.” He has proposed a method for sabotage 
detection on purely theoretical grounds. The 
basic assumption of this procedure is that the 
Score on one-half of a given 16PF scale should 
be predictable from the other half of the scale 
through regression techniques. For each scale, 
the difference between the actual and predicted 
score is obtained, each difference is squared, 
and the squared differences are summed across 
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all 16 scales to provide the sabotage index. It is 
said, on purely theoretical grounds, that one 
may be 99% sure of sabotage if the score 
exceeds 102, and 95% sure if it exceeds 83. To 
the best of this author’s knowledge, this index 
has never been systematically tested, and such 
a check clearly needs to be done. 

Preliminary work with Cattell’s index sug- 
gested that it was not effective, and thus it 
seemed necessary to try other methods of de- 
tection, One method, used in this article, 
employs extreme item alternative response fre- 
quencies. The frequency of response to each 
alternative of each question is tallied in a group 
of persons motivated to answer accurately. 
Those alternatives that are chosen by very few 
people in the population, say by less than 10%, 
are then selected and put together to make upa 
“random” scale. Accurate answering of the 
questionnaire should yield a low score on this 
scale; careless responding should lead to a much 
higher, chance score. Such a scale may also be 
constructed from alternatives which virtually 
every person in a motivated group selects, say 
90% or more. Here careful answerers should 
obtain a high score, with randomness being 
indicated by a lower, chance score. Variations 
in this approach have been used in the con- 
struction of the V score on the various Kuder 
Preference Records (Kuder, 1950, 1964) and in 
the Minnesota Multiphasic Personality In- 
ventory F scale (Dahlstrom & Welsh, 1960). 
For some reason, however, there seems to be no 


h reported on the efficacy of 


systematic researe 
these methods in practice. This may well be due 


to an understandable tendency of workers to 
: “fak 
focus on the more glamorous topics of faking 


good” and “faking bad.” 
In addition, it seemed desirable to measure 
some very specific and obviously deviant Te 
sponse sets, such as answering all the questions 
with the same alternative, leaving too pon, 
items blank, or having an jnordinate pe - 
multiple responses. e 16PF also ae 
three direct questions about anand i a 
sponding (Numbers 1, 2, and 187) that a 
whether one has understood the Sain isan 
whether one is prepare truthfully, 
and whether one is sure that he has not omitte 
any responses: Obviously, if an » 

hese, there shou 


about any of t i 
the whole record. It was the various 


mentioned in this paragraph that were com- 
bined to provide the “carelessness” index 
mentioned subsequently. 

The purpose of this article, then, is to 
evaluate the practical effectiveness of Cattell’s 
Sabotage index, to construct and test a single 
scale for the detection of random answering, 
and, finally, to develop and check a summary 
index for as many indicators of careless 


answering as possible. 


METHOD 


Development of the original scales was done on & 
sample of 250 students in introductory psychology 
classes at Eastern Michigan University- Virtually all Ss 
were freshmen and sophomores between the ages of 1 
and 20, The sample was well balanced with regard to 
sex. The 16PF was administered during @ normal class 
period, and students were informed that they would be 
able to see the results of the test in private consultation 
with the instructor. The prospect of seeing the results 
engendered obvious interest in the test. This interest 
was essential to keep the number of carelessly answere' 
sheets to a minimum. ‘All scoring and data processing 
was done through the use of an OpScan scanner and an 
IBM 1130 computer. 

Evaluation of Cattell’s sabotage score was done 
through a computer subroutine, which, because of the 
complexity of the calculations, was checked many times 
against hand-computed results to insure accuracy. 

The scale to detect random answering (hereafter re- 
ferred to as random) was constructed by the method of 
selecting items with extreme item alternative response 
frequencies as was mentioned earlier. There are two 
basic ways of doing this. (a) An alternative may be 
sdected for the scale if it is answered by a very small 
percentage of the sample. In this case, if selected by an 
individual, it would count one point toward the random 
score. (b) An alternative may be selected for the scale if 
it is selected by a very large percentage of the sample. 
Such an item would count one point toward the random 
score if it were not selected by a respondent. Note that 
on the usual true-false test, the two ways are essentially 
the same. For example, ifa given item is answered true 
by 10% of the sample, it will of necessity be answered 
false by 90% of the sample. 

The existence on the 16PF of three alternatives—A, 
B, and C—leads, however, to a troublesome ambiguity. 
Consider an item with alternative response frequencies 
of 2% A, 49% B, and 49% C. Such an item is an obvious 
candidate for the first strategy mentioned above and 
could clearly be made to contribute one point to the 
random score if it were answered A. On the other hand, 
to illustrate the ambiguity, consider an item with alter- 
native response frequencies ©! 1% A, 12% B, and 
87% C- Notice here that under the first strategy either 
Jtem A or Item B might be considered infrequent. But 
the question arises as to which of the two should be 
chosen or whether both should be picked. Fortunately, 


there is another way of looking at this last set of per- 
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centages, namely, that what was really happening T 
not so much that Items A or B were being rejected, bu 
rather that C was being preferred. It makes poma 
conceptual sense in such a case to use = secon 
strategy, that of scoring the high frequency a peta 
if it is not selected by a testee. Thus, in the example, 
Alternative C, selected by 87% of the sample, would be 
scored if not answered, and the ambiguity about which 
of the other two alternatives to score 15 eliminated. 
Both strategies were adopted in the construction of 
the present random scale. Specifically, an item was 
chosen for the scale if it were answered by 10% or less of 
the sample, assuming that the other alternatives were 
both frequently answered. If they were not both fre- 
quently answered, however, (ie., if the second situation 
mentioned above arose), then the item was made to 
contribute one point to the random score if the fre- 
quently answered alternative was not chosen. In the 
latter case, it was initially planned to use only alter- 
natives chosen by 90% or more of the sample, but it was 
found that this cutting point could not be held as rigidly 
as could the 10% point. The average percentage of 
choices, however, was 90.35 for all the items selected 
through the second strategy. The only serious deviation 
from this cutoff point was one item that had a popu- 
larity of only 80.4% but which still seemed effective. 

Items selected for the random scale through the 
procedure mentioned were as follows: Numbers 38, 54, 
77, 78, 80, 93, 102, 118, 119, 121, 143, 1*, 2*, 29*, and 
112* added one point to the score if answered A, 
Numbers 12, 20, 23, 51, 52, 66, 87, 91, 109, 127, 28*, 
53*, and 128* counted if answered B. Only three were 
keyed C, namely, 4, 73, and 116*. Items with asterisks 
counted if they were not answered. Items picked for this 
scale are distributed fairly evenly over the scales of the 
16PF with the exception that 8 of the 31 were on the 
intelligence scale, B. Exclusion of these items unfortu- 
nately reduced the efficiency of the scale, but it is 
probably reasonable to hope that the 16PF will be used 
primarily with college students or persons who are 
college trained. Hence, these intelligence items should 
remain a fairly constant factor. 

In detecting extreme response sets, simple cumulative 
tallies were made of the answers to the various alterna- 
tives. It was found that of the 250 Ss no one selected 33 
or fewer, or 100 or more A responses; no one selected 8 
or fewer, or 100 or more B responses; and no one selected 
29 or fewer, or 93 or more C responses. It was arbitrarily 
decided that more than five multiple responses or more 
than five blank responses would be considered excessive. 
Exceeding any of these limits was presumed to be 
evidence of carelessness. 

To this point some 15 indexes of careless answ 
have been developed, and scanning all of these on all 
records proved to be a cumbersome chore. What was 
needed was a simple summary score that would signal 
trouble with the record at a glance. It was ar 

decided that such a score should be nearly zero in the 
absence of suspicious indicators, but should rise above 
10 if trouble of any kind was sensed. Consequently, it 
was decided that detection of the really serious devia- 
tions—too few or too many As, Bs, or Cs; too many 
blanks; or too many multiple responses—s 


r hould con- 
tribute 10 points to this summary score, Tt 


ering 


bitrarily 


was origi- 
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nally intended to add 10 points to Ss’ score for answers 
other than “yes” to the three 16PF questions asking 
directly about the accuracy of responding, but the item 
counts showed that many people actually had doubts 
about their care in answering. To face this reality, an 
answer of “uncertain” was made to add only two points 
to the summary score, while an outright avowal of doubt 
added five points. Finally, since the average normal 
record yielded a random score of only 1.58, while an 
average carelessly answered record led to a score of ge 
doubling the random score would result in a score o 
about 17 for randomly answered sheets, which is in the 
desired range. The cumulative combination of these 
scores went to make up the “carelessness” index. 

The scales were checked out on two groups of answer 
sheets. One group (hereafter referred to as normal) 
consisted of 173 16PFs that had been collected over the 
years on upper-class and graduate psychology students. 
This group had no overlap with the original sample of 
250 used to develop the scales. The second group, 
referred to as the random group, consisted of 59 
randomly answered sheets. Seventeen of these were 
made up to show certain definite response aberrations. 
The remaining 42 were filled out randomly by students 
with no 16PF booklet at hand. 


RESULTS 


The Sabotage index did not fare well in this 
study. Using the cutting score of 83, it indi- 
cated that 110 of the 173 persons in the normal 
group were inattentive (63.5%), while it suc- 
cessfully picked out only 30 of the 59 actually 
random sheets (50.8%). C orresponding figures 
for a cutting score of 102 are 70 normals 


misclassified as random (40.5%), with only 


17 (28.8%) of the randoms correctly selected. 
For the normal group the ) 


mean sabotage score 
was 97.98 with a standard deviation of 32.08. 
For the random group the mean score was 88.50 
with a standard deviation of 29.23. The differ- 
ence between these means results in a ¢ of 
2.10 (df = 230, p = .05) but in the wrong 
direction. Assuming these findings to be gener- 
ally true, this number of misclassifications 
would be simply unacceptable, 

The random score w 


Taki as much more successful. 
aking a score of five 


or higher as evidence of 
—— answering, 10 wae pee of 173 
bg %o) oon incorrectly called random, while 
S rode ot 3p (86.4%) of the true randoms were 
selected. For the normal group, the mean score 
on the random scale was 1.58, with a standard 
deviation of 1.68; Corresponding figures for the 
random group were 7.80 and 3,04, The ¢ be- 


tween these mean 5 in highly 
a ans was 15.02, again highly 
significant. 1 28 


The Carelessness index had very nearly the 
same degree of success. Considering a score of 
16 or higher as evidence of random answering, 
10 of the 173 normal protocols (5.8%) were 
called random, and 52 of the 59 random 
protocols were correctly selected (88.1%). The 
mean for the normal group was 5.64, with a 
standard deviation of 6.12; for the random 
group the corresponding figures were 29.17 and 
13.81. The / between these two means was 
12.67, again most significant. 


DiscussION 


The random and carelessness scales appear to 
be promising indicators of random answering, 
especially since they have been checked out on 
a group other than that on which they were 
derived, and, hence, to a degree have been 
crogs-validated. Moreover, it is likely that 
some of the 173 normal protocols were in fact 
randomly answered, so that their efficiency 
may be slightly underestimated. And while the 
significance tests of the means are of interest, 
probably the more important consideration is 
that relatively few people who were honest in 
answering would be incorrectly accused of 
being careless (5.8%). This is not ideal, but 
perhaps sufficiently accurate that a human or 
computer scorer could state with some definite- 
ness that a given answer sheet was suspect. The 
most dramatic demonstration of the effective- 
ness of the carelessness index occurred when the 
author accidentally misplaced one of the 
OpScan output cards used for computer scoring 
in another sample of cases- This had the effect 
of making all the protocols after this point 

sores derived 
after this point jumped from virtually all zero 
to a rather steady value in the 30s. 

With regard to the relative usefulness of the 


à z e 
random and carelessness scores, It 35 th 
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author’s feeling that the latter is the method of 
choice, since it checks on a much wider range of 
problems. In addition, if computer scoring is 
used, the machine may easily be made to print 
out explicit comments as to what the particular 
difficulties are, thus bringing them to the direct 
attention of the user. 

Cattell’s “Sabotage” index, from the results 


of this study, seems of little use. It may be 


unfair, however, to condemn it outright. Cattell 
A-E—of the 


frequently warns that all forms—~* 
16PF should be given to obtain maximum 
discrimination. In using only Form A, this 
warning was not heeded. It does seem unfair to 
base this score on halves of scales containing 
only 10 or 13 items. ‘At the same time, in most 
selection situations only one form of the 16PF 
is given. The great computational labor in- 
volved in this index (at least an hour of 
frustrating work with a calculator) would seem 
to make its use unwise until much better 
evidence can be obtained of its effectiveness. 
This is especially true in view of the fact that 
the random scale developed in this article can 
be scored in about 1 min. with far less chance 
for error. 
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MULTIPLE-INDICANT APPROACH FOR STUDYING 
NEW PRODUCT ADOPTERS * 
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Aspects of the traditional approach to identifying and studying innovators (i.e, 
As] 


select the first 23% or 10% 


to try a single new product and compare characteristics 


of these individuals to later triers and nontriers) were criticized. In contrast, an 
innovation proneness index, ranging from 0 to 15, was derived based on responses to 
experimental stimuli presented for 15 product classes. Sixty female undergraduates, 
confronted by one new and four old items in each product class, role-played pur- 
chase selections. Using this index, the hypothesis that low dogmatic individuals 
would make significantly more innovative responses than would high dogmatics was 


confirmed. 


The vast majority, perhaps 90% (cf. Rogers 
& Stanfield, 1968), of the thousands of new 
products introduced annually end up being 
rejected by the consumer. Given the high costs 
involved in the research, development, and 
introduction of new products, identifying the 
innovation prone individual (i.e., the consumer 
most likely to be among the first to try a new 
product), especially those factors which con- 
tribute to his being an innovator, assumes tre- 
mendous practical importance. Not only do 
such individuals provide a toehold for new 
products, but they also seem to play an integral 
part in the subsequent diffusion of the innova- 
tion through the population at large (cf. Engel, 
Kegerreis, & Blackwell, 1969; Robertson, 
1968; Rogers, 1962; and Zaltman, 1965, p. 31). 

Generalizations from research conducted 
thus far (cf. Bohlen, Coughenour, Lionberger, 
Moe, & Rogers, 1968; Coleman, 1960; King, 
1963 ; Robertson, 1968; and Rogers, 1962) indi- 
cate that innovators tend to have higher in- 
come and status and be younger, more mobile, 
educated, literate, and privileged (i.e., have 
more discretionary income relative to their 
peers). Personality characteristics most often 
cited are venturesomeness (i.e., possessing 
favorable attitudes toward trying new ideas 
and products) and greater risk-taking tenden- 
cies than noninnovators. 


While these findings are interesting and sug- 


1The author would like to express his appreciation 
to Ann P. Reed and Sharon L. Towles for their assist- 
ance in developing this instrument and serving as Es. 
2 Requests for reprints should be sent to Jacob 
Jacoby, Department of Psychology, Purdue Un 


iversity, 
Lafayette, Indiana 47907. y 


gestive, several factors limit their generaliza- 
bility. Consider the typical study. As a rule, 
investigators select a single product, usually @ 
luxury item when studied (e.g., touch-tone 
telephones, color TV, electric toothbrushes, 
electric knives, stereo phonographs, auto diag- 
nostic centers), and arbitrarily define innova- 
tors to be the first 25% or 10% of all potential 
adopters to purchase the item. All remaining 
potential (and actual) adopters are considered 
to be noninnovators. Characteristics of indi- 
viduals in the two groups are then compared, 
often on the basis of raw frequencies or per- 
centages, and conclusions drawn. 

Limitations inherent in such a procedure are 
as follows. First, obtaining only one indication 
of an individual’s tendency to innovate (i.¢., 
using early trial of a single product) severely 
limits one’s ability to generalize about the 
characteristics of individuals who tend to be 
innovators in general, regardless of the specific 
product class. Second, given the luxury-type 
products typically selected, it is not surprising 
that innovators tend to have higher incomes 
and be more privileged. The finding that inno- 
vators take greater risks may 
their greater financial ability t 
risks, rather than being a function of some 


underlying Personality trait. Indeed, such 
factors as income, education 
) 


simply reflect 
o afford such 


} and occupation 
appear to have little relation to early adoption 


when the innovation is a new brand of a fre- 
quently purchased, low priced product such as 
coffee (Frank, Massy, & Morrison, 1964). 

Third, a trial versus nontrial criterion ig 
nores the fact that new 


aM product adoption repre" 
sents a decision-mak 


ing process of severe 
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stages. According to the most generally ac- 
cepted model (Rogers, 1962), trial is preceded 
by stages of awareness, interest, and evalua- 
tion, and the decision to continue full use of an 
item (i.e., adoption) is a postrial phenomenon. 
Treating adoption dichotomously by compar- 
ing characteristics of triers and nontriers 
is of limited value in understanding the psy- 
chological dynamics preceding and underlying 
innovation. Last, identifying correlates of 
innovativeness, expecially when these are not 
predicted in advance, provides minimal infor- 
mation regarding cause-effect relationships. 

In contrast, making adequate generalizations 
requires obtaining several indexes of innova- 
tion proneness across a variety of commonly 
purchased, low priced, necessity-type items. 
‘Another desideratum is experimentation based 
on a priori hypotheses. The present study is 
cast in this mold. 

Despite the generally disappointing results 
of studies trying to link personality to Con- 
sumer behavior (cf. Engel, Kollat, & Blackwell, 
1968, Ch. 8), the construct of dogmatism, as de- 
fined by Rokeach (1960) and supported in 
numerous subsequent investigations (cf. Vac- 
chiano, Strauss, & Hochman, 1969), provides a 
sound theoretical and empirical basis for 
making predictions regarding innovation 

roneness. Consider the relationship of dogma- 
tism to three other variables: anxiety, novelty, 
and creativity: 

According to Rokeach (1960), dogmatism is 
ed mental rigidity factor with indi- 
viduals aligned along an open- to closed- 
Highly dogmatic systems 
represent a cognitive-psychodynamic network 
of defenses against anxiety; the more persist- 


ently anxious Or threatened the individual, the 
“rect a closed mind. 


more likely he is to manifest a OT 
This dogmatism- i 

well axrunted by Rokeach and several ne 
(cf. Vacchiano et al., 1969). Consider, also, n 
research (Bauer, 1960; Cox; 1967) which ten s 
to indicate that changing from one brand or 
product to another usually entails some per- 
ceived risk which, for several reaso! 
when one contemplates trying oF f 
a new brand or product. Given that perceive 
risk, in anything other than mild | 
be anxiety inducing, high dogmatics should 
less likely than low dogmatics 


innovating. 
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The literature relating dogmatism to novelty 
is similarly suggestive. Low dogmatics score 
significantly higher on acceptance of change 
measures (Pyron, 4966a; Pyron & Lambert, 
1967), react more favorably to novel music 
(Mikol, 1960), avant-garde art (Pyron, 1966b) , 
and novel films (Zagona & Kelly, 1966), and 
function more capably in novel situations 
(Rokeach, Swanson, & Denny, 1960; Rokeach 
& Vidulich, 1960). Based on a comprehensive 
review of the literature, Ehrlich and Lee (1969) 
conclude that high dogmatics are generally 
more resistant to change. 

It has been suggested that ‘innovators and 
early adopters tend to be more creative than 
later adopters [Zaltman, 1965, p. 461.” Re- 
latedly, Rokeach (1965) predicted open-mind- 
edness to be a necessary precondition for crea- 
tivity, and Zagona and Zurcher (1965) and 
Jacoby (1967) have adduced supporting 
evidence. 

Based upon both theoretical rationale and 
empirical evidence, it is predicted that low dog- 
matics will select significantly more innova- 
tions than will high dogmatics. 

Last, Engel, Kollat, and Blackwell (1968) 
note that the most often used operational defi- 
nition of innovation is “any form of a product 
that has recently become available in a mar- 
ket [p. 545].” Accordingly, totally new pro- 
ducts (e.g; television), new brands, modifica- 
tions of old brands (e.g. the annual model 
change on automobiles), and even a brand pre- 
viously not available in a given geographical 
area are thus considered innovations. The 
present investigation utilizes this definition. 


METHOD 


Subjects. The Ss were 60 18-26 year old coeds attend- 
ing Purdue University during the Fall 1969 Semester. 


Instruments. To obtain an index of innovation prone- 
ness, 15 sets of photographs, containing 5 pictures per 
set, were collected for 15 different product classes. One 
of the five pictures in each set was of an innovation (as 
defined above), while the remaining four 
ictures displayed older or more traditional varieties of 
roduct, either in terms of prand, style, or function. 

For example, one product class contained pictures of 

» dress-length coats (ie., knee length to 5 
i knee) and one maxicoat (fashion innova- 
tion). Another product class, cigarettes, pictured one 
pack of Virginia Slims (brand ic 
and Parliament. A third product 
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TABLE 1 


T SNOV. ¢ INVESTIGATED 
PRODUCTS AN D TYPE OF INNOVATION INVESTI 


| Type of 

Product | Category | innovation 

| | 

i itioner | Cosmetics New brand 
7. — Cosmetics New brand 
3. Hair lightener | Cosmetics | Functional* 
4. Perfume | Cosmetics | New brand 
5. Blush-on | Cosmetics | Functional* 
6. Make-up base | Cosmetics Functional* 
7. Men’s cologne | Miscellaneous | New brand 
8. Diamond ring | Miscellaneous Fashion 
9. Beer Miscellaneous New brand 
10. Luggage Miscellaneous Fashion 
11. Cigarettes Miscellaneous | New brand 
12. Shoes Clothing Fashion 
13. Slack outfit Clothing Fashion 
14. Panty-hose Clothing Functional* 
15. Coat Clothing Fashion 


s Functional = new form of product or method of use (e.g.. 
foam vs. liquid make-up base). 


class exhibited four regular type pantyhose and one of 
the lower cut hiplet variety (functional innovation). 

The 15 products, classified according to category and 
type of innovation examined, are presented in Table 1. 
Across six cosmetic, four clothing, and five miscella- 
neous products, there were six new brands, five fashion 
innovations, and four functional innovations. 

All five pictures in each product set were mounted in 
a randomized circular array on poster board so that the 
complete set could be viewed simultaneously. Pictures 
within each set were matched for size, color versus 
black-and-white, branded versus unbranded, photo- 
graph versus sketch, and container type (e.g., bottle 
vs. can). 

The standard 40-item Form E of the Rokeach 
Dogmatism scale (Rokeach, 1960) was used to measure 
dogmatism. 

Procedure. The Ss who were run individually, were 
instructed to select one out of the five items in each set 
that they would buy if given the choice. The Æ con- 
tinued: “Do not let the colors of the products or brands 
affect your decision. For example, do not choose a lip- 
stick for the particular shade pictured. Moreover, if 
you do not use a set of products, buy for someone else.” 
Each S received an innovation score ranging from 0 to 
15, depending upon the number of times she selected 
the innovation contained in each of the product sets. 

The 15 sets of pictures were presented in the same 
order, and each S proceeded at her own pace. The Dog- 


matism scale was administered upon completion—again 
with no time limit. 


RESULTS 


Based upon their dogmatism scores, Ss were 
split into two groups at the median. Table 2 
indicates that the between-group difference in 


dogmatism scores was significant at beyond the 
001 level. As predicted, there was a significant 
overall difference (¢ = 10.87; p< .01, one- 
tailed test) in innovation scores between the 
two groups. 

Padme in analysis indicated that 
the low dogmatics gave more innovation eo" 
sponses than the high dogmatics to 11 out 0 
the 15 product sets. Four of these differences 
(Products 7, 9, 11, and 12 in Table 1) were 
significant beyond the .05 level, one-tailed 
test. The remaining 11 significance tests yielded 
probability values in excess of .20. . 

Table 3 provides the analysis of innovative 
responses according to both type of product 
and type of innovation. Low dogmatics were 
considerably more likely (p < .001) than high 
dogmatics to select new brands of products; 


especially in the miscellaneous category (p 
x 05). 


Discussion 


As predicted, low dogmatic individuals tend 
to make significantly more innovative selec- 
tions than do high dogmatic individuals. How- 
ever, the relationship, while significant, does 
not appear to be exceptionally strong. Only 
10% of the variance is accounted for by the 
correlation between dogmatism and innovation 
proneness (r = — .32). Moreover, the mean 
low dogmatic innovation score of 3.27 is only 
-27 above, and the high dogmatic mean only 
‘83 below, the 3.00 value to be expected on a 


chance basis (1/5 innovations X 15 product 
sets). 


TABLE 2 


SIGNIFICANCE OF THE DIFFERENCE IN MEAN 
INNOVATION Scores oF Hicit 
AND Low DOGMATICS 


Innovation Low High 
scores dogmatics* | dogmatics* 

Dogmatism 

Em 118.40 161.13 10.87** 

14.53 

Difference Ra 

xX 3.27 2.17 2,53" 

SD 2.03 1.23 
Note.—Dogmatism X Innovation Index:r = — 32.2 < oe 
*p <.01 (given het une 


ley sor ear 1 
df = '29 tef. Edwards, ggg eenety of variance an 


60, p. varices T 
** b < 001 (one-tailed testy, OBJ: one-tailed te 9 


f 


New PRODUCT ADOPTERS 


From an applied viewpoint, however, the 
notion of practical significance is more im- 
portant than statistical significance or even the 
size of the relationship. To the extent that 
marketers and advertisers can capitalize on 
knowledge regarding the relationship between 
dogmatism and innovation proneness, and can 
thereby increase the likelihood of gaining that 
all-important toehold, the relationship is im- 
portant. How, then, might this information be 
applied? 

Tt has been suggested that when introducing 
new products, “advertisements should empha- 
size the prestige of owning the item—to appeal 
specifically to the early adopters (Zaltman, 
1965, p. 52].” Given that low dogmatics are 
more innovative and tend to adopt earlier than 
high dogmatics, then quite the opposite predic- 
tion should be made. In describing the essen- 
tial differences between high and low dogmatics, 
Rokeach (1960) states that low dogmatics 
“act on relevant information from the outside 
of its own intrinsic merits, unencumbered by 
irrelevant factors in the situation arising from 
within the person or from the outside [p Si” 
Trrational egomotives and the need for self- 
aggrandizement are cited by Rokeach as €x- 
of irrelevant internal factors, while 
arising from peers, reference groups, 
authority figures, social, institutional, and 
cultural norms are cited as examples of irrele- 
vant external factors. High dogmatics prefer to 

the pronouncements of authori- 


rely heavily on } t n 
ties and tend to accept information primarily 


on the basis of who said it (i.e. the message 
source), while low dogmatics tend to act based 
upon their independent evaluation of the infor- 


mation itself (i.e., message content). Substan- 


tial support for these relationships are 
found in the socjal-psychological literature (cf. 


Ehrlich & Lee, 1969; Powell, 1962; Rokeach, 
1960; Vacchiano et al., 1969). , 
Translated into applied terms, this suggests 
that the marketer-advertiser introducing anew 
product would be wise to have 
ments and point-of-purchase display 
the product’s attributes rather tha 


amples 
pressures 


irrelevant ego oF social factors. As a ee 
thetical example, advertisers of @ ee nae 
would 


a new type of filter (ergo, a new product 
d describe the filter rather 


do well to stress an Toa 
than emphasize people enjoying the cigarette 


TABLE 3 


FREQUENCY OF INNOVATIVE RESPO: 


aaa 


or PRODUCT AND Type OF INNOVATION 
= Low | High | = 
Type | dogmatics | dogmatics | P 
| 
Product | | 
Cosmetics* 31 23 | 1.19 
Clothing? | rà ll 1.29 
Miscellaneous" 50 31 4.406* 
Innovation 
New brand* 48 | 20 11.53%) 
Fashion® 29 | 28 09 
Function” | Bi ae Oy 42 


tos (one-tailed test). 
=p < 001 (one-tailed test). 


in social situations. Such differential reactivity 
to prestige versus factual appeals could be 
tested using an approach similar to Kassarjian 
(1965). 

Last, it would appear that a multiple-indi- 
cator approach, especially when coupled with 
theoretically based a priori hypotheses, has 
great promise for the study of innovators. Had 
a single indicant criterion (i.e, studying the 
characteristics of innovators based on their 
responses to only one innovation) been utilized, 
the predicted significant relationship would 
have been manifested in only 4 out of 15 in- 
stances, and the likelihood of concluding that 
no relationship existed thereby increased. 

In conclusion, low dogmatic individuals were 
found to be significantly more likely to be 
innovators than high dogmatic individuals. To 
the extent that such innovators are also more 
likely to prefer objective information, this sug- 
gests that advertisers seeking to establish a 
market position for an innovation ought to 
focus on presenting factually oriented informa- 
tion rather than emphasizing irrelevancies. 


REFERENCES 


BAVER, R. A. Consumer behavior as risk-taking. Pro- 
ceedings of the American Marketing A ssocialion, 1960, 


. M; CouGHENOUR, C. M., LionBERGER, 
H. F., MOE, E. O., & ROGERS, E. M. Adopters of new 
farm ideas: Characteristics and communications be- 
In H. H. Kassarjian & T. S. Robertson 
ses in consumer behavior. Glenview, 


havior. I 
(Eds.), Pers pectt® 
is: Scott, Foresman, 1968. 


388 


Coteman, R. P. The significance of social stratification 
in selling. Proceedings of the American Marketing 
Association, 1960, 171-184. , 

Cox, D. F. (Ed.) Risk taking and information handling 
in consumer behavior. Boston: Graduate School of 
Business Administration, 1967. 

Epwarps, A. L. Experimental design in psychological 
research. New York: Holt, Rinehart & Winston, 1960. 

Enrucn, H. J, & Lee, D. Dogmatism, learning, and 
resistance to change: A review and a new paradigm. 
Psychological Bulletin, 1969, 71, 249-260. 

ENGEL, J. F., KEGERREIS, R. J., & Brackwent, R. D. 
Word of mouth communication by the innovator. 
Journal of Marketing, 1969, 33, 15-19. 

Encet, J. F., Korrat, D. T., & BrackwELL, R. D. 
Consumer behavior. New York: Holt, Rinehart & 
Winston, 1968. 

Frang, R. E., Massy, W. F., & Morrison, D. G. The 
determinants of innovative behavior with respect to 
a branded, frequently purchased food product. Pro- 
ceedings of the American Marketing Association, 1964, 
312-323. 

Jacosy, J. Open-mindedness and creativity. Psycho- 
logical Reports, 1967, 20, 822. 

Kassarjian, H. H. Social character and differential 
preference for mass communications. Journal of 
Marketing Research, 1965, 2, 146-153. 

Kine, C. W. Fashion adoption: A rebuttal to the 
“trickle down” theory. Proceedings of the American 
Marketing Association, 1963, 108-125. 

Mikor, B. The enjoyment of new musical systems. In 
M. Rokeach (Ed.), The open and closed mind. New 
York: Basic Books, 1960. 

Powett, F. A. Open- and closed-mindedness and the 
ability to differentiate source and message. Journal 
of Abnormal and Social Psychology, 1962, 65, 61-64. 

Pyron, B. A factor analytic study of simplicity-com- 
plexity of social ordering. Perceptual and Motor Skills, 
1966, 22, 259-272. (a) 

Pyron, B. Rejection of avant-garde art and the need 


for simple order. Journal of Psychology, 1966, 63, 
159-178. (b) 


Jacos JACOBY 


Pyro, B., & Lampert, P. The generality of simplicity 
complexity of social perception in a high choy 
population. Journal of Psychology, 1967, 66, 265-213 s 

Rosertsoy, T. S. Social factors in innovation behavior- 
In H. H. Kassarjian & T. S. Robertson (Eds), 
Perspectives in consumer behavior. Glenview, Tllinois: 
Scott, Foresman, 1968. ; 

Rocers, E. M. Diffusion of innovations. Glencoe, 
Illinois: The Free Press, 1962. . 

Rocers, E. M., & STANFIELD, J. D. Adoption and ae 
sion of new products: Emerging generalizations any 
hypotheses. In F. M. Bass, C. W. King, & P 
Pessemier (Eds.), Applications of the sciences *" 
markeling management. New York: Wiley, 1968. : 

Roxeacu, M. The open and closed mind. New York: 
Basic Books, 1960. 

Roxeacu, M. In pursuit of the creative process. | In 
G. A. Steiner (Ed.), The creative organization. Chi- 
cago: University of Chicago Press, 1965. 

Roxeacu, M., Swanson, T. S., & Denny, M. R. The 
role of past experience: A comparison between chess 
players and non-chess players. In M. Rokeach (Ed.); 
The open and closed mind. New York: Basic Books; 
1960. 

Roxeacu, M., & Viputicn, R. N. The formation of new 
belief systems: The roles of memory and the capacity 
to entertain. In M. Rokeach (Ed.), The open and 
closed mind. New York: Basic Books, 1960. 

Vaccutano, R. B., Strauss, P. S., & Hocuman, L. The 
open and closed mind: A review of dogmatism. Psy- 
chological Bulletin, 1969, 71, 261-273. 

Zacona, S. V., & KeLLY, M. A. The resistance of the 
closed mind to a novel and complex audio-visual ex- 
perience. Journal of Social Psychology, 1966, 70, 

123-131. 

Zacona, S. V., & ZurcHeR, L. A. Participation, inter- 
action and role behavior in groups selected from the 
extremes of the open-closed cognitive continuum. 
Journal of Psychology, 1965, 60, 213-219. 

ZALMAN, G. Marketing: Contributions from the be- 


havioral sciences. New York: Ha Brace 
World, 1965, anne 


(Received March 2, 1970) 


$$ 


_ eee lied Psychology 
1971 Vol. 38. No. 6380-302 


COMPARATIVE STUDY OF ARTISTS AND NONARTISTS ON 
DESIGN JUDGMENT TEST * 


MAITLAND GRAVES 


THE 


H. J. EYSENCK AND M. CASTLE* 


Institute of Psychiatry, University of London 


‘The Maitland Graves Design Judgment Test was administered to 229 male and 236 
female controls (nonart students), and 519 male artists and 478 female artists (art 
students). Only very slight differences were observed between the experimental and 


control groups. Following the lines of a previous factor analysis 0! 


f the test, scores 


were derived for three subsets of designs (symmetrical, three-dimensional, complex) 3 
these scores were negligible correlated with each other. Conspicuous differences be- 
tween artists and controls were only found for the first of these sets; it was also 


found that scores on this set were more 


homogeneous for artists than for controls. 


The use of overall scores on a heterogeneous test is criticized, and the development 
of a more homogeneous test based on symmetrical designs only is suggested. An 
explanation is suggested for the differences in results reported by Graves and the 


present authors. 


The Maitland Graves Design Judgment Test 
(1947) is based on certain theories of artistic 
creation and appreciation (Graves, 1941). It 
contains 90 pairs or triads of designs one of 
which has to be chosen by S as aesthetically 
superior to the other(s). One particular choice 
in each case is considered “right” (and scored 
+ accordingly) on the basis of agreement with 
the author’s theories and the agreement of art 
teachers on the superiority of that particular 
design. Graves (1948) reports that art students 
achieve higher scores on the test than nonart 
students; mean scores for college students are 
76 and 48, respectively, for high school stu- 
dents, 56 and 38, respectively. These values, 
taken from Graves (1948, p. 3, Table 1) are 
typical of others given in Graves (Tables 2 & 
3y p- 4). The author concludes that “the test's 
ability to differentiate the art groups from the 
s unmistakably clear [p- 21. 

The theories advanced by Graves ae diel 
siderable psychological interest, and if his test 
did in fact succeed in differentiati 
between artists and nonartists, 
important implications for a theor) 
thetics quite apart from the pract 
the test as an instrument of selection. 


i the Social science Re- 
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possible. e i 
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going Editor, Dr- Kenneth E. Clark. 
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Impressive as these figures are, there are 
nevertheless certain doubts that require to be 
set at rest before one can accept the test as @ 
measure of aesthetic aptitude. In the first place, 
the data presented are restricted to American 
students at one point of time; the possibility 
cannot be ruled out that artists in other 
countries, or even in the United States at other 
periods of time may show different preferences 
to those endorsed by Graves’s (1948) experts. 
The alleged objectivity of aesthetic criteria has 
not been demonstrated sufficiently to allow us 
to disregard such a possibility, and accordingly, 
the present study has repeated Graves’s 
attempt with a sample of art and nonart stu- 
dents differing from his by being citizens of the 
United Kingdom, and by being tested some 25 
yr. later. The authors have also taken care to 
keep separate male and female students, as the 
possibility cannot be gainsaid that sex differ- 
ences may interfere with the establishment of 
clear-cut art versus nonart student differences; 
little is known about this. 

In the second place, Graves (1948) has not 
established certain important psychometric 
properties of his scale. This scale is considered 
to be a measure of a single aesthetic property, 
put no evidence is offered of this proposition. 
Graves claims as one of his criteria for reten- 
tion of items “greater preference for the design 
by those who achieved high scores on the entire 
test than by those receiving low scores (internal 
consistency) [p. 2],” but by itself this criterion 
is of doubtful value, and in any case no details 
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of its application have been published. ES ENC 
(1967) has published details of a factor analytic 
study of the 90 items of the test, and has shown 
that several different factors appear to deter- 
mine choice of design, rather than the one sin- 
gle factor suggested by Graves. Three factors 
appeared to be particularly worthy of study in 
this connection. Factor 1 (symmetry) is best 
represented in Items 8, 21, 28, 29, 47, 48, 50, 51, 
57, 59, 61, 69, and 72, all of which had loadings 
of above .5 on this factor. These items are all 
concerned with pairs of designs in which one 
shows excessive symmetry, while the other 
(preferred) design departs from strict sym- 
metry. Factor 2 (three-dimensional) consists 
of Items 80-90, all of which are three-dimen- 
sional, in contrast to the other items which are 
two-dimensional. Factor 3 (complexity) con- 
sists of Items 1, 12, 26, 40, 55, and 74; these are 
all characterized by being very complex draw- 
ings, in contrast with the essentially simple 
drawings which make up Factor 1. Scores were 
obtained for all our groups on these three fac- 
tors, using Graves’s key, as well as on the total 
scale. 

Four groups were tested in all: 
students (V = 519), female art 
= 478), male nonart students (N 
female nonart students (N = 236). The Ss 
were divided into artists and controls on the 
basis of whether or not they had received for- 
mal training in the visual arts, To make more 
precise our use of terms, we will list the types 
of courses that those groups under the term 
“art students” had attended, or were attend- 
ing. These courses included graphic design, 


male art 
students (N 
= 229), and 


TABLE 1 


Scores OF Artists 


AND CONTROLS on 
TOTAL TEST AN 


D THREE SUBTESTS 


| Control Artist 
Test or subtest i |e eater aces 
Fe- Fe- 
M Males | *€ 
| ae male | Male maled 
Sı (Total) | 52 | 56 | 58 | 5g 
S2 (Symmetry) 7.3 8.1 94 9.3 
Ss (Three-dimensional), 65 | 74 | 66) 74 
Si (Complexity) | 36 | Bo | 3.8 | 3.6 
an = 229, 7 a 
by 6. 
en 5 
dn= 478 
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fashion design, fine arts (painting and sculp- 
ture), architectural studies, typographic mine 
textile design, cinematography, ioe ange 
industrial design, theater design, interior o 
sign, stained glass and ceramic design, silver 
and jewelry design; included were also some 
post-graduate students and professional artists. 
By contrast, the controls included students 
of surveying, law, commerce, st 
hairdressing, electrical engincering, a 
engineering, secretarial students, hotel Woks 
and catering students, student linguists, e 
fitting apprentices, and day release (ere 
trade) students. Ages of both groups ranget 
from 16 to 25 yr., with only a few above the 
latter age; the great majority were around 20 
yr. old. The groups differed with respect to 
their preference judgments on a test of polyg- 
onal forms (Eysenck & Castle, 1970), with the 
artists preferring the simpler forms. The artists 
were also found to have higher scores on the 
N scale of the Eysenck Personality Inventory, 
but not on the E scale. 

Presentation of the n 


aterial was by way of 
slides shown to medi 


um-sized groups of stu- 
dents. Close supervision ensured that no cheat- 
ing or talking took place. Seating was arranged 
in such a way that little angular distortion 
occurred. It does not seem likely that this 
variation from the traditional method of 
administration of the test would have given 
rise to differences in Scoring, although of course 
it is impossible to be certain of this. 
Results of the experiment are shown in Table 
1. It is clear that total scores give minimal 


differentiation between artists and controls, 
with the former scori 


Significantly superior to 
001), but male artists are 


erent from female controls. 
S are rather 


th 


2 


reported by Graves was 
28 points, Clear] a : 


Y, our study has essentially 
Graves’s original findings- 


CoMPARATIVE STUDY OF 


Scores on the symmetry factor show a some- 
what different picture. Here artists are clearly 
superior to controls, for both sexes; in each 
case p < .001. Female controls are superior to 
male controls (p < .01), but not female artists 
to male artists. Scores on the three-dimensional 
factor show an overall level of significance of 
p < .001, but this is entirely due to the 
superiority of the female Ss over the male Ss 
(p < .001 for controls, and p < .01 for artists); 
male artists do not differ from male controls, 
and female controls are actually superior ($ 
< .05) to female artists. Scores on the complex 
designs again show overall differences at the 
p < .001 level. Female artists are better than 
female controls (p < .001), but male controls 
are not differentiated from male artists. Male 
controls are superior to female controls (p 
< .001), but male artists are not superior to 
female artists. These data demonstrate that it 
is essential to treat men and women differently 
in the analysis of the data, and also that the 
factors isolated in the Graves (1948) Design 
Judgment Test must be analyzed separately, 
as otherwise much important information is 
lost. 

The original factor analysis had been done on 
a group of nonartists, and it is possible that 
for artists the factors there demonstrated might 
not be applicable. If this were so, then the 
correlations between subsets (Sz, Sa and S;) 
would be expected to be substantially larger 
e artists, and significantly 
trol group. The observed 
correlations between these sets are given in 
Table 2, together with the correlations os ge 
set and Sy the total score. The total sor ios 
course, incorporates the subscores, 50 that $ = 
correlations are artificially inflated. Howey er, 
as the subsets are roughly of equal size, com- 


parisons between them are still meaningful and 
all four groups 


possible. It will be noted that for : 
the intercorrelations between Se, S3, and S; are 
very low, and even on occasion negatives this 
substantiates the validity of the original factor 
analysis. There is no evidence here that a 
have higher correlations than controls. Correta- 
tions of Sı with the other scales shows that S: 
is very highly correlated with the total peal, 
but that the correlations of Ss and S4 with Si 
are quite low and hardly above the level of the 
artificial correlation produced by the fact that 
the items involved in these scales also appeal 


than zero for th 
different from the con 
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TABLE 2 


RRELATIONS OF MALE CONTROLS (MC), 

FEMALE CONTROLS (FC), MALE ARTISTS 
(MA), AND FEMALE ARTISTS (FA) 

FOR SUBSETS 


INTERCO 


| 
Subset S: | S: Ss 
Si E 
MC .85 AT 25 
FC 8S 16 38 
MA 82 35 35 
FA | „81 27 40 
| 
S: 
MC | —.04 03 
re | —02 20 
MA | 14 .16 
Fa | ‘05 26 
Ss | 
MC 
FC 
MA 
FA 


in Sı. We may conclude that Ss and S4 not only 


fail to differentiate properly between artists 
and controls, but that they also fail to correlate 
with Ss, which does so discriminate. Clearly, 
whatever validity the total scale may possess 
derives essentially from the items in Sp; items 
making up Ss and S; could with advantage be 
jettisoned as they contribute nothing but error 
and unreliability. The short 13-item S: scale 
discriminates better between artists and con- 
trols than does the whole 90-item scale includ- 
ing these 13 items. 

It seems likely that the artist group js more 
homogeneous than the control group for aes- 
thetic ability, and a look at the variances of the 
various scores used may be useful. Ss and S; do 
not show any significant differences; Sı does so 
ata p< 05 value. Most striking, however, is 
the difference in variances for Se, which is sig- 
nificant at the p < 001 level. The actual vari- 
ances are: 3.15? (male controls), 3.61° (female 
controls), 2.84 (male artists), 2.87° (female 
artists). It is clear that on those subsets of 
items which discriminate best between artists 
and controls, the artists have lower variances 
than the controls. This is very much what one 
might have expected. Some psychologists 

could argue that such differences in variance 
T inapplicable the F test and / test used, 
but mathematicians would not uphold such 
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nvestigating the possible interaction between 
-atio of value and exposure duration. 


METHOD 
Subjects 


vere 80 male volunteers from Syracuse 
BEA The Ss competed for monetary awards or 
$10.00, $5.00, $3.00, and $2.00 that were given p t z 
four best performers, as determined by the number o 
points accumulated, within each group of eight Ss. 


‘Apparatus 


‘The stimulus symbols were typewritten capital letters 
from the alphabet. To allow for a comparison between 
high- and low-value stimuli, the alphabet was split into 
the subsets of A-J and Q-Z. The letters KP were never 
presented in order to provide a clear distinction between 
the subsets. F 

The stimulus displays consisted of 2x 2 negative 
slide projections with either 4, 8, or 12 white letters on a 
black background. Each level of load, that is, the num- 
ber of letters per slide, was replicated 15 times with a 
different random sample of the alphabet for a total of 
45 test slides. Each letter appeared an equal number of 
times during the 45 trials, and all slides contained an 
equal number of letters from each stimulus subset. 
Following selection, the letters were randomly assigned 
to a location in a 10 X 10 rectangular matrix with the 
restriction that half the letters selected from each subset 
be placed in the center 36 positions of the matrix and 
the other half placed in the 64 peripheral positions. The 
letters were then typed in appropriate locations on 
sheets of white, unlined paper that were photographed 
for slides. 

A Kodak Carousel slide projector with an externally 
mounted shutter presented the slides, while automatic 
programming equipment controlled the exposure dura- 
tions and interslide intervals. The projected size of the 
matrix area of slides was 43 X 60 in., while the pro- 
jected size of the letters was 1 in. The Ss sat in student- 
type arm chairs arranged in two rows of four each with 
the average distance from Ss’ eyes to the screen being 
12 ft. from Row 1 and 15} ft. from Row 2. 


Procedure 


Eight Ss were assigned to each of 10 groups derived 
from the combination of five stimulus exposure dura- 
tions and two value ratios. The exposure durations used 
were .5, .75, 1.0, 2.0, and 3.0 sec. per slide. The ratios 
between high- and low-value letters were 2:1 and 16:1 
with one point always given for each correct report of a 


letter in the low-value subset and either 2 


d or 16 points 
given for cach of the correctly 


reported letters from the 
high-value subset. Four Ss within each group of eight 


performed with the subset from the first half of the 
alphabet (A-J) as the high-value categories, while the 
other four had Q-Z as the high-value categories. All Ss 
viewed the identical slides and thus performed under 
all levels of load (4, 8, and 12 letters per slide). 
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The Ss served for approximately 45 min. mgt 
first 15 min. reserved for instruction and Seg eer 
Ss were informed that letters could appear at al T a 
on the screen and that points would be ce lees 
each letter reported correctly in their answer rc 
‘All Ss were further instructed that the number ot p! Pa 
received for any particular letter was equal to nie Te , 
assigned to the population in which it was conta aion 
and that aside from the possibility of being droppe! ae 
random guessing, there would be no penalty, or loss ie 
incorrect answers. A card, permanently in view, on oa 
answer booklets informed them of the ratio of Ya 
and of the designated values for each population su n 

Nine practice trials were given with the approp 
exposure duration and intertrial interval to insure ie 
cach S understood the task. On four of the trials, A 
slide was reexposed to allow § to tally the jambora 
points he accumulated, Following practice, all Ss p ie 
the 45 slides in the same randomly presented or¢ ye 
were given a short rest, and then given the Same. 4 
slides in a new order. The Ss were allowed 10 sec. a 
each report. This was followed by a .5-sec. warning eee 
and a 3-sec. interval to allow Ss to adjust the agane 
booklet for the next presentation. No feedback was 
provided during the testing period. 


RESULTS 


The data were analyzed using a number of 
different response measures to assess Ss 
strategy and ability for processing information 
of different value levels. Although there are & 
considerable number of possible response mea- 
sures, those selected for analyses and presenta- 
tion were those which appeared to produce 
inconsistent results in the previous literature. 
Although blocks of trials (Trials 1-45 vs- 
Trials 46-90) were analyzed and were & 
significant variable with some of the measures, 
these findings are not presented, since they do 
not alter the general findings and conclusions 
of the major variables of interest, 


Initial Responses 


These data were first analyzed for the per- 
centage of times that a high-value letter was 
the first letter recorded 
the accuracy 
1965). E; 


by an S regardless of 
of the report (Christ, 1969; Taub, 
ach percentage was based on a maxi- 
mum of 15 possible responses at each level of 
load for each block of trials. The results of an 
analysis of variance indicated that the per- 
centage of high-value initial reports increased 
directly with exposure duration (F = 15.70, 
df =4/70, p< 01), ratio (F = 14.57, df = 1/10, 
p < 01) and their combination (F = 2-61; 
df = 4/10, p < 05). Figure 1, a plot of the 
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Fic. 1. Percentage of initial responses that were 
high-value letters as a function of exposure duration 
and ratio. 


Ratio X Exposure interaction, shows that at .5 
and .75 sec. durations Ss made initial responses 
of high-value stimuli only about 50% of the 
time with both ratios, and that although the 
proportions of these responses increased 
directly with exposure, increases were larger 
for those Ss performing with a 16:1 ratio. The 
significant effects involving load indicated that 
the percentage of high-value initial responses 
increased with the number of letters on a slide 
(F= 25.95 df = 2/140, p < 01) and that the 
size of the differences among loads varied 
directly with exposure duration (F = 6.08, 
df = 8/140, P < 01). The means underlying 
the main effect of load were 52.88% at 4 letters, 
62.59% at 8 letters, and 63.67% at 12 letters 
per slide, while the respective means were 
51.34%, 51.34%, and 53.19% at .9 Sec- and 
56.47%; 30.06%, and 82.88% at 3.0 sec. No 
other interactions were significant with this 
performance measure. - 
As a check on the findings for exposure an 


ratio, the data were also scored and analyzed 
t initial reports © 


high- and low-value letters 

1968). In general, these findin 
with those of the previous analysis an< 
cated that the following effects were significa 
exposure (F = 3.89, 
(F = 84.15, df = 1/70, P 
posure (F = 14. 4 
X Ratio (F = 11.95, df = 
Value X Exposure xX Ratio 
= 4/70, p< 05). Figure 2, a plot of 
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: ~ Accu- 
Source No; ua, || racy of 
reports 
Exposure duration Œ) zsgpr | 5,74% 
Ratio (R) 1 5.67* 1.67 
YR 4 1.80 175 
E XR 70 
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Fic. 3. Percent-correct transmission as a 
function of value, ratio, and exposure. 


of percent-correct transmissions of high- and 
low-value stimuli, defined as the number cor- 
rect divided by the number of stimuli pre- 
sented (Christ, 1969; Taub, 1965) indicated 
that more of the high-value than low-value 
stimuli were transmitted correctly and that the 
differences between high- and low-value trans- 
missions increased with both exposure duration 
and ratio of value. Although the main effect of 
ratio was not significant, the data underlying 
the significant Value X Ratio interaction sug- 
gested that as ratio was increased, the major 
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trend was toward a decrement in transmissions 
of low-value stimuli. However, Figure 3, E 
summary of the Value X Exposure X Ratio 
interaction, shows that this trend was not 
present at all exposure durations. This latter 
interaction indicated that while the 16:1 ratio 
only led to decreases in low-value transmissions 
when compared to the 2:1 ratio at the .75- and 
1.0-sec. exposure durations, there was an mM- 
crease in high-value transmissions as well as & 
decrease in low-value transmissions at the 
longer durations of 2.0 and 3.0 sec. 

Table 1 further shows that a number of 
effects involving load were significant with this 
performance measure. The means underlying 
the main effect for load indicated that 79.4%) 
49.5%, and 36.3% of the stimuli were trans- 
mitted correctly with 4, 8, and 12 letters pet 
slide, respectively. The data underlying the 
significant interaction of Load X Exposure 
indicated that the differences with load in- 
creased with exposure duration, while inter- 
actions involving Load X Value and/or Ratio 
were mainly due to the finding that these 
variables affected transmission only with loads 
of 8 and 12 letters per slide. Table 2 presents 
the scores for the Load X Value X Ratio 
interaction and further illustrates the data 
underlying the Value X Ratio interaction. 

Analysis of the number of responses of high- 
and low-value stimuli regardless of accuracy 
(Christ, 1969; Hearns & Moss, 1968) revealed 
that the trends underlymg the findings for 
exposure, value, and ratio were identical in 


TABLE 2 


Percent-Correct TRANSMISSION AND MEAN Ni 
AS A FUNCTION OF Loan, VA 


UMBER OF RESPONSES PER TRIAL 


LUE, AND RATIO 


j) 


Letters per slide 
4 8 i i 
Measure Value ? 
Ratio 
i) : P | ae alesse eae se aan 
o 2:1 16:1 2:1 16:1 2:1 16:1 
% transmission High 79.16 80.44 55.49 | 57.98 41.58 44.66 
Low 7946 | 7884 | asso | Sean | Gree 25.55 
: | } 3 - y 
X no, responses per trial High 1.80 1.78 5 
i 2.55 2.56 2.82 220 
Low 1.78 1.68 211 | 176 222 1.69 
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shape to those presented in Figure 3. Similarly, 
for the Value X Ratio interaction, the 2:1 and 
16:1 ratio groups made 2.38 and 2.41 high- 
value responses and 2.03 and 1.71 low- zalue 
responses, respectively. However, unlike with 
percent-correct transmissions, there was a small 
but significant difference due to ratio with less 
responses per trial being made at 16:1 (4.13) 
than at 2:1 (4.43). As in the previous analysis, 
there were a number of significant effects due 
to load. The data underlying the main effect 
indicates that the number of responses in- 
creased directly with load with 3.52, 4.49, and 
4.81 responses made with 4, 8, and 12 letters 
per trial, respectively. Further, although the 
direction of effects are opposite to those with 
percent-correct transmission, the underlying 
differences are the same. That is, the difference 
due to load increased with exposure duration, 
while value and ratio had no effect on the num- 
ber of responses with four letters but produced 
differences at both 8 and 12 letters. An example 
of the data underlying these effects are pre- 
sented in Table 2. 

Analysis of the accuracy of reports, defined 
as the number of correct responses divided by 
the total number of responses (Hearns & Moss, 
1968), suggested that the variables had less ofa 
consistent effect on this measure than on the 
other measures of performance. The significant 
effects due to exposure, value, and their inter- 
action indicated that accuracy increased 
directly with exposure duration (83.14%, 
86.30%, 86.5770, 92.76%, and 95.18%) and 
that although accurac , was slightly higher for 


the low-value letters (88.23% for high value, 


and 89.35% for low value), this trend was only 
er durations with a 


present for 1 sec. and shorte : 
slight reversal in favor of high-value letters & 

2,0 and 3.0 sec. Ratio had little consistent foe 
and did not significantly affect accuracy either 


i inati 4 ue. 
alone or in combination with exposure OF val 


The interaction of Ratio X Value X Load was 


the only significant effect with rati ea 
appeared to be due to @ ae reversal in rete 

tive accuracy for hi h-value a ; 
value (50.9%) ae) for the 16:1 ratio condi- 
tion with 12 letters per tri 
affected accuracy, 

88.95% at 4 letters, 
90.42% at 12 letters indicate 


across load was not consistent. 
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interaction suggested that 
ae 
15h 


Load X Exposure 
accuracy varied directly with load at .5, 


and 1.0 sec. and inversely with load at 2.0 
and 3.0 sec. 


DiscussIOoN 


In general, the present results are consistent 
with suggestions that differential ratio of value 
has an effect on performance in a visual search 
task (Christ, 1969; Taub, 1965; Tolkmitt & 
Christ, 1970). The data underlying the major 
interactions with initial responses, mean num- 
ber of responses per trial, and percent-correct 
transmissions indicate that the differences 1m 
performance as a function of ratio increased 
as exposure duration increased. The shape of 
the interactions suggest that with short ex- 
posure durations the differences in percentage 
of stimuli correctly transmitted and number of 
responses per trial were, to a large extent, due 
to an inhibition of low-value responses aS ratio 
was increased, a finding which is consistent 
with the trends observed by Christ (1969) and 
Taub (1965). However, although the trends 
are similar, the present analyses as well as 
those of Christ (1969) and Taub (1965) suggest 
that the main effect of ratio may or may not be 
significant depending on the relative size of the 
increase in responses of high-value stimuli com- 
pared to the inhibition of low-value responses 
as ratio is increased. The present results extend 
these findings by suggesting that the amount 
of increase in high-value responding with in- 
creases in ratio may be directly dependent on 
duration of exposure. This latter result is con- 
sistent with the findings recently reported in a 
comparison of an equal value (1:1) with a 
differential value (4:1) condition at .5-sec. and 
2,5-sec. exposures (Tolkmitt & Christ, 1970). 

The current findings of interactions for ex- 
posure, ratio, and value with initial responses, 
mean number of responses per trial, and 
percent-correct transmissions suggest that a 
large part of the inconsistencies among studies 
of differential ratio of value with a visual search 
task may be related to the exposure durations 
that were used. Figures 1-3 clearly demon- 
strate that differences due to ratio are small, 
if present at all, at 5 and 75 sec., which would 
agree with the reported negative findings of 
Hearns & Moss (1968). T urther, although 
Christ (1969) did not find any differences with 
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RELATIONSHIP BETWEEN AGE AND RISK TAKING 
AMONG MANAGERS 


VICTOR H. VROOM! AND BERND PAHL 


Graduate School of Industrial Administration, Carnegie-Mellon University 


A short version of Kogan and Wallach’s choice-dilemma test was administered to 


1,484 managers from over 200 companies. The results show a signi 


ificant negative 


relationship between age and both risk taking and the value placed upon risk. The 
relationships are small in size when individual scores are correlated with age, but 
increase markedly when mean scores are used. Furthermore, the relationships are 
relatively stable across items and across companies. Developmental and socio- 
cultural mechanisms that might underlie these findings are discussed. 


The concept of risk taking has played a 
central role in psychological research during 
the last decade. A large amount of research has 
been conducted in an effort to determine the 
situational determinants of risk taking, in- 
cluding the mechanisms involved in the fre- 
quently reported finding that group decisions 
tend to be riskier than those of individuals. 

Somewhat less attention has been paid to 
individual differences in risk taking, including 
its relationship to age. Age is a frequently used 
variable in psychological research and its rela- 
tionship to a large number of diverse behaviors 
has been studied (Wohlwill, 1970). But little is 
known about age and risk taking, particularly 
among mature adults. In one of the few studies 
dealing with this question, Wallach and Kogan 
(1961) compared risk-taking behavior on their 
choice-dilemma tasks of college age and elderly 
men and women. The older Ss, both males and 
females, were found to be significantly more 


conservative than the college students. They 
fferences to the 


attribute the observed diff o 
financial anxieties that affect the aged in our 


society. F ; 
In a later review article (Kogan & W "= 
1967) the authors comment on the ae ee 
further exploration of the age-risk-takins 
relationship using less extreme age groups- 
of college students 
s of some interest, 
ct data at age mM- 
There is a striking 
are beyond 
ch. 


Although risk-taking comparisons 
and elderly individuals yield result 
it would be quite valuable to colle 
tervals over the entire adult life span. 
dearth of information about adults who 
college and not yet eligible for gerontological resear 
Ro sent to Victor H. 


al Administration, 
Park, Pitts- 


1 Requests for reprints should be, 
Vroom, Graduate School of Industri 
Carnegie-Mellon University, Schenley 
burgh, Pennsylvania 15213. 


There are the people who are least accessible, of course, 
because of occupational and family commitments. If 
we could collect systematic data on subjects in the 
thirty to sixty age range, we would obviously be able 
to specify a reasonably accurate age function for risk- 
taking. Is it a progressively declining smooth curve, 
or are there peaks and valleys [p. 168]? 


The relationship between age and risk taking 
is of interest not only for purely descriptive 
reasons but also for the insights that it might 
provide into the particular variables that 
actually determine or mediate the variation of 
behavior with age. In this article, data are 
presented concerning the relationship between 
age and risk taking for an occupationally 
homogencous sample of almost 1,500 mature 
adults. The form of the functional relationship 
between these two variables is examined, and 


possible mechanisms underlying it are 
discussed. 

METHOD 
Sample 


The sample consists of 1,484 male managers, em- 
ployed in over 200 corporations in a large variety of po- 
sitions. All were engaged in management-development 
programs at the time of measurement, and the data were 
collected as part of a classroom demonstration. The 
age of the sample was approximately normally dis- 
tributed with a mean of 39.34 yr., a standard deviation 
of 6.79 yr., and a range of 22-58 yr. 


Instrument 


Risk taking was measured by using a subset of the 
standard Kogan and Wallach (1964) choice-dilemma 
questionnaire. Items 1, 4, 7, 9, and 11 were presented 
in the given order. Briefly, these items consist of the 
following situations, all concerned with a safe, certain 
outcome versus a more desirable, riskier alternative: 


d engineer has to decide between a safe, 
d a fledgling company that offers more 
and advancement. 


1A marries 
secure job an 
responsibility 
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TABLE 1 
ITEM INTERCORRELATION MATRIX 

| ] | « 
Item | 1 | 2 | 3 | 4 | 5 

| | | 
1 | 1000 | | | 
2 056 | 1.000 | | 
3 | ‘087 | .020 | 1.000 | 
4 —.001 | .040 | —.045 1.000 | 
5 .166 | .083 j —.001 -095 | 1.000 


2. The captain of a college football team has to decide 
between a sure play to tie the arch rival, or a risky 
ee eo player is faced with a deceptive, risky 
maneuver that could expose him to certain defeat. s 

4. A prisoner of war is contemplating escape, knowing 
he will be executed if he is caught. 

5. A research physicist has to decide between re- 
search topics that would be either plodding and sure or 
risky and uncertain. 


Managers were tested in groups ranging in size from 
20 to 60 participants. Each manager was given a booklet 
containing the five problems and an answer sheet on 
which to record his responses. He was asked to indicate 
the minimum probability of success that would warrant 
acceptance of the risky alternative for each problem. 
Responses to each item were to be chosen from the 
odd numbers 1, 3, 5, 7, and 9, representing the chances 
out of 10 that the risky alternative would be successful. 
If a respondent indicated that he would not choose the 
risky alternative regardless of its probabilities of 
success, his response was scored as 10. 

In addition, each manager was asked to estimate 
whether the average participants in his group would 
respond with higher numbers to an item (ie., they 
would be more conservative) or lower numbers (i.e, 
they would be riskier). If he believed that others would 
be more conservative than he was on an item, he in- 
dicated this with a plus following his own response; 
if he believed that others would be riskier, he indicated 
this with a minus following his own response. Those 
who believed that their response was at the average 
were instructed to record no sign. 

Previous research (Hinds, 1962; Levinger & 
Schneider, 1969; Willems, 1969) has shown that Ss 
tend to estimate themselves to be riskier than their 
peers. This finding has been interpreted as evidence for 
a cultural value placed on risk. People judge themselves 
as riskier than their peers because they like to think of 
themselves as riskier. The judgments of relative 
riskiness were obtained both to verify the previous 
findings of consistent overestimation and explore the 
relationship between age and a potential measure of 
the value placed on risk. 

Following the completion of 
answer sheets were collected a 
divided into groups and 
on each problem. Only 
used in this article. 


The use of the 5-item scale rather than the 12-item 
version in this research was dictated by time consider- 


all five problems, the 
nd the managers were 
asked to make group decisions 
the individual responses are 
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ations. Undoubtedly, it had detrimental effects on the 
reliability of the scores obtained. There are two wie 
bases for estimating reliability for the short version o! 
the instrument. Kogan and Wallach: (1964) and 
Wallach and Kogan (1961) present reliability estimates 
from several samples between the odd and even items 
of the 12-item measure. When these are weighed by their 
respective Ns and stepped up by the Spearman BON 
formula, the result is an estimated reliability of the 12- 
item measure, rı = .62. Decreasing the test length to 
five items leads to an expected reliability of the measure 
used in this study of ra: = .4. When the item inter- 
correlations shown in Table 1, based on data obtained 
from this sample are used as the basis for estimation, 
however, the reliability of the 5-item measure was 
estimated at only .2. 

Either estimate indicates that the 5-item measure of 
risk taking is highly unreliable. This fact should greatly 
limit the size of any relationship between individual 
scores and age. The large number of cases, however, 
should still permit the determination of any relation- 
ships that do, in fact, exist. 


RESULTS 


Given the low intercorrelations among the 
items (F = .05), independence of items was 
assumed and a median test constructed over 
all people-items to test the null hypothesis (i.e., 
the age groups were sampled from risk- 
equivalent populations). The Ss ranged in age 
from 22 to 58 yr., and, therefore, constituted 
37 age groups. The median response for all 
items was 5, and the number of item responses 
below 5 and the number of item responses > 5 
was counted for each age group. This test 
generates a value for X? of 52,07 (df = 36, 
025 < p < .05), so the null hypothesis is re- 
jected and we conclude that there exists a 


significant age-riskiness relationship. 


The median test treats 
variable. To ascertain the actual nature of the 
functional relationship between age and riski- 
ness, we assume that the probability estimate 
scale is an interval scale and that the five items 
are to be weighted equally, 

Given these assumptions, a riskiness index 
(R) was calculated for each S as the arithmetic 
mean of his responses. The higher the score, the 
less risky the person. The overall correlation 
between age (4) and riskiness (R) is: 

rar = 0807 (p< .001). 

Both coefficients in the line regression line of 


R = 3.895 + -01394 
are significant at beyond the .001 level. 
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Given the reliability of the 5-item scale, data 
within age groups were aggregated and the 
sample mean for that age group was used as 
the best estimate of the population mean. This 
essentially has the effect of eliminating the 


within-group variance, which is a combination 
i and true variance, and 


of both error variance 
reduces the issue to & goodness-of-fit problem. 
age (A) and mean 


This correlation between 
riskiness-age (A) is: 
rape bb (= 498, df = 33 < 001). 


Figure 1 shows the plot of this 
included in Figure 1 are two regress) 
linear and a cubic. 

The coefficients in t 

R = 3.575 + -02034 


d the 001 level. A 
that the 


data. Also 
ion lines, & 
he linear regression line of 


are both significant beyon 
chi-square test indicates, however, 
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ASE 


Fic. 1. Mean riskiness versus age. 


residuals are not normally distri 
si Is ally ibuted 
Qe = 27.83, df = 9, p = 001). Inspection of 
E igure 1 suggests a third-order polynomial. The 
coefficients in the cubic regression line of 


R = 1.403 + .04064 — .00954? + .00007.A* 


are all significant at the .05 level. A chi-square 
test indicates that the residuals are normally 
distributed (X? = 8.2, df =9, p =-5). i 

These results indicate that risk taking as 
measured by the 5-item scale generally declines 
with age. From inspection of Figure 1 and the 
significance of coefficients in the cubic- 
regression line, it appears that the slope of the 
relationship is greatest in the age range 22-32 
yr., flattens out in the age range 33-48 yr., and 
increases again in the age range 48-58 yr. 

It remains to be seen, however, whether the 
relationship between age and risk taking 


applies to all five items or merely to a subset of 


le 
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TABLE 2 


CORRELATIONS BETWEEN AGE AND MEAN 
RESPONSE BY ITEM 


| 
Item TAR | t | p 
Engineer + a pra 
Football 04 E 5 m 
Chess player 47 AS a 
POW 28 1.73 or 
Researcher 44 2.93 .005 


them. Table 2 shows the correlations between 
age and mean response to each of the five items. 
All correlations are positive and, with the 
exception of Item 5, are significant at the .05 
level. ; 

It is tempting to conclude from these findings 
that age and risk taking as measured by the 
Kogan and Wallach (1964) items are correlated 
in the total population of managers. It is 
possible, however, that the relationship may 
be artifactual due to particular and unique 
properties of our sample. The 1,484 managers 
were drawn from many different companies and 
worked in different functional areas within 
management. If, in our sample, managers in 
different functions or companies differed both 
in age and in risk taking, and if these differences 
did not exist in the total population, then the 


TABLE 3 
MEDIAN Test, FUNCTIONAL AREA, AND RISKINESS 
No. 
No. equal 
Functional area below or Total 
Mdn. | above 
Mdn. 
Marketing 719 740 1459 
Finance 351 372 723 
Production 706 737 1443 
Personnel 180 167 347 
Research & development 273 304 577 
General management 253 246 499 
Engineering 418 441 859 
Systems 58 82 140 
Management stafi 43 50 93 
Other 514 560 1074 
Totals 3515 3699 7214» 


of failure to respond or response betwe 
numbers), Since this is only 3% 


en categori 
this is hardly significant, 


(i.e., even 
of the total response sample, 


relationship between age and risk taking might 
be spurious. ; i 

It is, of course, impossible to draw conclu- 
sions about the total population except insofar 
as it is revealed in our admittedly nonrandom 
sample. It is possible, however, to examine the 
existing data for relationships between un- 
controlled variables and both age and risk 
taking. 

Managers had categorized themselves as 
working within one of nine types of work: 
marketing, finance, production, personnel, 
research and development, engineering, sys- 
tems, general management, and management 
staff. Nonreports and those that did not fit into 
any of these categories were placed in a tenth 
category termed “other.” 

Table 3 shows the number of item responses 
below the median and above or equal to the 
median of those 10 categories. There appears to 
be a slight tendency for those in general 
management and those in personnel to be 
slightly riskier than the total population, but a 
median test conducted to test the null hypoth- 
esis yields a value for X? of 6.36 (d f = 9, p = .3)- 
The null hypothesis cannot be rejected, and 
there is no evidence for believing that the type 
of work performed is related to risk taking and 
the source of a spurious relationship between 
age and risk taking. 

Differences in risk taking among the com- 
panies represented in the 


sample might also 
affect the rel 


ationship between age and risk 
taking. To test this possibility, three firms were 
selected that accounted for approximately two- 
thirds of the entire sample. They were: (a) 
Firm A, an integrated producer and fabricator 
of aluminum (V = 251); (b) Firm B, manu- 
facturer of electrical and electronic equipment 
(V = 410); (c) Firm C, manufacturer of equip- 
ment for the gi 


eneration, transmission, utiliza- 
tion, and control of electricity (V = 341). 


A median test conducted to test the null 
hypothesis shows the 


value of x? as 9.01 
(df = 2, p < 01). There are significant differ- 
ences in risk taking among these companies. 
Table 4 shows the mean and variance on age 
and riskiness for each of the three companies. 
All of the differences jn age are highly signifi- 
cant (p < .001) with the managers in Firm A 
being the oldest and Firm B being the youngest. 
The managers in Firm A have the highest 
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scores on the risk-taking measure (i.e., are most 
conservative), and Firm C has the lowest score 
on that measure. The difference between Firm 
A and Firm B is significant (/ = 2.15, p < .025) 
and in line with the age differences in the 
samples. The difference between Firms B and C 
approaches significance (t= 1.41, p < .08) but 
is in the reverse direction from what might be 
expected from the mean age of the samples. 

Table 4 also shows the correlation between 
age and mean risk taking within each of three 
companies. The correlations are still positive, 
ranging from .61 to .42. The relationship be- 
tween age and risk taking is not eliminated 
when company differences are eliminated. 

Tt will be recalled that managers were also 
asked to indicate their perception of their rela- 
tive riskiness by putting pluses and minuses 
beside their individual item response. This was 
done because of an assumed relationship be- 
tween biases in Ss’ estimates and the amount 
of value that he attaches to risk. 

Of 7,420 instances in which pluses and 
minuses could potentially have been recorded 
(1,484 Persons X 5 Items), such judgments 
were made 4,254 times or 57.3% of possible 
instances. These judgments are unevenly dis- 
tributed between pluses and minuses with 
2,712 pluses and 1,542 minuses. This is con- 
sistent with previous evidence that persons 
tend to think of themselves as riskier than 
their peers. 

The critical question, however, concerns the 
relationship between these judgments and age. 
Once again using the lowest power test and the 
smallest number of assumptions about the 
data, a chi-square test was conducted to test 
the null hypothesis that the age gom 
constitute equivalent populations with ee 
to the distribution of pluses and minuses. This 
test generates a value for X? of 80.11 (f= 36, 
p< 001), so the null hypothesis 1S rejected. 4 

To explore the actual form of the a gone 
relationship between age and the risk-val A 
measure, an index was constructed for each 


person as follows: 


(No. of pluses) — (No 
~ Total no. of responses 


of minuses) 
Q= omea S 


This index has a range fio. k 
The correlation between age and the risk- 


value measure (rag) is — A3 (p< 001). Since 
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TABLE + 


Mean SCORES ON AGE AND Risk TAKING AND 
CORRELATIONS BETWEEN AGE AND RISK 
TakinG BY COMPANY 


Firm | A Si R SR n TAR | t 


50.43 | 4.70 | 6.65 251 | .61 |-001 
12.20 | 4.50 | 6.73 410 | .42 |.07 
44.08 | 4.37 | 7.14 341 | 49 | 01 


A | 42.70 
B | 36.35 
> |3992 


S 


the measure Q, like R, may be expected to be 
highly unreliable, a correlation was computed 
between age and mean risk-value score (rag) 
and was found to be — .76 (p< 001). Figure 2 
shows a plot of Ọ and age. Clearly, as age in- 
creases, the scores on the risk-value measure 
decrease. 

This finding may be due to the fact that older 
persons place less value on thinking of them- 
selves as risk takers. But, it is also possible that 
it is simply an artifact of their greater con- 
servatism. An older person who states that he 
would take the risky alternative if it had 9 out 
of 10 chances of success has a more difficult 
time seeing himself as more risky than his peers 
than a younger person who states that he would 
take the risk if it had only 1 out of 10 chances 
of success. In effect, the measure Q may be 
highly correlated with R and the correlation 
rao may be an artifact of the association of 
both A and Q with R. 

The correlation rar was found to be — .29 
(p < .001) and the correlation rar across age 
categories was found to be — .70 (p < .001). 
However, the partial correlation roar and 
raa-r were found to be — .11 (p < .001) and 
— 56 (p< 001), respectively. Clearly, age 
and the risk-value measure are correlated even 
when the influence of riskiness is partialled out. 

The fact that age is correlated with estimates 
of relative riskiness in relation to peers even 
when scores on risk taking are partialed out has 
been attributed to the fact that younger per- 
sons place more value on risk taking than older 
persons. It is also possible that this finding 
reflects a widespread shared belief that younger 
managers are more risky than older ones. 
According to this explanation, a manager would 
are of any differences between him and 
of his group and would base 
í differences between his re- 


be aw: 
other members 
his estimates 0 


404 


MEAN RISK-VALUE (Q) 


Victor H. Vroom AND BERND PaHL 


Fic. 2. Risk value versus age. 


sponses and theirs on this knowledge, This 
explanation cannot be completely ruled out 
with these data. However, it does not explain 
why there should be an inordinately larger 
number of pluses to minuses. It seems slightly 
more parsimonious to assume that there is a 
widespread value placed on risk in American 
culture and that differences in the strength of 
this value are correlated with age. 


Discussion 


The results show a significant relationship 
between age and measures of both risk taking 
and of the value placed on risk, The relation- 
ships are small in size (although highly signifi- 
cant) when individual scores are correlated 
with age but increase markedly when mean 
scores are used, Furthermore, the relationships 
are relatively stable across items and across 
companies. 

What can be said about the Process under- 
lying this relationship? One Possibility is that 
the regression line relating age to risk-taking 
constitutes a developmental track along which 


people move as they increase in age. As a 
person gets older, he gets married, has children, 
and acquires responsibilities to persons other 
than himself that militate against his taking 
risks that he m 


ight well have taken earlier in 
his development, 


It is also 


possible that the explanation is not 
developmer 


ntal at all, but cultur 
a person, the less likely he is to have experienced 
the depression, to have served in World War IJ, 
and the more likely he is to have been reared in 
an atmosphere of affluence and economic 
stability. According to this explanation, risk 
taking is a relatively stable attribute of per- 
sonality that is learned fairly early in life with 
the sociocultural conditions prevalent at that 


time being instrumental in its determination. 
The points along the 


I regression line do not 
constitute a track along which people move 
but rather, indications of sociocultural change- 
Each age group is Moving along its own track, 
and that track js Parallel to the horizontal 
axis. 


These explanations are not, of 
mutually exclusive, Both pro 


al. The younger 


course, 
cesses of develop- 


x 
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ment and of sociocultural change may underly 
the relationship of age to risk taking. The 
critical question to be answered involves their 
relative contributions. 

The data reported in the present article are 
not adequate for this purpose. A longitudinal 
investigation in which people are studied over 
time, particularly during the age ranges corre- 
sponding to the steepest slopes on the regres- 
sion line would provide the most convincing 
answers. 

But, the question can also be studied cross- 
sectionally. Let us assume that data of the 
general type reported here were collected from 
different samples of the same populations at 
intervals of 2 yr. over a 10-year period. If the 
developmental explanation accounted for the 
relationship between age and risk taking, there 
would be no change in the slope of the linear 
regression, points of inflection in nonlinear 
regression, intercepts, or mean scores on risk 
taking. If the sociocultural explanation were 
correct, the slope might remain the same, but 
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all other parameters would change systemati- 
cally over time. 
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MOTIVATIONAL TYPE AND THE SATISFACTION- 
PERFORMANCE RELATIONSHIP * 
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Pennsylvania State University 


Recent work in the area of job satisfaction and work motivation has empha- 
sized the importance of individual differences. The logic of subgrouping was 


combined with a consideration of individual differe 
work in an attempt to shed some light on the satisf 


in the motivation to 
tion-performance rela- 


tionship. Professional engineers were subgrouped on three motivationai dimen- 
sions. Motivational type was tested as a moderator of the relationship between 
five factor analytically derived satisfaction dimensions and six performance 
dimensions. Two of the emerging types showed moderator characteristics. 


There has been considerable discussion of 
the merits of subgrouping and typal analysis 
over the past few years (Owens, 1968). The 
purpose of the present research was to deter- 
mine if such an approach might lead to a 
more efficient description of the relationship 
between job satisfaction and job performance 
than has been supplied to this point in the 
literature. 

The study of various reinforcers as well as 
the individuality of satisfaction patterns set 
the stage for the introduction of subgroup or 
typal logic. Two cornerstones of this emerging 
concern for individual differences were Shaf- 
fer’s (1953) monograph on need satisfaction 
and Ahlberg and Honey’s (1950) study of 
government scientists. Although both of these 
works pointed to the importance of individual 
differences in the consideration of job satis- 
faction, they did not concern themselves with 
the identification and description of subgroups. 
More recent and considerably more direct 
attempts to introduce the typal approach in 
the analysis of the motivation and satisfac- 
tion constructs have come from Katzell, Bar- 
rett, and Parker (1961); Dunnette, Campbell, 
and Hakel (1967); and Graen, Dawis, and 
Weiss (1968). 

The Graen, Dawis, and Wei 


ss (1968) study 
introduced a unique strateg 


y insofar as they 
1 This research was partiall 
from the United States Depart 
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versity. 
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not only identified motivational types, but 
they also used these types as moderators of 
the relationship between need-reinforcer cor- 
respondence and job satisfaction. They found 
that need type did, in fact, act as a mod- 
erator. It is important to note that type acted 
as a multivariate moderator, that is, the pat- 
tern of needs of the individual was of maximal 
importance. 

The present research is based on the notion 
that motivational type or subgroup might act 
as a multivariate moderator of the relationship 
between job satisfaction and performance. The 
logic of the hypothesized motivational mod- 
erator is closely related to the consideration 
of satisfaction and performance as collinear 
variables tied together by rewards. The prob- 
lem becomes one of defining “rewards.” The 
importance of the concept of individual differ- 


ences, as previously discussed, is that the 
nature of a reward varies fron 


person. The identification of 
types or subgroups was intended 
a gross sense, the salience of reinforcers in the 
individual’s environment, People having simi- 
lar reward systems (i.e, the same reward 


salient for each) should, given similar reward- 
ing systems, be 


1 More satisfied and perform at 
higher levels (if rewards are, in fact, tied to 
performance) when their own particular needs 
are satisfied. 


The present stu 


n person to 
motivational 
to control, in 


Th : idy is the first step in de- 
scribing the various relationships between spe- 


cific need patterns, satisfaction with specific 
aspects of the work, and specific job behav- 
lors, as outlined above, Its primary intent was 
to demonstrate that the pattern of individual 
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needs modifies the relationship between job 
satisfaction and performance. 


METHOD 


The Ss chosen for the study were 175 engineers 
with no supervisory responsibilities who came from 
six different organizations. Five of these organiza- 
tions were commercial firms with engineering sub- 
sections, while the sixth was an engineering consult- 
ing firm, All data were gathered in group meeting 
rather than by mail. 

There were four broad classes of variables in the 
study: (a) satisfaction variables, (b) motivation 
variables, (c) performance variables, and (d) work 
history and biographical information. These classes 
are described separately. 

Satisfaction variables. A questionnaire constructed 
by Elbert (1966), which will be referred to as the 
satisfaction inventory (SI), was chosen as a measure 
of the satisfaction components in one’s job. The 
questionnaire was developed by writing statements 
for each of 21 need categories identified in the lit- 
erature. Internal consistency analysis was used to 
identify the five best statements in each category. 
The inventory was administered to male college 
graduates employed for 3 yr. or less. A 7-point re- 
sponse scale of satisfaction was used signifying the 
degree to which the item was perceived to be present 
in the job. A principal axis factor analysis with a 
Varimax rotation identified 12 need satisfaction fac- 
tors accounting for 98% of the common variance. 

For the present study, a pilot was done to deter- 
mine the adequacy of the SI for a relatively homo- 
geneous occupational group (professional engineers) . 
Consequently, the SI was mailed to engineers em- 
ployed by three different organizations. Of the 250 
questionnaires that were sent out, 176 were returned. 

i dy indicated that the SI was suitable 
The pilot study > A 7) Thi 
for use in the collection of e ae e enbe Sl 
5 erms g 
major factors appear oa they were defined by 


ariance accounted for), defin 
tke same items as in Elbert’s (1966) analysis. The 


factors and items are more fully described in the 
p: ection. r 

ie A variables. These consisted oi pe ai 
ings of the motivation to work on seven pre 
The scales are labeled as follows: Stina = 
fication, Job Curiosity, Team Attitude, a m 
centration, Independence/Self-Starter, Persis n a 
Organizational Identification. A descriptor = 
scales and their development has recently been p! 


lished (Landy & Guion, 1970). $ i 
Performance variables. Sprecher Hg Le T 

ed a series of 33 performance sca 3 
E ; ofessional engineers. The scales 
broad factors: 


supervisors in rating pro oe 
i z 5 AY 

are considered representative © k 

convergent production, Te 

tion. From 


divergent production, v u 
sponding and doing, evaluation, and cpie i 
the 33 scales, 6 were selected for use in al 
study on the basis of statistical nee SE adie 
general suitability. The scales were: ae em n ee 
ing, Communication of Results, Use of Pers 


sources, Personal Ethics, Problem Analysis, and 
Valuable Results. Performance and motivation of S 
were rated by co-workers selected by S. 
Additional data. The Ss were also asked to supply 
biographical information and work history informa- 
tion, as well as to complete the Bass Orientation 
Inventory (ORI; Bass, 1962). The Bass ORI yields 
three scores: Self-Orientation, People- or Interac- 
tion-Orientation, and Task Orientation. 


RESULTS 
Satisfaction 


The SI was cluster analyzed using Tryon’s 
key cluster analysis as found in the BC-TRY 
computer system of cluster and factor analy- 
sis (Tryon & Bailey, 1966). Cluster analysis 
of the satisfaction inventory yielded five clus- 
ters accounting for 97% of the variance of the 
raw correlation matrix and 96% of the com- 
munality. The five clusters were: Advance- 
ment, Ethical Principles, Creativity, Pay, and 
Working Conditions. The items defining the 
clusters and their loadings can be found in 
Table 1. The intercorrelations of the clusters, 
the cluster reliabilities, and the percentage of 
variance accounted for by each cluster can, be 
found in Table 2. 


Motivation 


Rather than assume the independence of 
motivational dimensions for typing (and pos- 
sibly introduce an error of overrepresentation 
of a particular motivational domain), a factor 
analysis of the motivation ratings was done. A 
principal factors solution was applied with 
both a Quartimax and Varimax rotation, This 
analysis indicated that the three most inde- 
pendent dimensions for typing (i.e., those 
which had the greatest potential for yielding 
discriminably different types) were Profes- 
sional Identification, Team Attitude, and Task 
Concentration. 

For the motivational typing or subgrouping, 
a minimum within-cluster variation technique 
was used. The procedure is one of the com- 
ponents of the BC-TRY computer system and 
is known as O-TYPE. The process is one of 
clustering objects in multidimensional space. 
The dimensionality in this case was three (the 
three motivational dimensions). As a function 
of standard scores, each S was classified as 
high, moderate, or low on each of the motiva- 
tion dimensions (high = +1 SD and higher, 
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TABLE 1 


CLUSTER DEFINERS AND Loapincs 


Cluster | Loading | Mem 
| 
poo .91 | The job provides an opportunity for advancement. 
| .89 | You have an opportunity to get ahead. 
| 84 | There is a chance to move one job higher. 
66 | You advance. 
gical Principles | .91 The things you do are morally right. 
| 91 | Your ethical standards are maintained. 
79 Your work agrees with your ethical principles. 
cavity | 82 | Your creative ability is useful on the job. 
| -86 | You can be imaginative and think up unique ways of doing things. 
| 79 | You can find new ways to do things. 
| SI | You have the freedom to use your own judgment. 
Eey .82 | You are paid enough for the work you do. 
81 | You are able to get the pay increases you feel you deserve. 
T7 | You are satisfied with the length of time between pay increases. 
| 16 | You feel secure with the amount of money you earn. 
Working Conditions | | 
| -95 | There are good physical working conditions. 
| a | You have pleasant work surroundings. 
| : | 


Working conditions, that is, he: 


ating, lighting, etc. are adequate. 


Note.—Oblique factor coefficients. 


moderate = +1 to —1 SD, low = —1 Sp and 
lower). This allows for 27 (or 3?) a priori 
types. This score sector space is a starting 
point for an interative clustering procedure 
based on Euclidian distances. 

One of the problems involved in the choice 
of a typing strategy concerns the model 
chosen. The deterministic model is stated in 
terms of the ideal case in which no provision 
is made for chance or error variance in the 
model itself, and every object must be allo- 


TABLE 2 


CLUSTER INTERCORRELATIONS AND RELIABLLITIES 


Cluster 


| «| 

z ae 

1. Advancement 90 23 38 34 22 
2. Ethical Principles OF 2 14 2 
3. Creativity 87—05 i4 
4. Pay | 89 20 
5. Working Conditions 81 
% he 52 24 2 ë 13 97 


Note,—Reliabilities are Spearman-Brown iabilities 
Tryon [1957] and Ghiselli (1964) D reliabilities ( 


x a see 
ecimals omitted, 


cated to a specific subgroup. Since it was un- 
reasonable to assume that the motivational do- 
main was completely represented (as mani- 
fested in the types) by three dimensions, a 
probabilistic model was chosen for typing. 
This strategy allows for variation in descriptor 
variables and the persons described, and not 
all persons need belong to one of the sub- 
groups found (Lorr, 1968). 

The subgroupings resulted in 13 distinctive 
types comprising 171 Ss. Th 


s e remaining four 
Ss were considered uni 


Another component 


of the BC-TRY com- 
puter system known 


work history variab] 
The component is 
ploys a Monte Ca 
allows one to 


es and Bass ORI scores). 
appealing because it em- 
rlo sampling routine that 
implement the central limit 
ereby treat each test as inde- 
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pendent. Briefly, the procedure draws a large 
number of random samples (e.g., 3,000) from 
your overall sample. Each of these samples 
is of the same size as the type you are exam- 
ining. The mean of the variable of interest for 
the type is then superimposed on the distribu- 
tion of means for the random samples. State- 
ments can then be made about the signifi- 
cance of the difference between the type mean 
and the overall sample mean. On the basis of 
this kind of analysis, the following statements 
can be made about the motivation types: 
Type 2—significantly higher on Self-Orienta- 
tion (p < .05), significantly lower on Task- 
Orientation (p < .05); Type 3—significantly 
higher on People-Orientation (p < .05); Type 
4—significantly lower on People-Orientation 
(p < .05), significantly higher on Task-Ori- 
entation (p< .001); Type 5—significantly 
fewer previous jobs (p < .01); Type 6—sig- 
nificantly higher on People-Orientation (p < 
.001); Type 7—significantly lower on People- 
Orientation ($ < .01), significantly higher on 
Task-Orientation (p < .05); Type 9—signifi- 
cantly greater number of years past degree 
(p< 05); N 

While the independence of the types is as- 
sured by the clustering process, unless one has 
rge sample to start with, many of the 
es or subgroups are too small for 
stable inferences of the form required by oe 
relational forms of moderator analysis. While 
it is possible to combine the types n a P 
fied hierarchical order to achieve larger ie 
Mis 9 information would 
sizes, it was felt that more 1 

4 ini than would 

be lost in the combining process E aoti 
be gained by larger Ws. Consequen Ys 0 ded 
promise was made. It was arbitrari AN types 
to continue the moderator analysis w m A ee 
comprising 20 or more Ss. This ae be- 
a more adequate stratesy than ee account 
cause one can take sample size into ages 
and adjust accordingly in oar a moti- 
Similar problems were encountere Dawis, 
vational typing study done by Graen, 


; iç will probably con- 
and Weiss (1968). This Kal E models are 


tinue to be a problem as long a 
used that assume a multivariate? 


face 

. erator 
Two of the four types showed es 4 
Characteristics, An examination © 


a la 
emerging typ 


TABLE 3 
Motivation Type Score PATTERNS 
| Motivation dimension 
Type | Pro- | Task N 
| fessional Team Concen- 
Identifi- | Attitude tention 
cation 
1 | 32.9 29.0 44.8 6 
2 | 389 46.2 26.6 7 
3 | 287 50.5 50.7 8 
a | 509 | 31.6 54.0 10 
5 | 47.2 53.4 38.5 21 
6 | 45.6 | 46.6 49.2 29 
7 496 | 523 | 879 21 
8 | 51.6 57.5 50.6 21 
9 | 46.0 61.2 63.0 10 
10 | 68.9 54.1 38.4 5 
11 62.1 45.4 51.4 11 
12 | 645 | 55.9 63.5 10 
13 | 62.6 | 58.0 53.2 12 
| 
spent scores appear in standard score form (XY = 50, 


and 5 shows that the full-sample correlations 
are generally of zero order, while the mod- 
erated correlations describe significant rela- 
tionships between satisfaction and perform- 
ance. The number of significant correlations in 
the moderated matrix is sufficient to indicate 
that their appearance is not solely due to 
chance. 

There is an interesting statistical point that 
arises from the moderator analysis. When you 
are dealing with a matrix of correlation co- 
efficients rather than a single value, what 
should be considered as evidence for the pres- 
ence of moderator effects? The first and most 
obvious is simply to look at the increase in 
the number of significant correlations and 
compare that to levels you might expect by 
chance. Since the overall matrices presented 
in Tables 4 and 5 are essentially of zero or- 
der, the differences between the moderated 
matrices and the overall matrix are fairly ob- 
vious. On the basis of this kind of approach, 
there does seem to be some justification for 


accepting the moderator characteristics of 
g 


motivational type. ; 
But since we are directing our attention to 


the grosser question of the relationship be- 
tween satisfaction and performance (the ex- 
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TABLE 4 


INTERCORRELATIONS OF SATISFACTION CLUSTER SCORES WITH PERFORMANCE 
VARIABLES, AND MODERATED CORRELATIONS 


Satisfaction dimension 
Variable z nO 
Advancement reel Creativity Pay aoe 

Problem Weighting 

Intercorrelation 04 05 —02 09 06 

Moderated correlation 53* 50* 07 06 31 
Communicates Results 

Intercorrelation 02 OL 04 15 OL 

Moderated correlation 27 13 13 54** 18 
Personnel Resources 

Intercorrelation 03 07 06 03 01 

Moderated correlation 60** 30 33 —12 18 
Personal Ethics 

Intercorrelation =11 15 —05 02 01 

Moderated correlation 30 09 09 00 05 
Valuable Results 

Intercorrelation —04 03 05 02 —15 

Moderated correlation 35 42 27 —27 40 
Problem Analyzing 

Intercorrelation —06 12 14 04 02 

Moderated correlation 23 45* 22 —02 50* 


Note, —N = 175; Type 5 moderated correlation, 
* p <.05 (two-tailed test). 


N = 21; decimals have been omitted. 
** p <.01 (two-tailed test). 


istence but not necessarily the degree and 
kind), perhaps a different kind of analysis is ariance accounted for by the 
called for—one of a grosser type. One strategy the moderated matrix 

might be to do an r to 2’ transformation on all e point is not that there are lots of ways 
values in the moderated and the overall 

matrix, respectively, and compute the signifi- 
cance of the difference between the average 
value for each of the matrices. 

A related strategy might be to compare the 
Toot mean squares of the overall and mod- Discussion 
erated matrices, respectively. A modification It does seem as if moti 
of the nonparametric sign test might also be as a moderator of the r 
appropriate, Satisfaction and perform 


the first vector of th 
Pared to the y 
first vector of 


e overall matrix as com- 


vational type acted 
elationship between 
ance. In the present 


a pattern of energy expendit 
r ure, first 
glance this might app reo oer 


more orthodox notio 


tor would be interpr 
be made between v 
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TABLE 5 


INTERCORRELATIONS OF SATISFACTION CLUSTER SCORES WITH PERFORMANCE 
VARIABLES, AND MODERATED CORRELATIONS 


Satisfaction dimension 
Variable A ae 
Advancement Ethical Creativity Pay Working 
Principles z | Conditions 
i} 

Problem Weighting | 
Intercorrelation 04 05 —02 09 06 
Moderated correlation 26 —08 —16 —15 —13 

Communicates Results 
Intercorrelation 02 01 04 15 01 
Moderated correlation 29 —18 —52* —21 —30 

Personnel Resources 
Intercorrelation, 03 07 06 03 01 
Moderated correlation 01 07 —20 —13 —08 

Personal Ethics 
Intercorrelation -l1 —15 —05 02 O1 
Moderated correlation 50* —39 =55** —04 —33 

l Valuable Results 
Intercorrelation —04 03 05 02 —15 
» Moderated correlation 35 —49* —22 —13 —53* 

Problem Analyzing 
Intercorrelation —06 12 14 04 02 
Moderated correlation 50* —32 —32 —20 | —26 

| 


N = 175; Type 8 moderated correlation, N = 21; decimals have been omitted. 
5 (two-tailed test). 
01 (two-tailed test). 


history. These concepts will ultimately tie 
together need systems as predispositions for 
behavior and energy expenditure viewed as an 
overt behavior system. 

Interestingly enough, the dimensions chosen 
for motivational typing turn out to be similar 
to dimensions suggested recently by Alderfer 


on response set when they correlate self-re- 
ported needs with self-reported satisfaction. 
| I have also been struck with this possibility. 
The present research strategy has been to 
infer motivation from energy expenditure. 


This was made possible by the nature of 
-workers were instructed to 


the peer ratings. Co 
rate Ss on the degree of energy that they 


expended in each of the specified motivational 
dimensions. The development of the motiva- 
tional scales makes it reasonable to assume 
that there is a fairly large behavioral com- 
ponent incorporated since interrater agree- 
ments reached generally acceptable levels. The 
logic is very simply that, given the oppor- 


i i i ivities that 
ty, people will engage in activi 1 
Ser ie oe Most studies 


they find personally rewarding. 
re asked Ss, in one form or another, what 


e necessary to produce eno 
ve 


have 


conditions wer p 3 
ment. The present study considers obser 


movement as an index of motives. 

The common ground between the more tra- 
ditional view of motivation and the pea 
approach is reward system and reinforcemen 


(1969) as basic dimensions of human needs. 
Alderfer proposes a threefold conceptualiza- 
tion of human needs comprising existence, re- 
latedness, and growth. Existence needs seem 
to be characterized by a concern for satisfac- 
tion of personal needs not directly related to 
other people or the organization. Relatedness 
needs involve relationships with significant 
others. The third category, growth needs, 
«u _. include all the needs which involve a 
person making creative or productive effects 
on himself and the environment [p. 146].” 
In the present study, Professional Identifi- 
cation seems closest to the proposed existence 
need, though Alderfer (1969) might contend 
that it is more appropriate to growth needs. 
The relationship between relatedness needs 
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and the dimension Team Attitude is fairly 
obvious. The Task Concentration dimension 
effectively couples the individual with his 
work environment, as suggested in Alderfer’s 
elaboration of growth needs. 

While a description of motivational types 
present in a sample of Ss might be of interest 
in its own right, the real value of such a pro- 
cedure lies in relating these types to variables 
other than those used in the typing process. 
The primary example of such an approach in 
the present data is the moderator analysis, 
relating the types to the performance-satisfac- 
tion relationship. Another attempt at anchor- 
ing the types was through the use of the Bass 
(1962) ORI scores and the work history data. 
The relationships of the ORI scores to the 
types are not very clear and are mentioned in 
the hope of generating hypotheses rather than 
answering questions. There seems to be a loose 
relationship between ORI Task Orientation 
and the motivational dimension of Task Con- 
centration. Those subgroups whose Task Ori- 
entation scores were significantly higher than 
the rest of the sample also had Task Concen- 
tration scores higher than the mean of the 
sample while the one group that had a Task 
Orientation score significantly lower than the 
rest of the sample had a Task Concentration 
score lower than the mean of the sample. This 
would seem to indicate some gross positive 
relationship between self-reported motiva- 
tional direction and peer ratings of energy 
expended in the same domain. The same rela- 
tionship does not seem to hold for ORI Inter- 
action scores and Team Attitude. This is prob- 
ably due to the fact that Interaction Orienta- 
tion involves seeking out companionship while 
Team Attitude deals primarily with helping 
and working cooperatively with other people. 
This may indicate that seeking companionship 
is not a sufficient condition for being a good 
co-worker. Finally, it is interesting to note 
that the only group having a Self-Orientation 
score significantly higher than the rest of the 
sample was also rated below the mean on all 
three motivation dimensions. 


The present data Suggest that there is some 
value to breaking out of the framework of 
measuring motivation through self-report de- 
vices. It appears that knowing the energy ex- 
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penditure patterns of professional engineers 
provides valuable information about satisfac- 
tion-performance relationships. As is obvious 
from the data analysis, the satisfaction- 
performance problem is not solved. Only two 
of the motivational types yielded relevant in- 
formation (i.e., showed moderator character- 
istics), but this is completely acceptable un- 
der the notion of differential predictability. It 
is entirely possible that if the motivation typ- 
ing were expanded to take into account more 
than three dimensions, predictive efficiency 
might have been increased. , 

There are two important things to keep in 
mind concerning the motivational types. Of 
primary importance is the fact that these sub- 
groups represent multivariate combinations, 
taking into account the complexity of motiva- 
tional systems. There seems to be a tempta- 
tion in certain kinds of research to identify a 
subgroup, look at the scores of a series of 
single variables for the people in this sub- 
group, assume that all the people in the sub- 
group have identical amounts of the traits 
represented by these single variables, assume 
that these variables combine in exactly the 
same way for all people in the subgroup, and 
talk about “types” of people, A prime example 
of this logic appears in a recent book by 
Hackman (1969, p. 54) on work motivation. 
This strategy is occasionally dangerous and 
almost always misleading. 

Of further importance in considering the 
present motivational types is the fact that 
they represent clusters of real people, that is, 
we can expect to find motivational dime 
combining in this manner. Again, there is a 
tendency to take normative data, partition 
the continua arbitrarily, combine these parti- 
tions, and talk about the resulting “types” as 
iT they were actually represented in the data. 
This logic is, again, represented in Hackman’s 
book (1969, p. 149). A related error is the 
Interpretation of a vector of factor loadings 


derived from a factor analysis of variables as 
if it described subgrou 


the data. An example of thi 


nsions 


ously described js th 
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the end! The fact that disjunctive or incom- 
patible patterns of scores can exist, at least 
in the realm of ability dimensions, has been 
documented by Tryon and Bailey (1970). 
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< ANCE IN 
TASK ENJOYMENT AND TASK PERSEVER 
RELATION TO TASK SUCCESS AND SELF-ESTEEM 


SKIPTON LEONARD ann JOSEPH WEITZ 1 


New York University 


An attempt was made to use self-esteem as a moderator for the relationship 
between success-failure and task enjoyment. It was hypothesized that success- 
failure would be related to enjoyment for a high-self-esteem group but not for 
a low-seli-esteem group. This was not the case. Success and failure were related 
to task enjoyment regardless of the self-esteem of S as determined by typical 
measures of this variable. It was further found that a possible behavioral 
measure of task enjoyment (perseverance on a task) was not highly related to 


a rating of task enjoyment. This 
ratings of enjoyment. 


The relationship between cognitive balance 
and behavior has been examined in a variety 
of contexts in recent years, A self-consistency 
mechanism has been proposed by Korman 
(1968) in his discussions concerning the rela- 
tionships between task achievement, self-es- 
teem, and task satisfaction. This self-con- 
sistency model is a natural extension of cog- 
nitive integration or balance theories as pro- 
posed by Lecky (1945), Festinger (1957), 
and Vroom (1964) and reflects a sophistica- 
tion that has developed from recent job satis- 
faction studies (Locke, 1965; 1966; c.f. 
Argyris, 1964; Dunnette, Campbell, & Hakel, 
1967) and studies in social influence theory 
(Deutsch & Gerard, 1955). 

The essential assumption of the self-con- 
sistency model was stated as follows: “All 
other things being equal, individuals will a) 
choose, b) adjust their performance to, and 
c) find satisfying those behavioral roles which 
maximize their sense of cognitive balance or 
consistency [Korman, 1968, p. 485].” It is 
within this framework that we shall investi- 
gate task satisfaction as it relates to success 
and failure and to self-esteem. 

One’s self-esteem or self-assurance can be 
seen as having two, conceptually semiinde- 
pendent components. General or chronic self- 
esteem is related to an individual’ 
ation of his overall capabilities, 
might be thought of as a perso 


s self-evalu- 
and, as such, 
nality charac- 


1 Requests for reprints should be sent to Joseph 
Weitz, Department of Psychology, New York Uni- 
versity, 21 Washington Place, Room 311, New York, 
New York 10003. 


gives one pause concerning the meaning of 


teristic that is relatively stable over time. An- 
other component of self-esteem may be con- 
sidered as task specific. This corresponds to 
an individual's perceived ability in certain 
specific areas, and its relation to general or 
chronic self-esteem is somewhat unpredicta- 
ble. For instance, an individual may think 
highly of his ability to perform well in his 
chosen work and to handle most problems 
that might come up in this area and yet have 
a low self-appraisal of his ability in certain 
other specific areas, 

If one’s cognitive makeup with respect io 
self-esteem can be viewed in these terms, then 
it becomes apparent that cognitive balance 
need not be an all-or-none operation. Certain 
behaviors may be inconsistent for one part of 


one’s cognitive structure and consistent for 
another part. 


Using a self- 


it was hypothe: 
a task would 


task for indi 
but not for th 


Consistency or balance model, 
sized that success or failure on 
be related to enjoyment of that 
viduals having high-self-esteem 
ose having low-self-esteem. 


Present Experimental Paradigm 
; In the present experiment, Ss were given @ 
; a puzzle involving spatial 
relations, Those who solved the puzzle in the 
time period alotted were designated successes 
while those who were not able to solve the 
puzzle were designated failures. Success oF 
failure in this situation was self-evident to 
Ss, and no manipulations were required to 
form success or failure conditions. (This fits 
the experimental Paradigm suggested by 
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Weitz [1967].) The Ss were then asked to 
evaluate their enjoyment of the task and also 
how appropriate they thought the task was 
for measuring intelligence and for studying 
problem-solving behavior. 

At this point in the experiment, Ss were 
given another spatial relations puzzle to solve 
but were given as much time to solve the puz- 
zle as they wished. They were also informed 
of a third task that might be started at any 
time if they could not solve the second puzzle. 
The number of minutes that Ss worked on 
the second puzzle before giving up and trying 
the third task was a major dependent variable 
of interest in this experiment. After each S 
had solved or had given up on the second 
task, he was asked again to evaluate his en- 
joyment of the task, how appropriate he felt 
the task was for measuring intelligence, and 
how appropriate he felt the task was for 
studying problem-solving behavior. A third 
puzzle was given to Ss to work on, but no 
measures were taken since it was only in- 
cluded to keep the experiment plausible. This 
design is similar to, but not a replication of, 
Korman’s (1968, Experiment 1) study. A 
different basic task was used, there was no 
manipulation by Æ of success and failure, and 
certain behavioral measures were included. 


METHOD 


Subjects 


The Ss for th 
as usual from in 
dents in these classes are re 

d r t-approv' 
psychology departmen i 
eight males and 80 females were ol 


manner. 


e present experiment were recruited 
troductory psychology classes. Stu- 
quired to serve as Ss in 
ed experiments. Sixty- 
btained in this 


nent Instruments 


gned for the present experi- 
structed by cutting 12- 
to four to six pieces 


Apparatus and Measuren 


Three puzzles were desi 
ment. These puzzles were a 
in. square masonite panels In 
that teal be assembled to reconstruct the a“ 
The approximate difficulty of each puzzle ag el 
sessed during pretesting sessions where it was a 
mined that Puzzle 1 was, in ge asier puz: 
to solve than Puzzles 2 or 3: 


Three measures of self-esteem \ ee 
the present design. The Ghiselli Self-Descriptive 


Inventory (undated) consists of 64 pairs of personally 
descriptive adjectives and yields a ears 
scale that has been used in a number of studies a 
volving self-esteem (Gavin, 1969; Korman, 19 i 
Schein, 1969). Half of the pairs are adjectives a 
approximately equal positive social desirability, @ 


neral, an € 


m were employed in 


the other half are pairs of adjectives with approxi- 
mately equal negative social desirability. 

The manual for the Ghiselli Self-Descriptive In- 
ventory (undated) attempts to define the concept of 
self-assurance as follows: 


Self-assurance refers to the extent to which the 
individual perceives himself to be effective in deal- 
ing with the problems that confront him. There 
are those persons who see themselves as being 
sound in judgment and able to cope with almost 
any situation, whereas others think of themselves 
as slow to grasp things, making many mistakes, 
and being generally inept (Ghiselli, undated, p. 9). 


Korman (1968) reports data that suggest that this 
empirically based scale possesses moderate construct 
validity. 

The Trait Evaluation Index (Nelson, 1968) yields 
self-confidence and personal adequacy scales. The 
Trait Evaluation Index consists of 125 triads of 
personally descriptive adjectives that have been 
matched for positive social desirability. The respon- 
dent was to judge which adjective of the triad was 
most descriptive of him and which least descriptive. 
Two examples of the triads are shown below: 


A. Independent 
B. Earnest 
C. Self-assured 


A. Sportsmanlike 
B. Quick 
C. Self-reliant 


The manual for the Trait Evaluation Index does 
not attempt to define the concepts of self-confidence 
or personal adequacy and these concepts, therefore, 
can only be defined operationally in terms of those 
words keyed to those scales. There are data reported 
in the manual that indicate some construct validity 
for the Trait Evaluation Index. 

A 5-point rating scale was constructed to assess 
Ss’ evaluation of the first two tasks. The Ss were 
asked to respond to the following three questions 
after each of the first two tasks: (a) How enjoyable 
did you find this puzzle to be? (b) How appropriate 
do you feel this problem is for measuring intelli- 
gence? (c) How appropriate do you feel this problem 
is for studying problem-solving behavior? 


Procedure 


The Ss were instructed to complete a short bio- 
graphical inventory and two personality inventories. 
During this time, S was not supervised by E. Upon 
completion of these inventories, E read a short de- 
scription of the purpose of the experiment and in- 
structions for the first task. The general description 
and instructions given S for the first task were as 


follows: 
As a subject in this experiment you are helping 


v intelligence or aptitude test 


to develop a nev l 
which, hopefully, has less cultural bias than the 


older intelligence tests. 

You will be asked to w 
problems which we believe to 
gence. The results may have im 


ork on a number of 
be related to intelli- 
portant implications 
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TABLE 1 
INTERCORRELATION MATRIX FOR THE INDEPENDENT VARIABLES 
| 3 
k E $ Sc PA 
S-F (Task 1) | S-F (Task 2) Sex SA 3 : 
Variable {i 2 failure, (1 = failure, | (1 = female, (1-48) (1-45) u m m 
2 = success) | 2 = success) | 2 = male) (low) (high) | (low) (high) | (low) (hig 
S-F (Task 1) | 
S-F (Task 2) z ir 
Sex d à | 
SA -06 07 24 | | 
SC 06 —;02 Ae 30 : | 
PA 07 06 .21 26 25 | 


Note.—The direction in which variables were scored is 
052 ee are > ONAR significant at the 012 level for dj 
SC = self-confidence, PA = personal adequacy, N 


in the intelligence-testing field, Your full cocpera- 
tion, therefore, will be most appreciated. 

The masonite pieces which I have placed before 
you on the desk form a perfect twelve-inch square 
when properly assembled. One side of each piece 
has a very rough surface. This rough surface 
should always be kept face down, There is only 
one solution to this puzzle and that solution re- 
quires that no piece has its rough surface showing. 
Are there any questions? You will be given two 
minutes to solve this puzzle. Are you ready? You 
may begin. 


Each S was then allowed 2 min. to assemble a 
four-piece puzzle. After the 2 min. had elapsed, S was 
instructed to rate his enjoyment of the task, ap- 
Propriateness of the task for measuring intelligence, 
and appropriateness of the task for studying prob- 
lem-solving behavior. 

Upon completion of these ratings, E read the in- 
structions for the second task. 


The instructions for the second problem are the 
same as for the first problem. In other words, it 
forms a perfect twelve-inch square. The puzzle is 
different from the first, and you should have more 
difficulty in solving it. You will be given as much 
time to solve this puzzle as you wish, but since it 
is a difficult puzzle, you may start on the third 
problem at any time after you have started the 
second puzzle. This third problem may be a little 
easier than the second puzzle. If you wish to 
start the third problem, simply tell me so and I 
will start you working on it. 


After the S had solved the second task, or more 
frequently, had decided to start the third task, he 
was instructed to rate his enjoyment of the second 
task, the appropriateness of the second task for 
measuring intelligence, and the appropriateness of 


the second task for studying problem-solving be- 
havior, i 


It should be noted that the words “ 
or “success” were never used 
praise, disappointment, 


quit,” “fail,” 
by E. No words of 
or general comment were 


given below each variable, Pearson 
= 146. Abbreviations: S-F 
= 148 (68 males, 80 females). 


= succe: 


expressed after any of the puzzles. If S wener a 
leave the second puzzle and try to solve the thir 
puzzle, he did so at his own initiative. 


Experimental Predictions 


The predictions for ratings of enjoyment of the 
first task were that within the high-self-esteem group 
those who succeeded would enjoy the task more 
than those who failed. No such prediction was made 
for the low-self-esteem group. These predictions are 
similar to those of Korman (1968). g 

The rationale for the prediction of no difference in 
enjoyment for the low-self-esteem group was that 
while success would be inconsistent with their self- 
evaluation of their ability, it might be consistent 


with their cultural concept of enjoyment of tasks in 
which they succeed. 


Predictions for the 


ratings of appropriateness of 
the first task for 


measuring intelligence were also 
based upon a self-consistency model. It was predicted 
that the pattern of these ratings would be similar to 
the enjoyment ratings pattern. 

Similar but w 


for ratings of ap) 
ing problem 


Ppropriateness of Task 1 for 
If this variable is of less 


3 e distortion via self-con- 
mechanisms should be smaller. 


The predictions for the Perseverance on the second 
task variable depend upon the stability of self-esteem 
after an experience of Success or failure on Task 1- 
If self-esteem is not significantly altered by success 
or failure on Task 1, it would simply be predicted 
that high-self-esteem Ss would persevere longer on 
Task 2 than low-self-esteem Ss. 

A strong argument could be made, however, that 

e on Task 1 would have a significant 
everance on Task 2. One might expect 
would be increased or decreased with 


effect upon pers 
that self-esteem 


SELF-ESTEEM, TASK SUCCESS, AND 1ASK ENJOXMEN* Fail 
TABLE 2 
INTERCORRELATION MATRIX FOR THE DEPENDENT VARIABLES 
| | | 
: E (Task 1) | E (Task 2) M (Task 1) M (Task 2) P (Task 1) P (Task 2) 
Variable | (= very | (5 = very (5 = very (5 = very (5 = very (5 = very 
| enjoyable) | enjoyable) appropriate) appropriate) appropriate) appropriate) 
E (Task 1) 
E (Task 2) aso j 
M (Task 1) at | | 
M (Task 2) 2% | .55 | | 
P (Task 1) —.06 | .20 | 04 | | 
P (Task 2) .00 | 12 | Fa | 63 
Perserverance | | | | 
(Task 2) 15 AS al | AS 05 .09 
a l ll 


Task 2 are based only upon S 
below cach variable, Pearson rs > .160 are 
Pearson rs > .165 are significant at the .05 
enjoyment, M = appropriateness for measu 


el 
ring intelligence, 


success or failure on Task 1 and, therefore, the ex- 
pectations for success or failure on Task 2 would also 
be altered somewhat. 

The predicted results for the perseverance on the 
Task-2 variable allowing for the possibility of self- 
esteem being altered by an experience of success or 
failure on Task 1 are shown below. (SE = self- 


esteem.) 


Perseverance on the Second Task 


re < Low SE - Success < 


Low SE - Failu 
SE - Failure < High SE - Success 


High 
d upon the concept that 
high-self-esteem Ss who fail may persevere just as 
long as high-self-esteem Ss who succeed on Task 1 in 
order to maintain their high self-esteem. 

Predictions for ratings of enjoyment of the second 


i `. k for measur- 
task, appropriateness of the second tas! 
Se tact and appropriateness of the second 


This conjecture is base: 


ing intelligence, ; e 
task for studying problem-solving biari a 

tability of self-esteem. For this 
ep he success, and 


reason, relationships between self-esteem, 
failure on Tasks 1 and 2 and these 
variables are bound to be quite comp’ ine 
from pretest data, the percentage of Ss succeeding be 
Task 2 was expected to be too low for meaning! 
analyses of these data in terms of the self-con- 
sistency model, and no specific predictions are pre- 
sented here for these variables. 

The experimental results can 
into two components—results obt: 
were nol able to solve the secon: 
from Ss who were able to solve th 
dictions for perseverance On the second task 
be based upon the results from the first group of Ss. 
The data from the second group of Ss, however, are 
of considerable interest. A comparison of Ss in 
both groups gave information as to what type of 
person not only persevered on the second task but 


three rating 
lex. Further, 


logically be divided 
ained from Ss who 
d task and results 
at task. The pre- 
must 


= 148). Intercorrelations of S ratings and perseverance on 


112). The direction in which variables were scored is given 
and rs > .209 are significant at the Ole level for df = 146. 
significant at the .01; level for df = 110. Abbreviations: E = 
ppropriateness for studying problem-solving behavior. 


also solved it. An analysis of these data was 
made to see if these patterns fit into a self-con- 
sistency framework. 


RESULTS 
Variable Intercorrelations 


Table 1 gives the intercorrelation matrix 
for the independent variables in the present 
experiment: task success, self-assurance, self- 
eee personal adequacy, and set for all 

s. 


The magnitude of the self-esteem variable 
intercorrelations (.30, .26, .25) does not pro- 
vide strong support for the assertion that 
these variables closely reflect the same person- 
ality attribute. 

The sex variable is significantly related at 
the .05 level (two-tailed test) to the other five 
independent variables and were, therefore, in- 
cluded in all other inferential analyses. 

Table 2 gives the intercorrelation matrix for 
the dependent variables, enjoyment of Tasks 
1 and 2, appropriateness of Tasks 1 and 2 
for measuring intelligence, appropriateness of 
Tasks 1 and 2 for studying problem-solving 
behavior, and perseverance on Task 2. 

All correlations between S ratings of Tasks 
| and 2 and perseverance on Task 2 are non- 
cant at the 05 level. The lack of sig- 
t relationships suggests that task perse- 
be taken as a measure of task 
ng as indicated by a rating. 


signifi 
nifican 
verance cannot 
enjoyment or liki 
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TABLE 3 


ANALYSIS OF THE EFFECTS OF SELF-ESTEEM, SUCCESS 
OR FAILURE ON Task 1, SEX, AND INTERACTIONS 
Upon RATINGS OF ENJOYMENT OF TASK 1 


FOR ALL Ss 
Variable | (Rr? — Rs?) | (1 — Rit) | F af ? 
Analysis 1—SA 
92 31| 1144 | .58 
$$ 03 192 8.44 | pi | 01 
Sex 01 192 1.56 | ni4 | 220 
SA X S-F ‘00 92 00 | 1,143 | 1200 
Analysis 2—SC 
01 91 1.27 | 1144 | .26 
SA 06 191 8.89 | i144 | ‘00 
Sex :02 91 2.54 | 1,144 ll 
SC x S- 01 OL 1.41 1,143 .24 
Analysis 3—PA 
PA .00 92 16 | 1144 | .69 
SF 105 192 844) ida | lot 
Sex 01 ‘92 1.56 | 1,144 | [22 
PA X S-F 200 192 00 | 1,143 | 100 


Note.— Abbreviations: SA = self-assurance, S-F 


= success- 
failure, SC = self-confidence, 
= 148. 


PA = personal adequacy. N 


Inferential Tests of Main and Interaction 
Effects 


Least-squares analyses, obtained through 
regression methods as proposed by Cohen 
(1968) and Overall and Spiegel (1969), have 
been used in the treatment of the present ex- 
perimental data. The multiple-regression 
technique used here provides an ANOVA ana- 
logue solution that can easily handle unequal 
cell frequencies and does not require artificial 
dichotomy of the self-esteem variables.” 


* The present design (Method 2, or the experi- 
mental design analysis in Overall and Spiegel’s 
[1969] classification of least-squares procedures) in- 
volves estimation of main effects disregarding inter- 
actions and the estimation of interactions adjusted 
for main effects, 

Multiplicative effects correspond to 
effects in conventional AN 
tested in the present desi 
number of independent vari 
and including this new vy 
equation, 

The F ratios reported here 
plying the multiple R2 
regression techniques t 


interactive 
OVA designs and were 


ign by multiplying any 
ables together for each S 
ariable in the regression 


were obtained by ap- 
s obtained through standard 
o the following equation: 
Re- RS/NL- S) 
(RAW LH 
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Results of analyses of the effects of self- 
esteem (Ghiselli [undated]  self-assurance 
[SA], Nelson [1968] self-confidence [SC], 
and Nelson [1968] personal adequacy [PA]), 
success or failure on Task 1 (S-F, Task 1), 
sex, and interactions upon ratings of enjoy- 
ment of Task 1 are given in Table 3. 

The variation of F values for the success- 
failure and sex variables in the three analyses 
reflects the differing correlations between these 
variables and the self-esteem variables. The 
success-failure variable is significant at the 
-005 level with all other variables being sig- 
nificant at greater than the .10 level. The pre- 
dicted significant Success-Failure x Self-Es- 
teem interaction was not obtained in any of 
the three analyses. 

In Table 4 the self-esteem variables have 
been dichotomized near the median points so 
that means and ¢ tests could be computed and 
visual inspection made of the ratings of en- 
joyment of Task 1 data. 

It should be noted that larger £ values 
(though not significantly larger) were ob- 
tained for the comparison of success and 
failure conditions for low-self-esteem Ss than 
for the same comparison for high-self-esteem 
Ss. These trends are not only nonsupportive 
but are contrary to the predictions. ; 

Analyses of results for ratings of appropri- 
ateness of Task 1 for measuring intelligence 


and studying problem-solving behavior 
yielded, for the most Part, nonsignificant re- 
sults. 


The results of the 
self-esteem (SA, SC, 
on Task 1, sex, 


analyses of the effects of 
PA), success or failure 


three analyses. 


The self-esteem variables were dichotomized 


n points so that means and ¢ values 
computed (in seconds) on Task-2 
general, success Ss on Task 1 per- 
onger on Task 2 than failure Ss. Low- 


where Ri? and Re are the R%s based upon the 
larger and smaller sets of independent variables, re- 
spectively, and LZ and 5 are the number of inde- 
pendent variables in the larger and smaller sets. 


data. In 
severed ] 


See 
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self-esteem Ss who succeeded on Task 1 per- 
severed longer (though not significantly 
longer) than high-self-esteem Ss who suc- 
ceeded on Task 1. Also, in general, low-self- 
esteem Ss who failed on Task 1 persevered 
longer (though not significantly longer) on 
Task 2 than high-self-esteem Ss who failed on 
Task 1. While none of the observed relation- 
ships is significant at the .10 level, even the 
trends noted are not consistent with the pre- 
dictions. 

As suggested earlier, the number of Ss who 
succeeded in their attempts to solve the second 
puzzle was too small to allow for a full analy- 
sis of results in relation to success or failure 
on Tasks 1 and 2 and the self-esteem varia- 
bles. 

Since the self-esteem and Self-Esteem X 
Success-Failure interaction effects in all analy- 
ses involving ratings of Task 1 and persever- 
ance on Task 2 were nonsignificant, these ef- 
fects were ignored in analyzing ratings for 
Task 2. This procedure made it possible to 
compare the effects of various combinations 
of success or failure on Task 1 and 2 upon 
ratings of Task 2. Table 5 gives the means 
and standard deviations of ratings of enjoy- 


TABLE 4 
DARD DEVIATIONS, AND Ns For RATINGS 
or TASK 1 IN RELATION TO SELF- 
SELF-CONFIDENCE, PERSONAL 
D SUCCESS OR FAILURE 


MEANS, STAN 
or ENJOYMENT 
ASSURANCE, 
ADEQUACY, AN 


on Task 1 
Failure* Success” 
SS ae | t 
Variable |—7—7] | 5 
x |sp| | |9|” 
A ; ae 
hes (0-25) | 1.61 1.24} 48 28 me F na 
High (26 up) 1.69 | 1.30| 42 2.30 | 1. 
k 
pm (0-11) |1. 1.30 53 20 m A a 
High (12 up) | 1.81 1.20| 37 | 1.83] 1- 
PA 
Low (0-14) | 1.53] 1.20] 43 2.23 = a i 
High (15 up) 1.77 | 1.33] 43 2.39 | 1. a 


ce, SC 


Ea Very enjoyable, 
and PA 


for SA, sc, 
nd PA were ns; 


on of means 


‘or comparison of means for SA an 
16, 
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TABLE 5 


MEANS AND STANDARD DEVIATIONS FOR RATINGS OF 
ENJOYMENT OF Task 2, APPROPRIATENESS OF Task 2 
FOR MEASURING INTELLIGENCE, AND APPRO- 
PRIATENESS OF TASK 2 FOR STUDYING PROB- 
LEM-SOLVING BEHAVIOR IN RELATION 
TO Success OR FAILURE ON 
Tasks 1 AND 2 


Measure F\F2 SF F\S:° S,S.4 
Enjoyment of Task 2 

z 1.85 2.53 2.19 2.95 

SD 1.33 1.41 1.11 1.13 


Appropriateness of Task 2 for measuring intelligence 


3.20 3.09 
98 97 


a 2.52 
SD 1.19 


2.76 
94 


Appropriateness of Task 2 for studying 
problem-solving behavior 


Xx 2.35 2.51 2.22 2.71 
SD 1,24 1.03 1.22 1.08 
Note.—F1 = failure on Task 1; failure on Task 2; 


ess on Task 1;Sz = success on Task 2. 


ment of Task 2, appropriateness of Task 2 for 
measuring intelligence, and appropriateness of 
Task 2 for studying problem-solving behavior 
in relation to various combinations of success 
or failure on Tasks 1 and 2. 

The mean differences between these four 
groups for the enjoyment of the Task-2 varia- 
ble is significant at the .02 level (F = 3.50, 
df =3, 144). Analyses of mean differences 
for the two other Task-2 rating variables 
yielded nonsignificant results (at the .05 


level). 


Comparison of Subjects Who Succeeded or 
Failed on Task 2 

Only about 30% of all Ss were able to solve 
the second puzzle when given as much time to 
work on it as they wished. Comparisons of 
success and failure Ss differed significantly on 
only two of the seven major independent 
variables. It was found that a higher propor- 
tion of male Ss successfully solved the second 
puzzle than female Ss, and a higher propor- 
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TABLE 6 


MEAN SATISFACTION WITH PERFORMANCE Rarmes IN 
RELATION To INITIAL EXPECTANCY OF SUCCESS AND 
Success oR FAILURE ON THE TASKS 


Failure | Success 
Initial 
prag Rating N Rating N 
Low 2.6 46 | 5.4 | 39 
High 1.3 37 | 45 48 


Note.—This table presents pooled data from several tables in 
the feather (1969) study on a rating scale of 5 = very satisfied, 
1 = very dissatisfied. 


tion of those Ss who solved the first puzzle 
were able to solve the second puzzle. 


Discussion 


The results predicted for the ratings of en- 
joyment of Task 1 were not obtained in the 
present experiment. Task enjoyment was sig- 
nificantly related to task success, with low- 
self-esteem Ss enjoying success as much or 
more than high-self-esteem Ss. Self-esteem as 
measured by the self-assurance, self-confi- 
dence, and personal adequacy scales was 
minimally related to task enjoyment as a 
main effect or as a moderator effect. 

The failure to support Korman’s (1968) 
results is difficult to explain. The dependent 
variables and experimental instructions were 
very similar in both experiments. While the 
tasks employed were not the same, the self- 
consistency model would predict similar re- 
sults for this situation. Both types of tasks 
are used in intelligence or aptitude batteries 
(the Ss were told this in both studies) and 
the mean rating given to the appropriateness 
of Task 1 for measuring intelligence suggests 
that Ss believed the task to be at least mod- 
erately well suited for measuring intelligence. 

A recent study by Feather (1969) yielded 
results that are supportive of the findings in 
the present study. In the Feather experiment, 
measures of self-esteem and task-specific ex- 
pectations of success were obtained prior to 
attempts by Ss to solve anagram-type tasks. 
Feather predicted that self-esteem would be 
significantly related to task-specific expec- 
tancies and that low-self-esteem-expectancy 
Ss who succeeded on these tasks would rate 
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satisfaction with their performance on the 
anagrams task higher than high-self-esteem- 
expectancy Ss who succeeded on the tasks. 
Feather also predicted that high-self-esteem- 
expectancy Ss who failed on the tasks would 
rate satisfaction with their performance lower 
than low-self-esteem-expectancy Ss who failed 
on the tasks. : 

Feather obtained supportive results for his 
hypothesis only when he used the initial task- 
specific expectancy scores, all self-esteem mea- 
sures yielding nonsignificant results. By pool- 
ing Feather’s (1969) results, it is possible to 
present his data so that it can be compared 
with the results obtained in the present ex- 
periment and the Korman (1968) experiment. 
Table 6 presents mean satisfaction ratings in 
relation to initial expectancy and success or 
failure on the tasks for the Feather study. 

Visual inspection of the data suggest that 
success Ss, regardless of initial expectancy, 
rated satisfaction with their performance 
higher than failure Ss. These results are also 
contrary to balance model predictions but 
support the findings of the present study. 

It should be noted that the satisfaction 
ratings in the Feather (1969) experiment were 
different in a subtle way from the satisfaction 
ratings in the present experiment and in 
Korman’s (1968) experiment. In the Feather 
experiment, Ss were asked to rate satisfaction 
with their Performance, whereas in the present 
and Korman experiments, Ss were asked to 
tate their enjoyment or liking of the tasks. 
One can conceive of a high-self-esteem S rat- 
ing satisfaction with his performance low and 
enjoyment of the task considerably higher. 
The ratings in the Feather study may be in- 
fluenced by a Perfectionist effect, 

The Pearson rs between initial confidence 
and the three measures of self-esteem (.17, 
-10, and —.16) were nonsignificant at the .052 
level. Feather’s results suggest that self- 
esteem as measured by available self-rating or 
evaluation tests may be rather poor measures 
of initial expectancies and should not be in- 
terpreted in this way. 

The experimental results obtained in the 
present experiment are consistent with Locke’s 
(1965) conception of the relationship between 
task success and task satisfaction and are also 
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consistent with a naive conception of the rela- 
tionship between these two variables. 

Results obtained for ratings of Task 2 in 
relation to success-failure on Tasks 1 and 2 
are generally consistent with results obtained 
for ratings on Task 1. The Ss experiencing 
success on Tasks 1 and 2 rated enjoyment of 
Task 2 higher than Ss who experienced mixed 
success on Tasks 1 and 2, who in turn rated 
enjoyment of Task 2 higher than Ss who 
failed on both Tasks 1 and 2. 

The pattern obtained for the perseverance 
data was similar, though the relationships 
were weaker, to the patterns obtained for the 
enjoyment ratings for both tasks. The con- 
sistency of the contrahypothesized results in 
the present experiment suggest that some es- 
sential differences do exist between the experi- 
mental designs mentioned in this discussion. 

If experimental tasks and dependent mea- 
sure differences were responsible for the cross- 
experiment differences, then the balance or 
self-consistency model postulated by Korman 
(1968) is a great deal more complex than 
originally conceived, or it is situation bound. 

Tt has also been suggested in this discussion 


that functional differences may exist between 
(e.g., Ghi- 


general measures of self-esteem 
selli’s self-assurance scale and Feather’s 
[1969] measure of self-esteem) and task- 


asures of expectations of success. 
ct differences in the manner in 
tasks would interact with 
m in this case. 


specific me 
One might expe 
which various 
chronic self-estee ; 7 
The fact that success and failure were not 
induced or manipulated in the present nie 
but were self-evident may also be a ma) 
ference in the two studies. 
Analyses of differences on 
dent variables between Ss W 


the se 
i on Task 2 revealed that 
pn on Task 1 variables were the 


for which significant differ- 
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seven indepen- 


ho succeeded or 
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Feather (1969) study suggest that task suc- 
cess is significantly related to task enjoyment 
for low-self-esteem Ss as well as for high-self- 
esteem Ss. These findings are also consistent 
with results reported by Locke (1966) indi- 
cating a positive relationship between task 
success and task liking. 
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Likert’s linking-pin theory and I 
discussed. 


previous 

The present article explores the relationship 
between subordinate perceptions of the be- 
havior of their immediate superiors and the 
satisfaction of subordinate role expectations, 
Subordinate expectations, to the extent that 
they are fulfilled, represent degrees of satis- 
faction with the job and the organizational 
environment. 

The theoretical significance of the 


research 
reported in this 


article lies in the tests of 
hypotheses concerning the effects of several 
leadership behaviors that revealed findings 
contrary to previous research and theory. Of 
particular current interest is the test of an 
hypothesis concerning the effect of leadership 
influence as a moderating variable on which the 
relationship between leadership behavior and 
Satisfaction with subordinate role expectations 
1s contingent, 

According to Likert (1961, 1967) 
hierarchical influence permits 
Managers to obtain the assistance 
of their superiors and thereby 
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to previous research, initiating str 
out of 16 measures of subordinate 


cisiveness was found to have 
a significant positive relationship to 13 out of 16 
tical implications of the find: 


ucture was found to be 
role satisfaction. Leader 
to only 2 out of 16 role 
a rather pervasive 


ificantly related 


ings as related to 
studies on leader behavior are briefly 


linkage with higher strata and status levels in 
the organization. In contrast with a downward 
influence system from superior to subordinate, 
the upward influence of the superior is seen as 
a measure of his ability to control resources 
for the subordinate work group and to repre- 
sent the work group in its dealings with the 
broader organization system. Thus, upward 
hierarchical influence is seen by Likert as a 
necessary though not a sufficient condition 
affecting group satisfaction and productivity. 
For example, Likert (1961) asserts that the 
leader must not only have upward hierarchical 
influence with his immediate superior, but must 
also exhibit a supportive subordinate-centered 
leadership style. The Supportive style is seen 
as building higher expectations on the part of 
subordinates, The satisfaction of these expecta- 
tions is contingent on the ability of the leader 
to represent the group in the broader organiza- 
tion system or with higher levels in a manner 
that benefits the work group. 

„< US, it would appear that leader behavior 
affects employee satisfaction and commitment 
to organization goals and that upward in- 
fluence by the leader is primarily an interven- 
mg or moderating variable, 
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Dimensions of Leader Behavior Studied 


Of the many dimensions of leader behavior 
in the research literature that are found to be 
related to satisfaction and performance in a 
work group, five would seem to be of para- 
mount importance. These are initiating struc- 
ture, consideration, decisiveness, technical 
competence, and hierarchical influence. 

Initiating structure and consideration. These 
two dimensions of leader behavior describe the 
degree to which the leader establishes psycho- 
logical structure for subordinates and creates 
an environment of psychological support, 
warmth, and friendliness. In reporting the 
early development of these dimensions, Halpin 
and Winer (1957) describe initiating structure 
behaviors as those which indicate the leader 
organizes and defines the relationships 
between himself and his subordinates; defines 
the role that he expects each subordinate to 
assume; and endeavors to establish well-defined 
patterns of organization, channels of com- 
munication, and ways of getting the job done. 
They describe consideration in terms of be- 
havior indicative of friendship, mutual trust, 
respect, and warmth in the relationship be- 
tween the leader and his subordinates. 

Recent reviews of the evidence from a large 
number of studies indicate that leaders who 
initiate psychological structure for subordi- 
nates are generally rated high by superiors and 
frequently have higher producing work groups 
than leaders who are low on initiating struc- 
ture, and that considerate leaders generally 
have more satisfied employees (see e.g., 
Fleishman, 1969). However, several studies 
have shown that structure is frequently re- 
sented by subordinates and is a source of 
dissatisfaction, turnover, and grievance (Fleish- 
man & Harris, 1962; Korman, 1966; Filley & 
House, 1969). 

Technical competence and decisiveness. Leader 
decisiveness refers to the degree to which the 
leader is willing to make decisions or take 
decisive action. Technical competence refers to 
the degree to which the leader is perceived as 
capable of providing advice on technical or 
specialized problems and capable of anticipat- 
ing job-related details prior to assigning tasks. 
While these two dimensions of leader behavior 
have not received the attention that the first 


two have, there is evidence of their importance 
as correlates of leadership effectiveness. The 
studies by Comrey, Pfiffner, and High (1954) 
have indicated that supervisors of forest 
workers, government employees, and industrial 
production workers show significant positive 
relationships between these measures and 
measures of group productivity and leader 
effectiveness. 

Hierarchical influence. Research reported by 
Pelz (1952) and Wager (1965) suggests the 
importance of upward influence by the leader 
as a variable on which the relationship between 
supportive leader behavior and the satisfaction 
of subordinate role expectations is contingent. 
In his study, Pelz found that influence served 
to moderate the effect of the supervisor’s 
“social closeness” to employees. When super- 
visors had high influence and also engaged in 
supportive leader behavior by “taking sides 
with subordinates” rather than “taking sides 
with management,” employee satisfaction was 
significantly greater than when supervisors 
with low influence engaged in the same kind of 
supportive behavior. 

Similarly, in his study of the relationship 
between supportive behavior and influence in 
one firm, Wager (1965) found highly supportive 
behavior to be positively correlated with 
hierarchical influence on the part of first-level 
supervisors. He also found that influence had 
a moderating effect on the supportive behavior 
of leaders similar to the effect found by Pelz 
(1952) but that the magnitude of the moderat- 
ing effect of influence varied positively with 
organizational status of the respondent. This 
finding led him to conclude that when em- 
ployees are performing work that permits them 
to be independent of their immediate superiors, 
the influence of the superior is less relevant to 
the fulfillment of subordinate expectations. 
Here it is assumed that when employees enjoy 
high organizational status, they are more likely 
to have occupational mobility, to be more 
autonomous, and to be more self-sufficient in 
their work setting. 


Hy potheses 


The previous research and theory suggest 
the following hypotheses to which the present 
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study is addressed: 


1. The relationship between the leader's 
consideration and subordinate role satisfactions 
will be moderated by the amount of influence 
imputed to the leader by the subordinates. 
Specifically, where the superior is perceived as 
having high hierarchical influence, the relation- 
ship between leader consideration and sub- 
ordinate role satisfaction will be positive and 
significantly greater than when the superior is 
perceived to have low hierarchical influence. 

2, Leader consideration will be positively 
related to the amount of influence imputed to 
the leader by his subordinates. 

3. Leader consideration will be positively 
related to the satisfaction of subordinate role 
expectations, 

4. Leader structure will be negatively re- 
lated to the satisfaction of subordinate 
expectations. 

5. Leader technical competence will be posi- 
tively related to the satisfaction of subordinate 
role expectations. 

6. Leader decisiveness will be positively 
related to the satisfaction of subordinate role 
expectations. 


role 


In addition to tests of the above hypotheses, 
exploratory tests were also conducted con. 
cerning the moderating effect of influence on 
the relationship between: (a) the leader’s 
technical competence, initiating structure, and 
decisiveness; and (b) subordinate role expecta- 
tions. These tests are considered explora- 
tory because neither research nor theory 
Suggest specific hypotheses concerning these 
interactions, 
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data were used in this study. Comparisons of the scale 
means and standard deviations of the questionnaries 
used here and the questionnaires with some missing 
data revealed no significant differences between those 
who failed to complete some items and those who 
provided full information. 


Measures Employed 


Satisfaction of employee role expectations. Two auer 
tionnaires, both developed by the Personnel Researe 
Board of the Ohio State University, were used. The 
first was the Job Description questionnaire that con- 
tains 12 questions scaled 1-5. This questionnaire 
essentially measures the employ attitude toward the 
company and its management (Stogdill, 1965b). N 

The remaining seven measures of satisfaction with 
role expectations are contained in a Job Expectation 
questionnaire. Employee satisfaction of role expecta- 
tions is measured with respect to the following classes 
of role expectations: (a) work, (b) freedom, (c) re- 
spondent’s job as compared with jobs of others, (d) pay’: 
(e) advancement, (f) family attitudes toward the 
respondent's job, and (g) job security. 

Respondents are asked to indicate the degree to 
which each characteristic of the job satisfies his oe 
pectations. Each class of expectation is measured by 
scores of seven questions. 

Leader consideration and initiating structure. A form of 
the Ohio State Leader Behavior Description Question- 
naire (LBDQ) described by Halpin (1957) was used to 
measure these two dimensions of leader behavior. 

Leader decisiveness, technical competence, and hierar- 
chical influence. The scales used to measure these 
behaviors were factorially developed by Comrey, 
Pfifiner, and High (1954). These scales consist of four, 
five, and four items, respectively, and, like the LBDQ, 
are completed on a 5-point multiple-choice response 
mode indicating the frequency with which the leader 
engages in the behavior described by the item stem. 

Following is one sample, item for each scale: 


Decisiveness—He acts cautiously in order to avoid 
sticking his neck out, (1) frequently; (2) occasionally ; 
(3) rarely; (4) almost never; (5) never, 

Technical Competence—When a tough job comes up, 
he has the technical “know how” to get it done. (1) 


rarely; (2) occasionally; (3) usually; (4) almost always; 
(5) always. i 


Hierarchical Tnfluen 
Superiors certain benefi 
been approved, (1) f 
rarely; (4) almost ne 


ce—He has requested from his 
ts for his workers which have not 
requently; (2) occasionally; (3) 
ver; (5) never, 

lt is important to point out that the influence scale 
(and the studies by Pelz [1952] and Wager [1965]) 
deals with the upward influences of the leader only with 


respe h decisions as resource allocation, 
organization changes, objective setting, or policy 
Stogdill (1965) reports that reliabilities of measures 
of initiating structure, consideration, and hierarchical 


E, 


Š 
) 
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influence consistently fall in the range of .80 and above 
for several widely different populations including 
United States Senators, company presidents, middle 
managers, military officers, and state police officers. 

These data reported by Stogdill attest to the internal 
consistency of the leader behavior scales. However, they 
do not clearly indicate that the scales measure what 
they are purported to measure. However, Halpin 
(1957) has shown the LBDQ to have concurrent 
criterion validity and a recent experiment reported by 
Stogdill (1969), conducted under well-controlled labora- 
tory conditions, resulted in rather clear experimental 
criterion validation of the Ohio State scales. The study 
by Comrey, Pfifiner, and High (1954) suggests con- 
struct validity of the Leader Decisiveness, Influence, 
and Technical Competence scales. In an extensive series 
of field investigations they found these three measures 
to have factorial independence and to have significant 
correlations with objective measures of work unit 
performance such as scrap and output as well as 
subjective ratings of managerial performance by 
superiors. 


Method of Analysis 


A modified analysis of covariance using dummy 
variables in a regression was employed. Respondents in 
each company were trichotomized on the influence 
variable with the upper and lower thirds representing 
high- and low-influence leaders.* The high- and low- 
influence conditions are treated as dummy variables for 
regression analysis. That is, the variables are coded so 
that each enters the model separately to yield a regres- 
sion coefficient for the relationship between leader con- 
sideration and satisfaction with role expectations under 
each of the influence conditions. 

To test the hypothesis that the relation between 
consideration and satisfaction with role expectations 
will be moderated by the amount of leader influence, 
the following regression model was specified and tested. 


S = ay tay + aD + aY 
+aYIn + aYI, +e [1] 


where: S = scalar value of satisfaction; Dı = 1 when 
influence is high, 0 otherwise; D: = 1 when influence is 
low, 0 otherwise; Y = scalar value of independent 
p iy eae 

2 Trichotomization results in unequal cell frequencies. 
The reason for the regression formulation is that: 


Unequal cell frequencies preclude the simple parti- 
tioning of the sum of the squares. The computation 
needed to generate the test statistics now involve 
matrix inversion in least-squares regression. However, 
from the point of view of hypothesis testing the two 
techniques (the use of dummy variables in regression 
analysis and analysis of variance or covariance) are 
equivalent [Galbraith & Cummings, 1967, p. 251; 
see also Goldberger, 1964, p. 230]. 


3 This procedure also has the advantage of eliminating 
statistical bias due to colinearity among the predictor 
variables since no two predictors with full scalar values 
are included in the regression. Only the full value of Y 
and the partitioned thirds of influence are included. 


variable; VJ = scalar value of independent variable 
under high influence; FZ; = scalar value of inde- 
pendent variable under low influence; ¢ = stochastic 
error term assumed to be normally and independently 
distributed with zero mean and constant variance; ao, 
a, and az = intercept coefficients; @3, &4, and a; = slope 
coeflicients. 

The Parameters ay-as can be interpreted as follows: 
ay = value of the intercept term under medium in- 
fluence, a, + ay = value of the intercept term under 
high influence, a2 + a) = value of the intercept term 
under low influence, as = value of the slope coefficient 
under medium influence, a; + as = Value of the slope 
coefficient under high influence, as + as = value of the 
slope coefficient under low influence. 

From the structure of Equation 1 it is obvious that 
Parameters a, and as are the differential intercept 
coefficients, and the Parameters a; and @s, the differ- 
ential slope coefficients. Because of the form of our 
hypothesis, we are primarily concerned with the slope 
coefficients under the different amounts of influence. If 
ay and as are significant, the relation between S and Y 
under various levels of influence will be significantly 
different. Of course, all the differential coefficients may 
not be statistically significant. If as or as have no effect, 
a; will yield the linear value of the full range of in- 
fluence. If a; or as has a significant effect, as will repre- 
sent only the effect of medium influence. According to 
our hypothesis, we would expect both consideration (as) 
and consideration in interaction with high influence 
(a3 + as) to have significant effect. 

The above model was tested with regression analysis 
and the same procedure was computed four times for 
cach company substituting the other three dimensions 
of leadership style for consideration, that is, letting the 
F variable take on the value of leader initiating struc- 
ture, decisiveness, and competence. In total, 64 regres- 
sions were run (one regression for each of eight inde- 
pendent variables, for each of two companies, for each 
of four leadership style dimensions, or, eight satisfaction 
measures X two companies X four leadership style 
measures). 

To test Hypotheses 2-6, product-moment correlations 
were computed for the relationships between the 
independent and dependent variables and a one-tailed 
test of significance was applied. 


RESULTS 
Hypothesis 1 


The first hypothesis stated that influence 
would have a positive moderating effect on the 
relationship between leader consideration and 
subordinate satisfaction. Table 1 presents the 
significant regression coefficients. 

For Company A, consideration has no effect 
on any satisfaction variables except family 
attitudes under any degree of influence. For 
satisfaction with family attitudes toward the 
job, there is a positive effect on slope, which is 
the same under high and low influence. 
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TABLE 1 


SIGNIFICANT COEFFICIENTS FOR CONSIDERATION AND CONSIDERATION IN ĪNTERACTION 
WITH INFLUENCE AS A PREDICTOR OF SATISFACTION 


Company B: 
Satisfaction with: 
Company A: 
Parameter Satisfaction with = 
family attitudes Job Work Job freedom Family Security 
attitudes 
a —6.54* — 5:15* —6.36* —5.19" a Vil 
t —1.95 2.30 2.13 -19 —2.43 
az —6.97 — s = — ae 
t —2.02 
CA = 12 — .10* — — 
t 2.30 2.19 
as .17* = — .16* .16* — K a 
t 2.14 —2.15 2.28 2.16 
as -18* 
l 2.16 = = 
Rete ain columns with significant values have been included here, 
eb Sion 


It would be a fair generalization to Say that faction with work and job freedom; it is posi- 
for Company A, consideration under any kind tive and higher under high influence (as + 44) 
of influence has very little effect on satisfaction. for satisfaction with Job freedom and security; 

For Company B, the regression results show it is negative under high influence for satisfac- 
a different pattern. The slope effect (a3) is tion with pay. The slope coefficient under low 
positive under medium influence for satis- influence generally does not differ significantly 


from that under medium influence. 
TABLE 2 o far as the hy othesis is concerned then, 
h ta yp o; 
INTERCORRELATIONS BETWEEN LEADERSHIP STYLE x 3 empirical results show that influence hos 
DIMENSIONS practically no effect on the relationship be- 
- tween satisfaction and consideration for Com- 
Dimension 1 | 2 3 | 4 | 5 pany A. For Company B, there is a pronounce! 
A effect for satisfaction of role expectations con- 
cerning freedom on the job and job security: 
1. Structure = a a a g ä 
; E a 00 as 8 re oe moderates the relationship between 
$ Decisveness = p ge Consi eration and satisfaction in the hypothe- 
. Competence = 18 sized direction, 


Company Bb 


Hypothesis 2 


1. Structure 


2. Consideration | — “54**** Ee eel The second hypothesis stated that leader 
3. Decisivenes -5 E ape A A ca 

4. Influence = see some — CONSideration would be Positively related to 
5. Competence = 


20+ — the amount of influence imputed to the leader 


aN ble 2 shows the inter- 


if correlations between the five leadership style 
ir dimensions. As indicated, the correlation be- 
kok + . 

eT, tween consideration and influence is significant 


ignifica 
y. It approximates signifi- 


= 
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TABLE 3 


CORRELATIONS BETWEEN Five DIMENSIONS OF LEADERSHIP STYLE AND EIGHT 
DIMENSIONS OF SUBORDINATE SATISFACTION WITH ROLE EXPECTATIONS 


Leadership style 
Consideration Structure Decisiveness Influence Competence 
Expectation | 
Com- | Com- | Com- | Com- | Com- | Com- | Com- | Com- | Com- | Com- 
pany | pany | pany | pany | pany | pany | pany | pany | pany | pany 
A B A B A B £ B A B 
Company satisfaction 258 55 25° 47b 35° 42>] 20 37° 39 13 
Intrinsic job expectation 31> wee 30 25> 37> 31> | 24> 12 16 12 
Advancement expectation | .35” 43 06 28> 1b 39> | 24b 13 16 12 
Comparison of job to jobs 
of friends 19 34>} .20 20° 30% 26% | 15 06 12 06 
Pay expectation 16 .29» | —.08 Pate 23" 27> | 12 23» | —.08 04 
Expectation of freedom on 
the job 46» 368 02 12 31b 39 | .19 02 15 16* 
Expectation of family at- 
titudes toward the job| .27* 15 33" 21» 09 45b | 14 .09 10 05 
Job security expectation .18 -.ll 2P | =V 13 | —.14 1g | —.05 14 | —.03 


Note.—N = 84, in Company A; N = 158 in Company B, 
at <.05 ifr > .22 when N = 84; and > .16 if N = 158, 
èt È OL ifr S .28 when N = 84; and > .21 if N = 158. 


cance in Company A (r = .178, p = .06). This 
finding corroborates Wager’s (1965) finding 
that influence was positively correlated with 
supportive leader behavior. Thus, it appears 
that there is a general relationship between 
supportive leadership and leader influence, but 
the magnitude of the relationship is unstable 
and dependent on situational factors associated 
with the population studied. 


Hypothesis 3 


The third hypothesis stated that leader con- 
sideration would be positively related to em- 
ployee satisfaction with role expectations. 
Table 3 shows the correlations between the 
eight measures of employee satisfaction and the 
five dimensions of leadership style. In this case, 
15 of the 16 relationships between consideration 
and satisfaction are positive, and of these, 11 
are statistically significant. Thus, consistent 
with prior findings, there appears to be rather 
strong support for the hypothesized general 
relationship. 


Hypothesis 4 
The fourth hypothesis stated that leader 


initiating structure would be negatively related 


to the satisfaction of employee role expecta- 
tions. As indicated in Table 3, the findings are 
strikingly contrary to those hypothesized. 
Initiating structure is negatively correlated in 
only two cases, and there the coefficients are 
quite low and not significant. Contrary to our 
predictions, in 10 of the 16 correlations, they 
were significantly positive and none were 
significantly negative. 


Hy pothesis 5 


The fifth hypothesis stated that leader 
technical competence would be positively re- 
lated to employee satisfaction. As shown in 
Table 3, technical competence is significantly 
related to the measures of satisfaction in only 
2 of the 16 correlations. 


Hypothesis 6 


The sixth hypothesis stated that leader 
decisiveness would be positively related to 
measures of employee satisfaction. Table 3 
shows that of the 16 possible relationships, 13 
are significant in the expected direction, and 
none are significantly negative. Thus, there is 
surprisingly strong support for the hypothesis 
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TABLE 4 


SIGNIFICANT COEFFICIENTS FOR DECISIVE 


NESS AND THE INTERACTION OF DECISIVENESS 


WITH INFLUENCE AS A PREDICTOR OF SATISFACTION FOR Company B 


Satisfaction with: 


! Pi | Bi = i 
Parameter | | i | Pami ey 
| Company — Pay | Job freedom atitudes | Job security 
| i k | Ed 
| - — — —3.61* |  —6.16* | —5.09* 
ü t —2.19 —2.56 —1,99 
3 61: <3; Ot kA Š | k **k 
s | 614 | .533** 21* = | Ss = 33 
t | 2.09 3.71 2.17 | | —2.55 
1 | ga =a s 39" Aor 
j | 2.25 2 | 
ay = | - p es = | = | 33* 
Š | | | | | z4 
á | | | | | 


Only columns with significant values 


that leader decisiveness is related to employee 
satisfaction. 


Exploratory Analysis 


In addition to tests of the above hypotheses, 
exploratory analyses among the dimensions of 
the leader behavior and of the moderating 
effect of influence were conducted. These 
analyses are viewed as exploratory rather than 
tests of hypotheses because there is neither 
previous research nor theory to suggest specific 
hypotheses to be tested. 

Influence in interaction with other leader 
behavior dimensions. To explore further the 
moderating effects of influence, the regression 
procedure described above was repeated three 
times for each company, substituting the 
dimensions of initiating structure, decisiveness, 
and technical competence for Y in the reg 
sion, that is, for consideration. No signifi 
coefficients were obtained in Company 
interactions between leader beh 
fluence. None were obtained in 
for technical competence or for t 
Competence x Influence interacti 
pany B, however, several significa 
were obtained for initiating structure and 
decisiveness. The significant coefficients for 
Company B are presented in Tables 4 and 5. 
, These tables reveal that for Company B the 
influence condition under which significant 
interactions occur and the sign of the param- 


res- 
cant 
A for 
avior and in- 
Company B 
he Technical 
on. For Com- 
nt coefficients 


have been included here. 


eters are highly inconsistent. In Table 4, the 
high-influence condition is shown to signifi- 
cantly moderate the relationship between 
leader decisiveness and role satisfaction with 
job freedom and family attitudes in a positive 
direction. The low-influence condition moder- 
ates the relationship for satisfaction with job 
security, and the medium-influence conditor 
moderates the relationship for satisfaction wit 

company, advancement, and pay. The inter- 
cept values under the high-influence condition 
is negative for satisfaction with freedom, family 
attitudes, and job security. These findings SUS 
gest that low influence serves little or 2° 
Purpose in enhancing the effects of leader de- 
cisiveness, that high influence may have dys- 
functional Consequences, and that there is € 
midrange in which influence is optimal tor 
leadership purposes. 


_ From Table 5, it can be seen that the high- 
influence condition has a negative moderating 
effect on th 


: e relationship between job satis- 
faction and initiating structure, and the inter- 
cept term for three satisfaction variables are 
higher under high influence, 

Relationshi PS among the dimensions of leader- 
ship style. Table 2 Presents the correlation 
Coefficients for the relationships among the 
adership style. As can be 
Company A, initiating 
ate positive relationship 
etence and is relatively 


structure has a moder 
with technical comp 


i] 
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independent of the other measures of leader 
behavior. Consideration is moderately corre- 
lated with decisiveness and competence in 
Company A and slightly correlated with in- 
fluence. In Company B, consideration is 
moderately correlated with decisiveness and in- 
fluence and slightly correlated with com- 
petence. Technical competence is moderately 
correlated with structure, consideration, and 
decisiveness. 

In Company B, excluding technical compe- 
tence, all of the relationships among the 
independent variables are positive and mode- 
rately high. Technical competence has low 
positive correlations with all variables except 
decisiveness, which is positive and moderately 
high. The finding that the leadership style 
measures are differentially intercorrelated from 
company to company again suggests the 
importance of situational factors in leadership 
research. 


DISCUSSION 


The above findings have several important 
implications for organization theory and sug- 
gest several questions for future investigation. 


Influence as a Moderator of Leader Behavior 


That influence was not an important mod- 
erator in one of the two research and develop- 
ment organizations reported in this study, a 
finding quite inconsistent with Likert’s (1961) 
notion, is perhaps explained by Wager’s (1965) 
research. Wager concluded that when em- 
ployees are independent of their superiors, the 
influence of the superior is less relevant to the 
fulfillment of the employee’s expectations. lf 
the research and development personnel in 
Company A, like Wager’s high occupational- 
status groups, have sufficient freedom and 
control over their environment so that they do 
not have to use their superior as a spokesman 
or as a source of resources, then our findings 
could be explained by Wager’s interpretation. 

A second interesting finding concerns the 
specific amount of influence found to have the 
most pervasive moderating effect. In Company 
B, the medium-influence condition more per- 
yasively moderated the relationships between 
initiating structure, decisiveness, and satis- 
faction of subordinate role expectations. One 
interpretation is that the high-influence con- 


TABLE 5 


SIGNIFICANT COEFFICIENTS FOR INITIATING 
STRUCTURE AND THE INTERACTION OF INITIATING 
STRUCTURE WITH INFLUENCE AS A PREDICTOR 
oF SATISFACTION FOR Company B 


Satisfaction with: 
Parameter: B Job 
Ic ompany| Job | Advance- | freedom 
g | ment | 
| 
k 122* | 715| — Pee 
t 1.94 2.28 | 197 
a 
a i = 34 ate 
. 239 | 191 
ay E —17* = | a 
; —2.19 
as 7 = | e 3 


Note.—Only columns with significant values have been in- 
cluded here. 
*p = .05. 


dition permits the superior to exercise so much 
control over subordinates that the relationship 
between superior and subordinate, as viewed by 
the subordinate, becomes highly delicate. Where 
supervisors are seen to have such high in- 
fluence, it is likely that there will be greater 
status separation between them and their sub- 
ordinates, and that such status differentiation 
will result in a restriction of upward informa- 
tion flow, less willingness on the part of sub- 
ordinates to approach superiors, and less satis- 
faction with the social climate of the work unit. 

The above findings, and the speculative 
interpretations advanced here, suggest a need 
for reconsideration of the influence—linking-pin 
proposition. 


Reconsideration of the Influence—Linking-Pin 
Proposition 

If hierarchical influence is to enhance the 
functioning of the linking pin, it appears that 
the members of the group must be of marginal 
organizational status with little autonomy. 
Since the research from which Likert (1961) 
develops his proposition was conducted in 
hierarchical organizations, it is possible that 
the role of influence only functions in the man- 
ner Likert asserts for low-status organizational 
members who are moderately to highly depen- 
dent on superiors for satisfaction. It also 
appears that the optimum amount of influence 
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relationships are statistically significant but 
not particularly strong. If it were found that 
in discussion groups operating outside the 
context of direct observation, these interrela- 
tionships failed to reach significance, it would 
suggest that the nature of the research situa- 
tion itself is affecting the results attained in 
these earlier studies. Data collected as a part 
of the study described below were analyzed 
to examine this possibility. A portion of this 
study has been published elsewhere (Wilson, 
1969). 


METHOD 
Procedure 


Male undergraduates were recruited for part-time 
jobs. They were offered $6.00 to work in a group for 
one afternoon. Scheduling was accomplished over the 
telephone for work with the university’s freshman 
orientation committee (a ficticious organization) at 
which time the secretary casually asked each appli- 
cant whether he knew any of the people already 
signed up for a particular session. If a potential S 
indicated that he was previously acquainted with any 
of the others, either he or his acquaintance was con- 
tacted for rescheduling with an appropriate excuse. 

Twenty groups of five Ss each met in a room with 
a one-way mirror and hidden microphones. Since the 
experiment was to be disguised as a job, certain 
techniques were used to make this seem more feasi- 
ble. For example, the one-way mirror was covered 
with a fiberglass curtain that, while not obscuring 
the mirror completely, made it appear unlikely that 
anyone would be watching from the other side while 
it was drawn. However, it was in fact possible to 
observe Ss through the drawn curtain. 

All Ss were told that they were working for the 
freshman orientation committee, which supposedly 
was exploring the feasibility of having freshmen 
discuss human-relation stories written by upper- 
classmen about common campus problems. Ten of 
the groups (observed) were observed by E who re- 
mained in the room during the group meeting. A 
second O remained behind the one-way mirror. The 
remaining 10 groups (nonobserved) were left alone 
in the room but were observed from behind a one- 
way mirror (covered by a see-through curtain). 
These Ss were not told that they were being ob- 
served. i 

At five equally spaced points (10-min. periods) 
throughout the discussions (2 hr, and 20 min.) O 
counted the number and content of communications 
initiated by each group member. The reliability co- 
efficient between Os in counting total number of 
initiations for each S was .97. The members in each 


4 In the study there were actually four conditions 
but only the observed-nonobserved manipulations 
were successful. For the full set of instructions, see 
Wilson (1967). 
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group were ranked from one to five according to 
how much they participated in the group discussion. 
The two members in each group with the highest 
rates of participation were considered high partici- 
pators and the remaining three, low participators. 
At the end of the session, each group member was 
asked to rank the others in the group as to quality 
of leadership. The sum of the ranks assigned to each 
individual by the other Ss were taken as his leader- 
ship score. The leadership scores of the fve persons 
in a group were then rank ordered from one to five. 
The top two ranks in each discussion group were 
placed in the high-leadership group, and the remain- 
ing three members were placed in the low-leader- 
ship group. , 
The Ss were also given Bass’ (1962) Orientation 
Inventory, which includes three mutually exclusive 
scales. Each person’s score on the Self-Orientation 
scale was taken as a measure of dominance or as- 
cendance. The results of earlier studies using the 
Self-Orientation scale seem to justify this choice. 


RESULTS 


In order to determine the relationship be- 
tween participation and self-orientation in the 
observed and nonobserved groups, the self- 
orientation scores of high and low participants 
were compared under both conditions. From 
Table 1 it is seen that while high partici- 
pators tend to have higher self-orientation 
scores than low participators in both co” =a 
tions, only in the observed conditions is this 
difference statistically significant. 1 

To determine the relationship between lead- 
ership and self-orientation, the self-orientatio” 
scores of the high and low leadership ranks 
were compared in the observed and nono?” 
served conditions. It is seen in Table 1 that 
only in the observed conditions do those 18 
in leadership have higher self-orientatio® 
scores. Also, only in the observed conditions 


For example, in a review of studies using the 
Orientation Inventory, Bass (1965) reports that 
the self-oriented S is “one who likes to assert him- 
self, to compete, to stand out and be visible P: 
121.” Self-orientation has been positively correlate 
With aggressiveness-competitiveness (.23) on the 
Sixteen Personality Factor Questionnaire (Cat- 
tel, Saunders, & Stice, 1957) and on the Thematic 
Apperception Test, self-orientation correlated with 
the need for aggression (.25) and with the need for 
autonomy (.22). In small discussion groups, sal- 
orientation correlated positively with the followink 
peer ratings: “(a) concerned about himself and his 
own personal needs and feelings”; (b “takes un- 
justified assumptions”; (c) dominates and imposes 
his will on the group (Bass, 1965)” 


OBSERVED AND NONOBSERVED GROUPS Reais | 


TABLE 1 


MEAN SELF-ORIENTATION SCORES BY PARTICIPATION, 
LEADERSHIP, AND EXPERIMENTAL CONDITIONS 


Observed Nonobserved 
Score 
eal w | so | i | s |x@\| x | sol b 
| | 
Participation | 
High 31.10 20 5.36 28.55 20 6.24 
3.28 .01 1.09 ns 
Low 25.27 30 6.49 26.63 30 5.69 
Leadership* G 
High 30.24 21 5.53 27.05 20 6.90 
2.47 .02 AT ns 
Low 25.69 29 6.83 27.87 30 5.20 


Note,—Computation of the standard deviations and ¢ are based on the arcsine transformation of the original figures. Two- 


way analyses of varian: 


a In one observed group, there 


showed a participation main effect but no leadership effect. In both cases there was no observer effect, 
but the interaction effects were significant at approximately the .10 level. 
tie for second place rank on leadership. The conservative resolution was to include both 


Ss in the high-leadership group. Thus, there were 21 Ss in high leadership in the observed condition, 


is there a significant difference between the 
high- and low-leadership ranks. 

The association between leadership and 
participation ranks was examined for Ss in 
the observed and nonobserved groups. Table 
2 shows the results of this analysis. In the ob- 
served groups there is a strong positive associ- 
ation between participation and leadership. 
In the nonobserved groups no such relation- 
ship exists. 


Additional Findings 


Some further findings are relevant at this 
point. Each communication act initiated by S 
was classified according to one of the following 
categories: (a) positive social-emotional (rein- 
forcement, agreeing, joking); (0) negative 
social-emotional (disagreement, rejecting, an- 
tagonism) ; (c) task oriented (all interaction 
concerning the task); (d) task irrelevant 
(out-of-field interaction not classified as posi- 
tive or negative social-emotional) ; (e) posi- 
tive task morale (positive attitudes about the 
task or situation as an object) ; and (f) nega- 
tive task morale (negative attitudes about the 
task or situation as an object). From this, the 
percentage of each S’s total communication 
acts falling into each of the six areas could be 
determined. This made it possible to deter- 
mine whether there were any differences in 
communication content profiles between the 
high and low Ss (on leadership, participation, 


and self-orientation) in the two experimental 
conditions. 

The only communication category that dif- 
ferentiated between the high and low Ss on 
any of the three variables was positive task 
morale. The results are seen in Table 3. In 
the observed condition, Ss high on leadership 
and participation initiated higher rates of 
positive task morale. In addition, in the ob- 
served condition self-orientation was signifi- 
cantly correlated (Pearsonion r) with posi- 
tive task morale (.37, P< .01), while in the 
nonobserved condition it was not (—.13). 

The positive and negative task morale cate- 
gories were included in the study to determine 
how the presence or absence of O affected S’s 
willingness to verbalize attitudes about any 
aspect of the situation (e.g., room, O) or the 


TABLE 2 


RELATIONSHIP BETWEEN LEADERSHIP AND PARTICIPA- 
TION IN OBSERVED AND NONOBSERVED GROUPS 


ee Observed Nonobserved 
oka leadership rank* leadership rank” 
pation 
ranks = 

High | Low | Total | High | Low Total 
High 17 


| 2 | 10 | 10 | 20 
| io | 20 | 30 
50 | 20 | 30 | 50 
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TABLE 3 


Positive Task MORALE RATES BY PARTICIPATION 
LEADERSHIP AND EXPERIMENTAL CONDITIONS 


Observed Nonobserved 
Task morale - - | = | z 7 F 
ate 5 | | Ë (% N SD | 
r ¥(%) | nv | SD | t | p |X (%) | -i | B 
| | | 
Participation | a wer | 
i | 1.89 20 3.08 1.20 | 2 4 
me | | 3.69 | o1 | | as 
ow 0.71 30 | 3.62 | oss | 30 | 355 | 
shi | | 
ten 212 | 21 | 247 | | u | 20 | 434 | 
ii 5.26 001 | 94 ns 
Law 0.64 29 | 3.53 | | ós | 30 | 3.7 | 


Note.—Computations of the standard deviation and ¢ are based on the transformation of the original figure 
analysis was the percentage of each Ss total interaction categorized as positive task morale. Two-way analyses of va 


s. The unit of 
nce showeG 


í 
; e x ays a y the 
participation and leadership main effects but no observation effects. The interaction effects were significant at approximately 


-10 level in both cases, 


n z include both 
a In one observed group there was a tie for second place rank on leadership. The conservative resolution was to include bi 
Ssin the high-leadership group. Thus, there are 21 Ss in high leadership in the observed condition. 


task. Thus, those who ultimately were chosen 
as leaders in the observed group were likely 
to have said something pleasant or construc- 
tive about the situation. In the nonobserved 
condition there was a tendency for leadership 
to be assigned to those who had initiated 
higher rates of task-oriented interaction, but 
this did not quite reach statistical significance 
(p < .10). 

The postsession interviews provide some 
cues about the perceptions of Ss in the ob- 
served condition. When asked what they 
thought O was looking for, 60% of the ob- 
served Ss stated in various ways that O was 
looking for patterns of leadership, dominance, 
influence, or differences in participation. The 
Ss in the nonobserved condition gave no in- 
dication that they saw these issues as having 
any relevance to their being in the situation. 
It is likely that the salience of these issues in 


€ Such interaction was not classifiable as task-ori- 
ented communication because it had the task or 
situation as an object, rather than being directly 
involved in reaching the group goals (the writing of 
the stories). Examples of interaction that would be 
classified in these categories might be: (a) “I think 
this work is worthwhile” (positive task morale) or 
(b) “This whole job is a farce” (negative task 
morale). The reliability coefficient for this category 
was .48, which is rather low because little of this 
interaction was recorded. Therefore, only a few 
disagreements between Os contributed a high propor- 
tion of error. 


the observed condition affected the perception 
of Ss so that they consciously or unconsciously 
used the same criteria in choosing leaders that 
they felt O was using. 


DISCUSSION 


This section, in offering some possible ae 
planations for the differences reported aboV 
takes some cues from those who have. eam 
lyzed the social psychology of the experimen 
(Orne, 1962; Riecken, 1958; Rosenberg, 
1965). Thus, our interpretations are in terms 
of the meanings that the observed and non- 
observed conditions held for Ss and how these 
effected their behavior in the two situations- 
These are admittedly post hoc explanations 
but it is hoped that they provide some hy- 
potheses for further research. 

Following the postulations of Riecken 
(1958) and Orne (1962), it would be ex- 
pected that with O in the room each group 
member is likely to be concerned with how 
he is being evaluated in the eyes of O. This 
concern, or what Rosenberg (1965) calls 
“evaluation anxiety,” should affect various 
personality types in different ways. For those 
Ss with high self-oriented needs, this situation 
would serve as a perfect arena for calling 
attention to himself and expressing his vari- 
ous self-centered needs (see Bass, 1965). This 
type of person would want to be highly visi- 


OBSERVED AND NONOBSERVED GROUPS Rea U 


ble and, in a leaderless discussion group, one 
must compete for visibility and status by par- 
ticipating highly (see Carter, Haythorn, 
Shriver, & Lanzetta, 1950). 

Tt is interesting that both observers inde- 
pendently reported the impression that the 
high participators in the observed groups were 
more concerned with putting on a show for O 
than with actually accomplishing the task— 
traits that characterize the highly self-oriented 
person. Further evidence that the high par- 
ticipators, who were also high in self-orienta- 
tion, were playing roles for O is seen in the 
fact that they initiated high rates of positive 
task morale interaction. This particular group 
of observed Ss (high self-orientation) seems 
to be highly motivated to ingratiate them- 
selves with O. 

In a situation where evaluation-anxiety is 
high and interaction strained, group members 
should be gratified by those who are willing to 
carry out what Redl (1955) calls the “guilt- 
and-fear assuaging effect of the initiatory act.” 
Those who dominate the discussion fulfill the 
function of breaking the ice, and of course 
become highly visible. In short-term groups 
with no defnite structure, what is meant by 
leadership is not at all clear. Certainly, one 
prerequisite to being chosen as a leader is 
standing out and being visible in the group, 
and which behavior stands out in a group 
depends on the nature of the situation. 

In the observed situation, where, as re- 
ported elsewhere (Wilson, 1969), group mem- 
bers uniformly initiated high rates of task- 
oriented interaction, standing out would be 
dependent upon sheer volume of interaction. 
Thus, in the observed condition, high partici- 
pators were chosen as leaders. In the nonob- 
served condition, where the groups initiated 
less task-oriented interaction (see Wilson, 
1969), those Ss whose interaction profiles 
showed high task orientation would be most 
visible. As mentioned above, the choice of 
leaders in the nonobserved groups tended to 
depend more upon task orientation. In addi- 
tion, it was found in a separate analysis, not 
reported here, that the rankings of Sson a 
dimension called “quality of ideas” were sig- 
nificantly more highly related to leadership 
rankings in the nonobserved condition than 
in the observed condition. Thus, contributions 


to the task at hand are necessary to gain 
leadership in groups not bothered by the prob- 
lems raised by the pressure of an Ø. 

A final possible explanation for the inter- 
correlations observed is suggested by Gibb 
(1950) in a study of ad hoc groups, where 
both O and peer ratings of leadership were 
obtained. He suggests that “participants may 
have become increasingly aware of the ex- 
perimental interest in leadership in the study 
and may have endeavored to make their own 
records ‘look better’ by choosing only those 
whom they expected the observers would also 
choose on quite different criteria [p. 669].” 
Tn the present study, the observed Ss’ ability 
to verbalize what they felt O was looking for 
indicates that they were searching for what 
Orne (1962) has called the “demand charac- 
teristics” of the situation. Thus, their re- 
sponses on the questionnaire may have been 
consciously or unconsciously influenced by 
what they felt were O’s criteria for evaluation. 
If this were the case, those being most visible 
and having made ingratiating responses 
would be seen as ready candidates for leaders. 

If only the overtly observed groups had 
been the source of data, the statistically sig- 
nificant relationships among leadership, par- 
ticipation, and dominance-ascendance would 
be consistent with the results of earlier studies. 
However, the failure of these relationships to 
reach statistical significance in the nonob- 
served condition plus the fact that in the 
earlier studies these interrelationships were 
not generally strong, suggests that these rela- 
tionships may hold only for groups operating 
in the presence of an O. This strongly sug- 
gests that O is a situational variable whose 
effect must be considered when attempting to 
make generalizations about leadership and 
group behavior based upon observations made 
in test or experimental situations. 

It is now generally accepted that a leader 
in one situation will not necessarily be a 
leader in another. Gibb (1954) suggests that 
the person who best fulfills the needs of the 
group members at a particular time will be 
seen by the others as a leader. This is cer- 
tainly clear from the present study, where it 
was shown that when an investigator con- 
structs a situation for the purpose of observ- 
ing behavior in groups, he is manipulating the 
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context that determines the behavior of vari- 
ous group members, their perceptions of each 
other, and the criteria used for evaluating 
leadership. , 

These results should be of special interest 
to personnel managers who often attempt to 
predict leadership behavior in other contexts 
by observing behavior in a group situation 
where O is a central component. This may 
lead to erroneous predictions. For example, 
the person who, through peer nominations, 
emerges as a leader in an experimental or 
observational setting is likely to possess quali- 
ties that are useless when the individual is 
operating in situations where strong self- 
oriented needs cannot be gratified. It is in- 
teresting that the leaders in the non-observed 
conditions appeared to be more task oriented, 
which seems more in line with the common 
sense notion of leadership. This type of per- 
son was, most likely, performing adequately 
in the observed condition but merely did not 
come to light. He would therefore be the type 
of person who could function well in a wide 
range of group situations. 

In group-leadership studies, where the na- 
ture of the situation is of obvious importance, 
it is impossible to ignore the impact of the 
research process or the behavior being ob- 
served. It is hoped that more studies dealing 
with the nature of this impact will be forth- 
coming in the future and that pure and ap- 
plied researchers will utilize this information 
to help eliminate errors in making generaliza- 
tions from experimental test situations. 
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RELATION OF A TRAINABILITY INDEX TO 
T-GROUP OUTCOMES* 


CHRISTOPHER J. PINO? 
D'Youville College 


The ability to screen potential T-group members by a relatively incomplex test 
procedure could prove to be an asset in the area of human relations training. 
‘A trainability index is provided by a combination of scores on Thelen’s Reac- 
tion to Group Situations Test. Trainability indexes were calculated, and 
self-disclosures and empathy ratings were measured previous to and following 
sensitivity training of two groups—psychiatric aides and Office of Economic 
Opportunity aides. Positive relationships between trainability and the two 
T-group outcome measures (openness and empathy) were found for both T 
groups. This provides further validation for the trainability index in human 


service professions. 


The sensitivity training (or T-group) 
movement has provided the social sciences 
with the method for instituting personal and 
social change, as well as a productive channel 
for small group experimentation. Bradford, 
Benne, and Gibb (1964) list the following 
major objectives of the T group: (a) to learn 
more about one’s self, (b) to facilitate group- 
member function in other group situations, 
(c) to be a more effective group member, 
and (d) to create greater understanding of 
group behavior. 

In the present study, the T group was 
seen as a vehicle for developing interpersonal 
skills necessary to assist mental health aides 
and Office of Economic Opportunity (OEO) 
aides in a vital aspect of their job function— 
that is, in their interpersonal interactions. 

The specific purpose of the study was to 
evaluate an index for predicting individual 
differences in trainability among participants 
in such T-group activities in these human 
service professions. Recent reviews of the 
effectiveness of T groups have been provided 
by Buchanan (1967), House (1967), and 
Campbell and Dunnette (1968). Although 
the evidence is not uniformly positive, it is 
clear that positive results can be achieved. 
a 


1The author would like to thank Lawrence 
Whisler for his help in collecting data at Chicago 
State Hospital. 

2 Requests for reprints should be sent to 
Christopher J. Pino, Department of Psychology, 
p’Youville College, 320 Porter Avenue, Buffalo, New 


York 14201. 


There are still a number of questions regard- 
ing the conditions under which effective re- 
sults are obtained. An emerging finding is 
that different individuals may react to such 
experiences in different ways. 

Steele (1968), for example, attacks the 
idea that all individuals have adequate skills 
for learning in T groups. Using the Sensation- 
Intuition (S-N) scale of the Myers-Briggs 
type indicator, he found that trainees scoring 
high on the intuitive (N) mode of perception 
(ie., where the perceiver adds to whatever 
is given in the stimulus situation itself 
through his ideas and associations) were 
more effective learners in the T group 
than trainees scoring at the sensation (S) 
end of the continuum (i.e., regarding the 
process of becoming aware of things directly 
through the S senses). Smith (1966) points 
out that individual differences are great in 
the benefits derived from T-group experience. 

Portal-Foster (1970) applied Bion’s (1951) 
work-emotionality theory to small discussion 
groups of school teachers. In nearly all in- 
stances, his analysis supported the theory as 
being a practical descriptive technique. The 
theory failed, however, in accounting for 
individual differences under group pressures. 

Stock (1964) summarized a number of 
independent studies regarding individual dif- 
ferences among T-group participants, which 
showed that certain trainee characteristics 
were associated with readiness to learn and 
positive change while other characteristics 
were associated with lack of positive change. 
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A major step toward identifying these 
characteristics is the work of Stock and 
Thelen (1958). They developed a trainability 
index derived from a test they devised to tap 
interpersonal tendencies in T groups—that 
is, the Reaction to Group Situations Test 
(RGST). The theoretical underpinnings of 
this test were derived from Bion’s (1951) 
work-emotionality theory of group behavior. 
Mathis (1955) conducted a validation study 
that demonstrated that the trainability index 
was predictive of self-perceptual changes 
derived from laboratory training. 

Since, in many instances, the T-group ap- 
proach has been shown to be an effective 
method for training human relations skills in 
its participants, the assessment of potential 
members’ trainability would seem a necessary 
step in the selection process. Stock and 
Thelen’s (1958) RSGT yields a trainability 
index that may be calculated prior to train- 
ing to accomplish this task. The present re- 
search related this trainability index to two 
behavioral criteria: openness (cumulative 
self-disclosure in T groups), and empathy 
(MMPI Role Play scale). The utility of 
the trainability index was tested in two 
different populations: Office of Economic 
Opportunity (OEO) aides and mental health 
aides. 


METHOD 
Subjects 


The Ss in one of the T groups were 12 Negro, 
predominantly lower socioeconomic status, mental 
health (or psychiatric) aide trainees employed at 
Chicago State Hospital. Most Ss had nearly com- 
pleted high school or its equivalent. All Ss had 
been employed at the hospital for at least 3 mo., 
none for longer than 1 yr. All were residents of 
the inner city. Their ages ranged from 18 to 45 yr. 

The second T group was composed of 9 OEO 
aides employed in a rural county in western New 
York state. All the six Caucasian and three Negro 
members were high school graduates 


from the lower and lower-middle 
level. 


and were also 
socioeconomic 


Instruments 


Self-reference content analysis. Reisel (1959) 
adopted the self-reference content analysis scheme 
from Bugental (1948). Self-references during inter- 
action are seen as a method of disclosing self to 
others. The analysis is a simplified scoring system 
that can be easily learned. According to Culbert 
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(1968), two phases of the reference system are: 


First there is the determination of thought units. 
A thought unit is defined as a series of hag 
expressing a single idea or thought; basically, y 
is equivalent to simple grammatical sarine 
Second is the gnment of thought units o 
content categories. There are three types ol cates 
gories having to do with self-references which, for 
these purposes, imply self-disclosure. THe ae 
type of self-reference refers to a person” tall : a 
exclusively about himself. In this case he is eit He 
subject or the subject and object of the tan 
units, which are assigned to the content categ a 
“S—”, The second type of self-reference refers 9 
the person discussing the way some group meine 
or incident affects him. Such units are collectec 
under the symbol “<—S”. The third type of self- 
reference refers to the person discussing the effect 
of his participation on the group or on ae 
sub-set of group members. These thought U 
are collected under the symbol “S—>” [pp- 53-54]. 


All nonself reference statements are given the sbo 
N. For use in the present research a simpli >r 
method was used. Speech segments were mr r- 
either as S (self-reference) or as N  (nonself-rete 
ence). 

Role-Playing scale. The Role-Playing scale E 
MMPI (Dahlstrom & Welsh, 1960) is a eb les 
true-false scale tapping the ability to exchange soe) 
or empathic ability. Specifically, McClelland’s e. vith 
Cefinition is that it “refers to the capacity zed 
which a person can perceive and act out Ore ae 
behaviors and roles” (ie. putting himself in i the 
one elses position). Split-half reliability “avins 
criterion (multiple student ratings of rle 
ability) was .66; the test-retest reliability verk 
over a 1-mo. period on a second sample. Tyo į 
samples were used as further tests of validity. GT 

Reaction to group situation test, The rad 
developed by Stock and Thelen (1958), transla oe 
Bion’s (1951) work-emotionality theoretical La 
work into an objective test. A variety (28) of pee 
situations make up the test items (incompleted ne 


tence stems that S fills in), The test is untimed. 
Stock and Thelen state: 


The subject's responses are scored along three 
dimensions: acceptance or non-acceptance of the 
stimulus, the kind of affect introduced overtly oe 
covertly into the response, and the manner of a 
response. Early work established the reliability ° 
the scoring procedure and demonstrated that dey 
scriptions of the same individuals from jibe 
RS.G.T. protocols corresponded with descriptions 
of the same individuals from observations of thei! 
behavior in the group [p. 226]. 


Mathis (1958) conducted validation studies 0n @ 
trainability index derived from the RSGT. ane 
trainability index was given as the ratio of adient 
(approach behaviors) to abient (avoidance behav- 
iors) indicators from the RSGT. 


RELATION OF A TRAINABILITY 


Previous studies with this test (RS.G.T.) sug- 
gested that relative valences for fight, pairing, 
dependency, and flight (categories derived from 
the RS.G.T.) might be obtained as the number 
of times each of these valences respectively was 
introduced in the responses of the 28 emotionally 
charged stimulus items. An estimate of the con- 
flict. over each emotionality could be obtained 
from the balance of accepted to rejected stimuli 
keyed to each modality (eg. fight, dependency, 
etc.) ; and immobilization could be estimated from 
the number of times responses in the test gave 
signs of emotional blocking. Thus, an Index of 
Trainability would be the sum of fight, pairing, 
and conflict responses divided by the sum of 
flight, dependency, and immobilization responses 


[p. 152]. 


Procedure 


Both groups (OEO aides and mental health aides) 
were involved in similar human relations training 
experiences. The T groups met for a 24-3-hr. session, 
one morning per week for a 4-mo. period, The same 
trainer met with both groups. The format was a 
traditional T group one, heavily group-process ori- 
ented, with some structured experiences (e.g. Jones 
& Pfeiffer, 1969, and Schutz, 1969) interspersed. 
Some self-rating feedback devices (e.g., Miles, 1959; 
Pino, 1969), and occasional lecturettes (e.g. 
“JoHari’s Window” of National Training Labora- 
tories, 1965) were also employed. 

The RSGT was administered to both groups prior 
to training. The RSGT protocols were scored by 
two trained raters (interrater reliability = 83), and 
trainability indexes were calculated for each member. 
Each session, for both T groups, was taped, and 
10-min. segments from every session were chosen 
randomly and rated for self-disclosure according to 
Reisel’s (1959) self-reference content analysis system 
by two trained raters (interrater reliability = 88). 
Both groups were given the MMPI Role Play scale 
(a measure of empathy) before and after training. 


Hypotheses 


1. In each T group, trainability and frequency 
of cumulative self-disclosure statements will be 
associated (indicating openness behavior). 

2. In each T group, trainability and pre-post- 
positive changes in MMPI Role Play scale scores 
will be associated (indicating increases in empathic 
ability over time). 


Statistical Treatment 


For each of the T groups, all members were 
ranked in terms of their trainability index, their 
pre-post-positive change in Role-Play scale scores, 
and in regard to cumulative frequency of self- 
disclosure behavior for the T group sessions over 
the 4-mo. period. Spearman rank correlation co- 
efficients (Siegel, 1961) were calculated for each 
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group between trainability and cumulative self- 
disclosure rankings. To test the two hypotheses, the 
four Spearman correlation coefficients (rs) were 
tested for significance by means of the procedure 
provided by Siegel (1961, p. 211). 


RESULTS 


The results showed that for both T groups: 
(a) trainability and pre-post change in the 
Role Play scale are significantly associated 
(.86 for mental health aides and .67 for OEO 
community aides), and (b) trainability and 
members’ self-disclosure are significantly 
associated (.91 for mental health aides and 
‘88 for OEO aides). All these coefficients 
are significant (one-tailed tests) at the .01 
level except for the .67 coefficient, which is 
significant at the .05 level. 


DISCUSSION 


Sensitivity training is seen as a valuable 
method of training those interpersonal skills 
necessary in any helping process. Thus, the 
job effectiveness of the two groups studied 
here (community aides and mental health 
aides) would seem to be predicated on these 
types of skills. A search of the research 
literature on T groups indicates that the 
effectiveness of T groups in building these 
skills differs according to members’ potential 
trainability. 

The research conducted here tested this in 
terms of two outcome criteria: (a) increase 
in empathy (measured by MMPI role scale), 
and (b) self-disclosure behavior of members 
cumulated over the training period. Results 
indicated a strong association between train- 
ability and the outcome measures, providing 
further validation of the trainability index 
and demonstrating its utility in two mean- 
ingful contexts, that is, the training of mental 
health aides and OEO community aides. 

Mathis (1955) suggested that the train- 
ability index could be a useful screening 
device to pick trainees for laboratory train- 
ing. The present research may provide a basis 
for further studies in assessing the trainabil- 
ity index as a method for selecting candidates 
for jobs in human services. Since many of 
the human services job training programs do 
use laboratory training methods, it follows 
that the trainability index could play a de- 
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cisive role in selecting candidates that could 
profit from the training experiences and, per- 
haps, sustain changes 7 helping skills (e.g., 
f enness, etc.). i 
“ieee : backlog of research to indicate 
that such variables are essential in helping 
relationships provided by human service 
workers (Carkhuff, 1970). Mental health 
aides, in many inpatient settings are tradi- 
tionally responsible for a majority of staff- 
patient interactions. There is an increasing 
use of these nonprofessional workers in both 
inpatient and outpatient counseling capaci- 
ties. The OEO aides have a more unstruc- 
tured role in terms of their helping relation- 
ships. The OEO aides are employed in one 
of the several OEO Community Action Pro- 
gram’s components—that is, Head Start, 
Family Planning, Community Organization, 
Legal Aide, Neighborhood Youth Corps. Job 
assignments may range anywhere from a 
teacher aide to nonprofessional youth coun- 
selor, to community organizer, etc, A common 
thread to all assignments is the requisite of 
supplying a helping relationship to the eco- 
nomically deprived. Concretely, the job in- 
volves supplying information and technical 
assistance, based on the ability to enter into 
trust-building human relations, where the 
consumer (low-income family) views the aide 
as a competent helper and worthy confidant. 
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SIMILARITY OF POST-INTERVIEW TRAIT RATING 


INTERCORRELATIONS 


AS A CONTRIBUTOR TO 


INTERRATER AGREEMENT IN A STRUCTURED 
EMPLOYMENT INTERVIEW’ 


MILTON D. HAKEL * 


Ohio State University 


Degree of interview structure, use of scaled-expectation rating scales, and simi- 
larity of postinterview trait rating intercorrelations are three variables that 
presumably influence interrater agreement in employment interviews. Nine 


nursing interviewers sat as a selection 


board and interviewed and independently 


rated 54 senior nursing students. The interviews were highly structured, and 
ratings were recorded on scaled-expectation scales for general staff nursing 
positions. Although all interviewers shared essentially the same structure among 
their postinterview trait ratings, interrater agreement was no better than in 
previous studies. This finding shows the power of halo in the interview setting. 


Sharing the same structure in ratings 


(and using the scaled-expectation scales 


in a highly structured interview) by no means standardized perceptions of 


job applicants. 


Instrumentation in the selection and place- 
ment of job applicants was revolutionized 
after World War I by the widespread intro- 
duction of tests of various abilities and, then, 
tests of interests and character. These instru- 
ments have achieved an important and valu- 
able place in the employment process and are 
the objects of continuing refinement and cali- 
bration. In the last few years, it has been 
realized that the employment interview, also 
an instrument for making observations, ought 
to be given the same attention devoted to 
employment testing. The focus now is on 
studying the interviewing process, making it 
too the object of continuing refinement and 
calibration. 

It takes little acquaintance with the litera- 
ture on the employment interview to realize 
that interrater agreement is too low to justify 
psychometric use of the interview (see re- 
views by Mayfield, 1964; Ulrich & Trumbo, 
1965; Wagner, 1949; and Wright, 1969). 

The outlook is not entirely bleak, however. 
The literature offers suggestions about varia- 
bles controlling interrater agreement in the 


a 
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2 Requests for reprints should be sent to Milton D. 
Hakel, Department of Psychology, Ohio State Uni- 


versity, 1945 North High Street, Columbus, Ohio 


43210. 


employment interview. First, it is generally 
felt that a structured or patterned interview 
is more reliable than an unstructured one 
(Bingham, Moore, & Gustad, 1959; Wagner, 
1949; Wright, 1969). Although no study has 
been reported that directly varied the amount 
of structure to observe its effects on reliability, 
studies reported by McMurry (1947) seem to 
support this view. 

A second variable that may control inter- 
rater agreement is rating scale format. Maas 
(1965) showed that agreement on scaled-ex- 
pectation rating scales is significantly superior 
to agreement on adjective rating scales an- 
chored in the following manner: very good, 
good, average, poor, and very poor. 

Finally, one finding of the McGill studies 
(Webster, 1964), that interviewers apparently 
make selection decisions by testing the good- 
ness of fit of each applicant against a stereo- 
type of an ideal applicant, is especially rele- 
vant to interrater agreement. Interviewers 
sharing similar stereotypes presumably should 
show higher interrater agreement than inter- 
viewers whose stereotypes differ. Intuitively, 
it seems clear that interviewers who select 
applicants by reference against dissimilar 
standards should show larger disagreements 
than interviewers who use the same standard. 
The usefulness of this suggestion clearly de- 
pends on finding differences among interview- 
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ers in their stereotypes for particular occupa- 
tions. The McGill studies seem to show that 
all interviewers within an occupation use the 
same stereotype, but other research by May- 
field and Carlson (1966) and Hakel, Holl- 
mann, and Dunnette (1970) questions this 
position, thus making an analysis of stereo- 
type similarity and interrater agreement desir- 
able. 

This research examines interrater agreement 
as a function of rater similarity of within- 
interviewer postinterview trait rating inter- 
correlations, recovered from ratings made on 
scaled-expectation rating scales in highly 
structured board interviews. The procedures 
used, such as a highly structured interview 
guide, scaled-expectation scales, the board 
interview setting, and a preexperiment lecture 
on biases (leniency and halo) in ratings were 
all intended to facilitate agreement among 
raters. Thus, the study really focuses on one 
question: Do interviewers who structure their 
postinterview ratings similarly agree more 
with each other than with other interviewers 

- whose structures differ? 

It is mathematically necessary that if two 
raters show perfect agreement on all scales, 
then the within-rater intercorrelations will be 
identical for those two raters. But this rela- 
tion is not symmetrical. Two raters might 
have identical patterns of trait intercorrela- 
tions and show no agreement whatsoever (r 
= .00) or even perfect disagreement (r= 
—1.00). This research is directed at the latter 

point. 
METHOD 
Subjects 


Nine nursing administrators and supervisors from 
seven hospitals served as interviewers in this re- 
search. Their median age was 44 yr, with a 
range of 27-53 yr, All were registered nurses, and 
seven of them held college degrees in Nursing Ad- 
ministration. Median interviewing experience was 
4 yr., with a range of 1-12 yr. All occupied positions 
in which interviewing nursing job applicants was an 
integral part of the job. 

Fifty-four student nurses from three nursing schools 
volunteered to serve as general staff nurse job 
applicants and to be questioned by the interviewing 
board. All students were in the final 3 mo. of train- 
ing and were preparing for the Registered Nurse 
examinations. The investigator met with groups of 
students from each of the schools to recruit volun- 
teers: the reasons for studying interrater agreement 
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and a description of the board interview procedures 
constituted the content of these meetings. Rate of 
participation was high. Each student received a copy 
of a general staff nurse job description and was 
asked to study it prior to her interview. This was 
done to assure that students from each school had 
relatively common expectations about the job be- 
haviors “required of general staff nurses. Each stu- 
dent also completed an application form. The stu- 
dent’s median age was 20 yr., with a range of 19-31 
yr. Eleven students were interviewed the first day, 
15 the second, and 14 on both the third and fourth 
days. Interviews lasted about 15 min. each. 


Procedures 


The interviews proceeded as follows: copies of 
the first interviewee’s application form were dis- 
tributed to each interviewer for a brief examination. 
The applicant entered the room, was introduced 
to the board, and was seated at the head of the 
table. Questioning proceeded according to an pea 
view guide, which was developed and revised with 
the cooperation of the participating interviewers for 
use in this study. The final version of the guide 
contained nine questions (one asked by each inter- 
viewer) . j 

The nine questions were as follows: (a) On zom 
application blank, you described your idea of a e 
supervisor as being a person who... . Why do Kir 
feel this way? On your application, you described ^ 


4 ee 
poor supervisor as one who, .. . Why do you 
this w: 


Id you say to him? What woli 
) Do you feel that it is important y 

our charts and records up to date? wi ed 
de has just completed giving morning ca 4 
ent and you come into his room to am 
his condition. What kinds of things would you we 
for? (If no satisfactory answer is made, ask, “Wha 
would you look for in terms of the patient’s com- 
fort?”) (f) What would you do if a patient who 
ad just received an injection of penicillin told le 
he was feeling queer and having trouble breathing 

(g) Suppose you are caring for a patient who is 1n 
critical condition. The patient’s family is at the hos- 
Pital, they are quite upset and know very little about 
the patient’s current condition. What, if any, is your 


you do? (d 
keep all y 
(e) An ai 
to a pati 


responsibility to the family? If the patient’s wife 
Stops You in the hall to ask about her husband’s 
condition 


» what would 
many responsibilities to 
responsibilities is simply 


you say? (h) Nurses ae 
their patients. One of ae 
to possess a sufficient leve 
of technical competence to be able to meet the medi- 
cal needs of patients, Another of these responsibili- 
ties is to render some kind of psychological support 
to the patient. With respect to your nursing educa- 
tion, in which of these areas of responsibility do you 
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feel you are best prepared? Are you satisfied with 
your preparation in these areas? (i) One last ques- 
tion: In terms of your nursing career, what would 
you like to be doing 10 yr. from now? 

Probing and follow-up questions were encouraged. 
The emphasis throughout the interview was on ob- 
taining as much information about the applicant as 
possible, and to this end the guide contained at least 
one question specifically relevant to each of the 
scaled-expectation rating scales used in the study. 
Following completion of the questioning, the appli- 
cant left the room, and the interviewers made inde- 
pendent ratings. The second application form was 
distributed, the applicant was introduced, and so on. 
After about half of the interviews on each day, ques- 
tions on the interview guide were systematically re- 
assigned in an attempt to equate interviewer use of 
the questions over the entire series of interviews. 

At the end of an interview, each interviewer inde- 
pendently rated the applicant on six rating scales. 
The scales were titled Conscientiousness, Observa- 
tional Ability, Skill in Human Relationships, Organi- 
zational Ability, Knowledge and Judgment, and Ex- 
pected Level of Performance. They were presented in 
this order, one scale to a page, in a rating booklet. 
The first five of these scales were developed by Smith 
and Kendall (1963) and the National League for 
Nursing for the evaluation of general staff nursing 
performance and are known as scaled-expectation 
scales. The noteworthy feature of these scales is the 
use of behavior examples to define levels of per- 
formance on the scale. The use of behavior examples 
to anchor scales virtually eliminates the semantic 
ambiguity that attends anchors such as “average,” 
“superior,” and similar adjectival anchors. The 
elimination of such semantic ambiguity in the scale 
anchors is reflected in the scale reliabilities, the low- 
est of which is 97 (Smith & Kendall, 1963). The 
availability of these behaviorally defined scales dic- 
tated that nursing interviewers and student nurses 
be Ss for this investigation. The sixth scale, Expected 
Level of Performance, was developed especially for 
this study, and was used to secure a rating of overall 
suitability. While its format was similar to the 
scaled-expectation format, the sixth scale differed in 
that summary predictions (eg., I am certain that 
this applicant will be an outstanding nurse) rather 
than behavior examples anchored the scale. Addi- 
tional details of the procedures are given by Hakel 


(1966). 
RESULTS 


The analyses reported here were completed 
in the framework of a double cross-validation 
design to assess the stability of each interview- 
er’s ratings. There would be little sense in 
examining differences between interviews if 
their ratings were unstable. Accordingly, the 


sample was split into two cross-validation 


groups of 27 each. 


Each interviewer’s ratings on the five scaled- 
expectation scales and the overall suitability 
scale were intercorrelated within each cross- 
validation group.* The within-rater intercor- 
relations are uniformly high: Of the 270 
correlations between traits within raters within 
groups, about 80% of them reached or ex- 
ceeded the 5% significance level. Except for 
one interviewer, there was a general failure to 
discriminate empirically among the traits be- 
ing rated. Stated differently, all raters (except 
Number 3) seemed to have structured their 
ratings similarly. 

To examine the stability of each rater’s 
intercorrelations and to test for similarity be- 
tween raters in the magnitudes and configura- 
tions of their intercorrelations, multiple-re- 
gression equations relating the scaled-expecta- 
tion scales to the overall ratings were com- 
puted in each cross-validation group for each 
interviewer (18 equations in all). Once devel- 
oped, these equations were crossed to their 
companion samples, and very little shrinkage 
was observed: The median developmental R 
was .88 (with a range of .79-.92) and the 
median cross-validated 7 was .83 (with a range 
of .58-.90). Next, these equations were ap- 
plied to every other interviewer’s sets of 
ratings in a validity extension study. If Inter- 
viewer I’s regression equation could predict 
Interviewer J’s ratings of overall suitability as 
accurately as it could predict I’s ratings of 
overall suitability, then I and J must have 
structured their ratings similarly. Given large 
shrinkage, the structures must have been dif- 
ferent (since the equations were stable). The 
validity extension correlations are shown in 
Table 1. 

Every interviewer’s ratings of overall suit- 
ability can be predicted as well (and some- 
times even better) by another’s equation as by 


3 These intercorrelations, as well as means and 
standard deviations for each interviewer on each 
scale, and interrater agreement correlations for the 
scaled-expectation scales were computed and have 
been deposited with the National Auxiliary Publica- 
tions Service. Order Document No. 01559 from the 
National Auxiliary Publications Service of the Ameri- 
can Society for Information Science, c/o CCM In- 
formation Sciences, Inc., 909 3rd Avenue, New York, 
New York 10022. Remit in advance $5.00 for photo- 
copies or $2.00 for microfiche and make checks pay- 
able to: Research and Microfilm Publications, Inc. 
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TABLE 1 


CORRELATIONS OBTAINED WHEN EVERY REGRESSION EQUATION IS APPLIED TO EVERY SET 
N JHEN R N N A 
rR TINGS TO PREDICT RATINGS ON EXPECTED LEVEL OF PERFORMANCE 
o A PRE L 


By equation 


dicted | 1A | 1B | 2A | 2B | 3A | 3B | 4A | 4B 


1A 89 | 88 | 82 | 88 | 87 87 88 87 
1B 88 | 89 | 85 | 88 | 84 | 87 | 8 7 86 
2A 84 | 85 | 89 | 85 86 88 87 | 87 
2B 87 | 86 | 82 | 87 | 85 85 86 | 84 
3A 63 | 60 | 67 | 66 | 80 | 70 7 
60 | 57 | 63 | 62 | 70 | 69 | 65 
4A 82 | 80 | 76 | 82 | 77 | 8 | 83 | 83 
4B | 87 | 88 | 84 | 86 | 79 | 89 | 99 | 07 
SA | 69 | 69 | 65 | o7 | 65 | 67 | 69 69 
5B 72 | 71 | 74] T3 | 73 | 75 75 75 
6A 86 | 86 | 88 | 88 | 91 | 87 | 87 83 
6B 77 | 74| 72) 79 | 79] 79 | 79 | 76 
7A 74 | 71 | 72| 75 | 80 | g5 83 84 
7B 88 | 87 | 81 | 86 | 87 | 87 | 88 | 87 
8A 88 | 88 | 84 | 89 | 87 | 87 | 87 86 
8B 83 | 81 | 84 | 84 | 82 | 84| 83 80 
9A 79 | 78 | 79 | 78 | 79 | 79 | 79 77 
9B 80 | 79 | 79 | 81 | 81 |31| gi 


in the "ratings predicted" column ident 
1A in the “ratings predicted" column indicates that 

correlations reported in that row we 
ing Equation 1A (developed on r; 
are italicized. All other entries arı 


Interview. 
ere obtained. The correla 
‘atings of Group A by Int 
e validity extension correl 
his own. The range of the validity extension 
correlations is .38-.91 with a median of .§2 
(recall that the median cross-validated r is 
83). Moreover, all except five of the 288 
validity extension correlations reach or exceed 
the 1% significance level (all exceed the 5% 
level). Nearly 70% of these correlations reach 
or exceed a value of .75, Considering the high 
magnitudes of the developmental, cross-valida- 
tion, and validity extension correlations, and 
recalling the hypothesized relationship be- 
tween interrater agreement and structural 
similarity of postinterview ratings, it would 
be expected that interrater agreement should 
be quite high, 

Between-interviewer correlations were com- 
puted on the overall Suitability scale,* and are 


+The median agreement 
scale are as follows: Conscie 
tional Ability, 31; Skill in 
Organizational Ability, 33; 
Judgment, .36, 


Correlations for each SE 
ntiousness, 21; Observa- 
Human Relations, -16; 
and Knowledge and 


ify the data set to whi 
iewer 1's ratings of appli 
tion in the upper left 
erviewer 1) to dat 
lations. Decimals 


have bee 


vel, 
tions reach or exceed the 5% significance Im 
it is obvious that interrater agreement Table 
e agreement correlations reported a corre- 
2 are no higher, but no lower, than the Ulrich 
lations reported by Wagner (1949) and atrols 
and Trumbo (1965), despite these CO! cess 
and precautions: All interviewers had ser nt 
to the same information from each applica “ 
ratings were made on extremely per 
structed, unambiguous rating scales, an at- 
interviewers shared essentially the eae Tes 
tern of postinterview ratings on the six sca 


Discussion 


At the outset of this investigation it pS 
own that the interrater agreement corre 
tions would be mathematically independent e 
the similarity of intercorrelations among pasi 
interview ratings, It was expected, count 
that they would not be quite so independ, 
empirically. The outcome of “run-of-the-mi" 
agreement correlations despite structured or 
terviews, unambiguous rating scales, “i 
structural similarity of ratings was not at F 
expected. Thus, additional analyses were mai 


kn 


SS! 
we 


i 


i 
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to rule out several potential explanations of 
these findings.* These analyses showed that 
the observed results (high structural similarity 
and low interrater agreement) are attributable 
wholly to halo in the ratings. 

What can be said about interrater agree- 
ment? Clearly, the procedures used in this 
study did not foster higher agreement than 
has been found in other studies (Wagner, 
1949). Structural similarity of the postinter- 
view ratings on the six scales was exhaustively 
studied (a stronger test could be provided by 
using more rating scales), but the brute fact 
is that sharing the same structure in ratings 
by no means standardizes perceptions of job 
applicants. This latter conclusion fits par- 
ticularly well with Passini and Norman’s 
(1966) discovery of a highly stable 5-factor 
structure underlying peer nominations made 
by total strangers. Thus, in studying the 
comparability of judges (raters, interviewers, 
clinicians, etc.) it is not sufficient to show 
that they have similar statistical functions 
(e.g., regression equations) operating in their 
judgments, but that they agree in their judg- 
ments on a given set of stimuli. 

The value of the scaled-expectation scales 
for enhancing interrater agreement was far 
less than expected, especially given Maas’s 
(1965) report of median agreement correla- 
tions of .55 and .69 in two studies. 

Did these interviewers share the 
stereotype of an effective general staff nurse? 
They certainly shared similar patterns of 
postinterview trait rating intercorrelations 
among the six rating scales, but it does not 
follow necessarily that they shared the same 
stereotype. This leaves the possibility that 
jnterrater agreements may be related to simi- 
larity of stereotypes provided that different 
procedures (such as those used by, €g, 


a 

5 Details on these post hoc analyses, done to 
examine (a) capitalizing on chance in small-sample 
multiple regression, (b) systematic (odd-even) versus 
random assignment to cross-validation groups, (¢) 
application of the Spearman-Brown prophesy formula 
to a rating situation, (d) comparability of factor 
analytic and multiple-regression findings, and (e) the 
influence of halo error in ratings (taken as the first 
unrotated principal components factor) on agreement, 


are given in Hakel (1966). 


same 


TABLE 2 


INTERRATER AGREEMENT CORRELATIONS FOR RATINGS 
or EXPECTED LEVEL OF PERFORMANCE 


| 
Inter- $ 2 g 

Sewer! = 21.3 | 4 S 6/7 ;,8 {19 
1 41 22 58 —20 43 43 26 32 
2 05 28 27 14 59 4 23 47 
3 32 07 31 —04 45 38 OL 12 
4 5i 11 19 —03 36 34 50 11 
5 63 14 52 61 02 15 37 32 
6 5 103 25 59 63 34 35 37 
7 30 25 60 40 58 23 30 47 
8 50 16 28 51 64 72 23 18 
9 36 25 38 29 63 18 43 41 


Note.—Nine interviewers, 27 applicants in each group. 
Group A correlations are below diagonal; Group B are above 
diagonal. Decimals have been omitted. 


Hakel, Hollmann, & Dunnette, 1970) are 
adopted for measuring stereotypy. 

In the last analysis these data seem to re- 
affirm the importance of the first major find- 
ing of the McGill (Webster, 1964) investiga- 
tions. Springbett found that first impressions 
are remarkably potent, and that in the vast 
majority of interviews (73% overall), the 
first impression is also the final one (Webster, 
1964). If first impressions are so stable, then 
the results of this investigation could be the 
consequence of interrater disagreement on first 
impressions. Thus, it would be desirable to 
make an analysis of both agreement on first 
impressions and also changes in agreement 
over the course of the interview. 


REFERENCES 


BrncuaM, W. V. D., Moore, B. V., & Gustap, J. W. 
How to interview. (4th ed.) New York: Harper & 
Sons, 1959. 

Haxet, M. D. Perceiver differences in interpersonal 
perceptions: An analysis of interrater agreement on 
scaled-expectation rating scales in an employment 
interview setting. Unpublished doctoral disserta- 
tion, University of Minnesota, 1966. 

Haxet, M. D., HorLmany, T. D., & DUNNETTE, M. 
D. Accuracy of interviewers, Certified Public Ac- 
countants, and students in identifying the interests 
of accountants. Journal of Applied Psychology, 
1970, 54, 115-120. ý 

Mms, J. B. Patterned scaled-expectation interview: 
Reliability studies on a new technique. Journal of 
Applied Psychology, 1965, 49, 431-433. 

MAYFIELD, E. C. The selection interview—A re-evalu- 
ation of published research. Personnel Psychology, 
1964, 17, 239-260. i 


448 


Mayrietp, E. C., & Cartson, R. Selection —_ 
decisions: First results from a long-term researc 
project. Personnel Psychology, 1966, 19, T 

McMurry, R. N. Validating the patterned interview. 
Personnel, 1947, 23, 263-272. ee ee 

e Ni x . T. A universal - 

Eeoa Ae o ae Journal of Person- 
ality and Social Psychology, 1966, 4, HD: 

Sxıts, P. C., & KENDALL, L. M. Retranslation of 
expectations: An approach to the construction of 
unambiguous anchors for rating scales. Journal oj 
Applied Psychology, 1963, 47, 149-155. 


MITON D. HAKEL 


UrrIcH, L., & Trumso, D. The selection interview 
5 Pai a x 
since 1949. Psychological Bulletin, 1965, 63, 100- 
116. 7 E 

Wacyer, R. The employment interview: A qnel 
summary. Personnel Psychology, 1949, 2, rS a 

Wesster, E. C. Decision making in the ge ia r 
interview. Montreal: McGill University Industria 
Relations Centre, 1964. 

Wricut, O. R., Jr. Summary of research on -ng 
selection interview since 1964. Personnel Psyc: 
ogy, 1969, 22, 391-413. 


(Received July 1, 1970) 


Manuscripts Accepted for Publication in the 


Journal of Applied Pyschology 


Do Cultural Differences Affect Job Satisfaction? : John W. 
Pennsylvania State University, 223 Boucke Building 


Employment Interviewers’ Errors in Processing Pı 
Department of Psychology, Wayne State Univ 


Relationship between Intrinsic Needs an 


Psychiatry, Tel-Aviv University, Tel-Aviv, Israel. 


Effects of Instructions on Measures of State and Trait 
Spielberger, and Ronald M. Bale: Head, Clinical 


Aerospace Medical Center, Pensacola, Florida 3251 


Value of Peer Nominations in Predicting Life Insurance 


Manpower Research, Life Insurance 
Connecticut 06105. 
Hierarchies of Scientist Goal Objects: Indivi 


McCarrey,* and Shirley A. Edwards: 
Commission of Canada, Tower 


Use of Subjective Predictors in 
Keeley: Associate Professo 
Ohio 43403. 


A Longitudinal Study of the Effec 
H. Hand, and John W. Slocum, Jr.* 
223 Boucke Building, University Park, 
Children’s Reactions to Second-H. 
Louisville, Louisville, Kentucky 40208. 
Effects of Perceptions of E 


r, Department of Psyt 


quity and Inequity 


1 on Worker Performan 
Marvin D. Dunnette, and Dale O. Jorgenson: Department of 


Indiana 47907. 


Aeromedical Institute, Federal 


* Asterisk indicates author for whom address is supplied. 


Agency Man: 


idual Characteristics and Pi 


sychology Service 
Ottawa, Ontario, C 


Personnel P 


chology, Bowling 


t of a Human Relations 


acking Performance during 
T, and Frederick E. Guedry 
l Aviation Administration, P 


p E inistration, 
Slocum, Jr.*: College of Business Administra 


, University Park, Pennsylvania 16802. 


* 
F p i H ann, 
ositive and Negative Information: Thomas D. Hollm 
ersity, Detroit, Michigan 48202. 


d Women’s Persistence at Work: 


ont of 
Elchanan I. Meir*: Department ‘ 


“harles D: 
Anxiety in Flight Students: Steven F. Bucky,* 4 
Psychology, Naval Aerospace Medical Institute, 
2: 


Nava 


f 
5 - qe. Director 0 
Sales Performance: Eugene C. Mayfield* : Di 
agement Association, 170 Sigourney Street, 


Hart ford, 


Michael W. 


erformance Correlates: lic Service 


and Research Centre, Pub 
anada, 


ont 
ne 3 artmen 
ealities: Benjamin Schneider*: Depa 


ark, Maryland 20742. 


* 
pon Psychomotor Performance and Mood: Jorma Kuusinen, 
ology, University of 


Regression Analysis for Polic; 


and 
Jyvaskyla, 40100 Jyvaskyla 10, Finland. 


+ i s M. 
y Capturing: Michael E. Doherty,* asd Sn, 
Green State University, Bowling 


ert 
Training Program Managerial Effectiveness: Herber 
: College of Business A n oo 
, Pennsylvania 16802. 


and Tobacco Smoke: Paul Q 


gram or i aerei 
dministration, Pennsylvania State Unive 


sarat OF 
‘ameron*: Department of Psychology, University 


* 
, ichard. 
ce and Satisfaction: Robert D. prita e, 
Psychology, Purdue University, Lafay 


Angular Acceleration : Wi 


. “i 
7, Jr.: Chief, Psychology Laboratory, AC-11, Cl 


- O. Box 25082, Oklahoma City, Oklahoma 73125- 


i 


> 


‘5 
H 


Journal oj Applied Psychology 
1971, e 55, No. 3° G40 A37 


TAYLOR-RUSSELL TABLES FOR DICHOTOMOUS 
CRITERION VARIABLES 


NORMAN M. ABRAHAMS, EDWARD F. ALF; anv JOHN H. WOLFE 


Naval Personnel and Training Research Laboratory, San Diego 


The Taylor-Russell tables have been widely used to evaluate the usefulness of se- 
lection tests. They are inappropriate, however, for evaluating predictors used to 
discriminate discrete normally distributed criterion groups where a point-biserial 
correlation coeficient is the proper index of validity. This article provides a set of 
tables, similar to the Taylor-Russell tables, for evaluating the usefulness of selec- 
tion tests where the assumptions require point-biserial validity coefficients. A 


graphic illustration o! 


Taylor and Russell (1939) denounced the 
practice of evaluating the effectiveness of pre- 
dictor variables solely on the basis of their 
validity coefficients. In this significant publi- 
cation they stressed the critical role of base 
rates of satisfactory criterion performance and 
selection ratios in conjunction with validities 
in assessing the practical significance of pre- 
dictors. To facilitate this type of evaluation, 
they prepared tables for determining the ef- 
fectiveness of a selection test by estimating the 
proportion of selectees that are considered 
satisfactory when the test validity, base rate, 
and selection ratio are known or assumed. 

The Taylor-Russell tables were developed 
from normal bivariate correlation surfaces as- 
suming that the “satisfactory-unsatisfactory” 
categorization represents a dichotomization of 
an underlying normally distributed criterion 
variable. Thus, the proper index of the tests’ 
validity would be either a product-moment 
correlation computed on a continuous criterion 
distribution or a biserial correlation computed 
on a dichotomized underlying normally dis- 
tributed criterion. 

There are times, however, when the criterion 
variable represents either a true dichotomy or 
two distinct continuous normal distributions. 
Under these conditions, of course, the assump- 
tion of a single underlying normal distribution 
required for biserial 7 is not met, and point- 
piserial 7 is a more appropriate statistic. For 
example, in evaluating the effectiveness of an 


——_ 
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Abrahams, Personnel Measurement Research Depart- 
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2 Also at San Diego State College. 


f the rationale for developing the tables is also presented. 


occupational interest measure in discriminating 
between two distinct occupational groups, such 
as mechanics and librarians, it would be haz- 
ardous to assume the two groups represent an 
underlying normally distributed continuum. 
Similarly, in evaluating masculinity-femininity 
scales, or scales designed to discriminate clini- 
cal groups, the assumptions of point-biserial 
r may be more appropriate than those of bi- 
serial 7. 

In such cases, where a point-biserial r is re- 
quired, the Taylor-Russell tables are not ap- 
propriate for assessing the effectiveness of a 
predictor. If point-biserial correlations are used 
to enter the Taylor-Russell tables, the esti- 
mated percentage of satisfactory selectees will 
be in error. In general, the further the selection 
ratio and base rate are from .50, the less accu- 
rate the estimate of percentage satisfactory 
becomes. For example, if one entered the 
Taylor-Russell tables assuming a base rate and 
selection ratio of .50 and a point-biserial r of 
.60, the estimated percentage of satisfactory 
selectees is .70. In reality, however, the correct 
estimate is .77 when the validity is a point-bi- 
serial. Assuming a base rate and selection ratio 
of 10 and a validity of .60, the percentage 
satisfactory from Taylor-Russell is 39, in con- 
trast to the actual value of .73. The extent to 
which the Taylor-Russell tables have been used 
inappropriately with point-biserials is, of 
course, unknown. 

The present article provides a set of tables 
parallel to the Taylor-Russell tables for use 
with point-biserial correlations and the ration- 
ale for their development. 

A graphic illustration of the method of de- 
termining entries in the original Taylor-Russell 
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Fic. 1. Theoretical normal bivari 
by dichotomizing the te 


tables is given in Figure 1. This figure depicts 
a bivariate plot for a selection test with 
ity of .50. The horizontal axis, SS’, represents 
a particular base rate of satisfactory criterion 
performance, in this case, 50%. The vertical 
axis, TT’, denotes the cutting score for a 50% 
selection ratio. Thus, Quadrant A contains 
satisfactory employees above the cutting score 
(true positives), Quadrant B contains unsatis- 
factory employees above the cutting score 
(false positives), Quadrant C contains unsatis- 
factory employees below the cutting score (true 
negatives), and Quadrant D contains satis- 
factory employees below the cutting score 
(false negatives). The entry in the Taylor- 
Russell tables for this particular combination 
of validity, base rate, and selection ratio is the 
proportion 4/(4 + B), or the “true positive” 
hit rate. 
An analogous illustration for point-biserial is 
depicted in Figure 2. In this case the validity, 
represented by a mean difference of 1,34 
standard deviation units, is a point-biserial y 


a valid- 


HIGH 


— 1 secte — 
T 


ate distribution divided into four categories 
st and criterion score distributions. 


as ‘ «vated by the 
of .55; the base rate is 50%, as indicated bY 


anes and 
equal-sized upper and lower distributions; m 
the selection ratio, represented by the Ve t-bi- 
axis TT’, is 60%. For this particular point wt 
serial r, base rate, and selection ratio w 
pected proportion of satisfactory selectees, the 
is, A/(A + B), is determined and becomes 
entry in the new tables. 00 
For each point-biserial v, ranging mom 9 
to 1.00, and base rate, ranging from .05 to - ae 
the separation of group means in terms R 
standard deviation units was determined. 
effect, a combined distribution was develope 
and each selection ratio, varying from 05 6 
95, was imposed. For each combination , n 
point-biserial validity, base rate, and select 
ratio, the Proportion of successful selectee* 


A/(A + B), was computed. These values ave 
Provided in Table 1, 


; 1 
There are 11 Segments, corresponding tO 1 


values (.05, .10, .20, .30, .40, .50, .60, -70, A 
-90, and .95) of Proportion of employees sie 
sidered satisfactory in Table 1, The columns 


TAYLOR-RUSSELL TABLES FOR DICHOTOMOUS CRITERION VARIABLES 


SATISFACTORY 


UNSATIS FACTORY. 


LOW: 


z 
l 
| 
| 


1 


TEST SCORE 


451 


HIGH 


SELECTED 


þa— REJECTED 


Fic. 2. Theoretical normal univariate 
two categories by 


each segment represent 11 values of the selec- 
tion ratio: .05, .10, .20, .30, .40, .50, .60, .70, 
80, .90, and .95. The rows of each segment cor- 
respond to values of point-biserial z from .00 to 
1.00 by intervals of .05. The entries in the table 
indicate the proportion satisfactory among 
those selected. 

To illustrate the use of the table, recall the 
previously cited example with a point-biserial 
+ of .55, a Selection ratio of .60, and a base rate 
of .50. To obtain the expected proportion of 
satisfactory selectees, first locate the sixth seg- 
ment in which the proportion of employees con- 
sidered satisfactory is 50. Next, locate the row 
for a point-biserial r of .55 by looking down the 
first column. Finally, locate the column for a 
selection ratio of .60, and read the entry at the 
intersection of this row and column. For this 
example, the expected proportion of satisfac- 
tory selectees is .696. 

In some instances it may be valuable to 
determine the proportion of satisfactory em- 
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>| 


distributions of two discrete criterion groups each divided into 
dichotomizing the test score distribution. 


ployees below the cutting score of a particular 
selection ratio, that is, D/(D + C), or various 
other proportions of interest such as (A +0 
(4+B+C+ D). 

Once the proportion of satisfactory selectees 
is determined, the proportion of the total sam- 
ple falling into each of the four quadrants 
can be found by the following equations : 


= (PS)(SR), 
(SR) — A, 
1 — (BR + B), and 
(BR) — A 


i 


ll 


A 
B 
Cc 
D 


i 


where: 


A-D represent the proportion of cases in 
each of the four areas shown in Figure 2, 


PS = proportion satisfactory among those 
selected as indicated by the table, 

BR = base rate of satisfactory performance, 
and 

SR = selection ratio. 
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TABLE 1 
PROPORTION WHO WILL BE SATISFACTORY AMONG THOSE SELECTED, FOR GIVEN VALUES OF 
THE PROPORTION OF PRESENT EMPLOYEES CONSIDERED SATISFACTORY, THE 
SELECTION RATIO, AND POINT-BISERIAL 7 
Selection ratio 
Tpb 
-050 100 -200 -300 400 500 600 .700 800 900 -950 
Proportion of employees considered satisfactory = .05 
00 | 050 | .050 | .050 | .050 | .050 | .050 | oso | oso | oso | oso 050 
05 | .076 072 066 063 061 .059 057 .055 | 054 .052 2051 
10) 112 | 099 | .086 | .078 | .072 | 1067 | ‘063 | coso | ‘oso | 053 | -052 
15.) AS y ist | 206 | 092 | ioaz: | 075 | loos | ‘oes | “oss | cosa. | s052 
20) 23 | 10 | 128 | o7 | 952 | 2 | o3 | ‘Gee | ‘ogo | ‘oss | 082 
25 | S j Pt) ASE ai cot | soar | “ove | oes | oer | cose || 052 
| a | 25 | £78] 4153 | aoo | 1002 | cor | ‘oo | ‘oe | oss | 098 
“| ‘ew | oa) Bej ae) ts | cos! oe | om | cee | mee | oe 
40} 519 | 348 | 212 | 1153 | io | ‘007 | ‘ogo ‘o71 | 062 | .056 | -053 
| on l oe | Sl aw j at | ooo) es] on | oe | oe | 0 
| oe) ot] Se) ta) gee | S| ee | Ge | | ae) ee 
ae | G o 460 | 285 | aes: | tba | op | “Ges ‘ont | ‘062 | 056 | -053 
| | A | 2) ie | ces | io | pas 071 | 1062 | 056 | -05 
S ae | S| 250 | ter | Sas | ol oe ‘om | ‘o¢2 | 1056 | -05 
EF a 20 as 125 | 100 | .083 | ‘071 | ‘o62 | (056 me 
0] 996 | 300 | 250 | ‘167 | “igs | 109 | 083 | om | ‘062 | 056 | -053 
85) 996 | 500 | 250 | “167 | ‘123 | ‘tgp | 083 | 071 | 062 | 056 | 955 
90] 996 | iso | 250 | ‘gr | 225 | 100 | 083 | ‘ori | ‘062 | ‘056 | 983 
95 | 1.000 | ‘so | 230) ‘ter | 128 | -100 | ‘oss | ‘on | w2 | 056 | %5 
E a ; -125 -100 7 56 ; 
100 | 1.000 | .soo | (250 | “167 | ‘iz | om | o2 j| 2 053 
F 100 | 083 | ozi | ‘062 | .056 | ° 
Proportion of employees considered satisfactory = .10 
oo | 100 | 100 ra 
i : f 100 
05} 134 | ‘120 | “i909 ie Re 100 | 100 | 100 | 400 | .100 10 
iel HEJ 162 | a | uaz | ia) 32| 40| “107 | “tos | “103 mn 
AS | 226 200 | 173 | Tl “19s or 118 | i4 | ¿109 | .105 “104 
35 { ae | 23 | 202 | azz | ago | 13S | 127 | l cis | to? | o 
3 3| 20] 232) ‘98 | ‘ins | “MS | 3s] Ss | ius | 108 mn 
30) .422 343 .264 -219 “180 -157 142 .130 119 .109 he 
a | ae | S m | ao | | E lias | ctor | “ito | 108 
Aj 888 | 400 | 330 | zo) ig | 1S | 153 | cas | ig | ig | 108 
H 676 | .525 | 364 | iag | “gpg | “182 | -158 | “130 ‘3 | tt | 1% 
50 | .765 | .592 | 397 | ‘995 Sg ae we 140 | (424 ‘ii | -403 
ol oy | | ær | 309 | Zo] 3] Jj ial ns] ai -105 
a) A gat | as | ay | oe | Dej tos | ial l i ADS 
Oe | 966 | 800] 476 | [307 | “agg | 198 a) 143 125 | tt | -102 
70] .991 | 864 | “409 i 2 109) 166 | Ta | ‘193 | lita | 105 
#5 | $99 | 920 | aor | Say | 249 | 200 | “ter | ah | 128 | tt Sio 
$0: 1.000 |, 564 | 500 | “ase | Sa | 200 | toy | tay | Oe | ay | E 
85 | 1.000 | .990 | “00 | “333 | “250 | -200 | 167 | ‘ig 125 u1 | -105 
09 | 1.000 | .999 | soo | ‘333 | 20 | 20] a67 | cia | Ga | ii | 10 
95 | 1000 | 999 | So | 33 | “22 | -200 | i6 | ia | ‘igs | a | d0 
1.00 | 1000 1.000 3 ; 20; | 200 | ae a il 3 : -105 
500 333 250) z -143 .125 Alt z 
pee "i sch, Baas : 200 | 167 | i43 | ‘195 | ii | 405 
— E ' | 
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Selection ratio 


Tpb 7 
-050 100 -200 -300 400 -500 | -600 -700 -800 -900 -950 
| 1 
Proportion of employees considered satisfactory = -20 
.00 -200 .200 -200 .200 -200 -200 -200 .200 .200 -200 .200 
.05 .244 231 .229 .224 -220 -216 -213 210 | .207 .204 .202 
-10 -293 277 .260 -248 -239 -232 .225 .219 213 -207 -204 
AS 347 320 .292 274 -260 -248 -238 .228 .219 210 .205 
.20 .405 .368 .326 .300 -280 -264 .249 .236 224 .213 -207 
.25 .468 ALS 363 328 301 -280 -261 244 .229 -215 .208 
.30 535 .472 401 -356 322 -295 :272 .252 .234 -217 -208 
.35 .605 .529 441 -385 .344 „311 .283 .259 .238 .218 -209 
40 676 589 ASS 416 -366 .326 -293 .265 .241 -220 -210 
AS 747 652 529 448 | -387 340 302 -270 244 .220 -210 
50 815 717 577 80 409 354 310 275 246 .221 -210 
205) 876 782 627 514 430 367 318 -279 248 .222 .210 
-60 .927 .846 -680 547 450 378 324 -282 | .249 222 210 
65 964 904 -136 .581 468 387 -328 .284 .249 222 .210 
-70 .986 .952 794 612 482 394 331 -285 -250 .222 .210 
15 996 983 852 | .639 492 -398 333 -286 -250 .222 .210 
.80 | 1.000 .997 .909 .657 498 400 333 -286 .250 .222 210 
.85 | 1.000 1.000 .958 -665 .500 -400 .333 -286 -250 222 210 
‘90 | 1.000 | 1.000 .990 667 500 400 333 | .286 250 322 | 210 
.95 | 1.000 1.000 990 -667 -500 -400 -333 -286 -250 +222 -210 
1.00 | 1.000 1.000 1.000 667 -500 400 333 .286 -250 .222 211 
| 
Proportion of employees considered satisfactory = .30 
.00 -300 -300 300 300 -300 -300 300 | 300 300 -300 300 
05 .349 341 333 327 322 318 315 | .31l | 308 304 302 
‘to | aor | 385 | .367 | .354 | 345 | 337 | 329 | 322 316 | 308 | .305 
ws 456 430 402 383 368 55 344 | 333 .323 312 307 
20 .513 478 438 412 .391 374 358 | 344 -330 316 308 
v25 .572 .528 .476 .442 .415 .393 .372 354 336 319 310 
30 032 579 -516 AT3 440 12 387 364 343 322 | 311 
35 692 632 557 -506 465 A431 401 374 349 324 312 
40 751 -686 -600 540 492 451 415 383 354 327 | 314 
45 .808 -740 -646 576 519 471 -429 392 -359 329 314 
l gwo | aoe | .693 | ael S47 | 492 | 483 | -40L | 363 | 2380) S18 
"35 | 90s | g | .742 | 1653 | .577 | .512 | 456 | 408 | 36r 331 315 
60 | 942 .804 794 696 608 532 468 -415 370 332 | 316 
(65 | -970 936 846 74 “639 1552 479 420 372 333 | .316 
-10 .987 -967 .897 .789 671 -570 .488 .424 S74 333 316 
45 996 988 944 839 -102 .584 .495 427 315 333 | 316 
-80 .999 -997 .979 .891 -128 .594 498 428 375 333 | 316 
(85 | 1.000 | 1.000 997 .941 745 2599: -500 A28 315 333 | 316 
.90 | 1.000 1.000 | 1.000 .982 «750 -600 -500 429 | .375 333 316 
‘95 | 1.000 | 1.000 | 1.000 .982 .750 -600 -500 429 | 375 333 316 
100 | 1.000 | 1.000 | 1.000 | 1.000 | -750 600 | .500 | .429 '375 | 333 | 316 
Proportion of employees considered satisfactory = .40 
00 400 400 -400 .400 -400 -400 .400 -400 i 
‘og | ast | 444 | 435 | 420 | 424 | 420 | 416 | 412 | ae mn ae 
fo | Sos | 488 | 470 | 458 | 448 | 439 | 432 | 424 | a7 “409 | 1405 
ig | 357 | issa | 306 | 487 | 472 | 459 | 447 | 436 | 425 s4 | 1408 
20 | oll -580 542 S17 -497 | 479 463 448 | 433 | 418 “A10 
| F 
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TABLE 1—(Continued) 


Selection ratio 


-950 
g 050 100 200 300 400 -500 600 .700 800 -900 
051 R : ‘ 
Proportion of employees considered satisfactory = .40 E 
.412 
25 | .665 626 580 | .548 | .522 | 500 | 479 460 441 422 
.25 i -62 


S 5 5 496 
30 17 -673 618 -580 -548 -521 “4 
5 7 658 -613 .575 542 512 417 
al “ne | yee | e 647 | 603 | s64 | (529 | “496 | 464 aa | 
ve 860 | sit | 740 | ‘692 | ‘632 | ‘se3 | “sag 508 | A| 436 | A8 
Sal ‘900 | e | 7a2 720 | 663 | 612 | (564 | ‘519 | 477 =| 
> 933 | 894 | 1824 | ‘758 | “606 | ‘637 582 | 530 | 483 | . 
‘60 | .959 | .929 | 1866 | 799 730 


: 414 
472 49 426 : 
84 456 429 ALS 


0 
| eoo j | ae | ae) a20 
65 | .978 | 958 | 906 | s | (768 | ‘on | ‘617 | ‘ss1 | “403 | ` 
‘70 | 990 | .979 | (933 | ‘ggs | ‘go7 


F 719 | 1634 | “360 | (496 me 
75 | 997 | .992 | .972 | .928 | ‘g50 | “74g | “uo 506 | 498 ‘tu | aa 
‘80 | .999 | .998 | ‘ont | ‘966 894 | 774 | 660 570 | .500 | a 
‘85 | 1.000 | 1.000 | “99g 992 | 939 792 665 S71 500 Sa || Me 
90 | 1.000 | 1.000 | 1000 | 1.000 | ‘979 $00 | 667 | 571 | 500 a 
95 | 1000 | 1.000 | 1.000 | 1.000 | (979 | 800 | ‘667 | ‘sri 500 ee 
1.00 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | ‘soo | ‘ger Sm | soo |. 


Proportion of employees considered satisfactory = .50 


.500 
i ; ; 500 | 300 | 500 | 500 -500 500 500 | “03 
.05 552 544 +535 -529 524 520 516 512 509 505 "505 
oo) Oe | S| om | ee | Ge) | ee 525 | 518 | -510 | “fog 
P3 | E33 | SBi | 605 | 587 | “573 | “Sen S49 | 538 j 526 | 515 | “ayy 
aC 02 | -67k | 60 | ote | son | 33 365 | 550 | 1535 | 519 
Zij 148 | .716 | 676 | 647 | “603 | “hs 582 | 563 
| 2. | | a) el eo | Se 599 | 1576 
| ee | ee | ae | | ae | ae 617 
40) 872 | 835 | 783 | “744 


P .520 
í 103 | 669 | 636 | “Gog 571 537 | S01 
45) 905 | 871 | sig | (73 | “39 693 | 655 | 617 | ‘sap | ‘5a 523 
50] 933 | 903 | 1853 | ‘s07 | “765 “18 | 675 | (631 | “sag | ‘545 ‘524 
55 | .956 .932 886 .840 .793 .745 696 646 507 548 325 
00 | 974 | 956 | (918 | "875 826 | 773 F 
.65 -986 975 946 


9 
18 | ia 004 | 551 | > 
F ‘908 | 861 | ‘goa |” e “ a : ‘526 
70) 994 j .988 | (969 | o0 | aos 836 | fel | s | ‘giz | “asa | = 
968 | 1932 | | j -700 | 622 | -555 
996 | 988 | (966 | o09 810 : 

85 | 1.000 | 1.000 | “g99 


7 555 526 
i 8 | 990 | 947 | ‘gay | 70 | od | ; 
‘oy | 1000 | 1.000 | 1.000 | tooo | ‘oop 


% 26 
. 1.000 | ‘999 | g |: ao) 326 
1.00 | 1.000 | 1.000 | 1 000 | 1.000 | 1.000 1.000 '833 T re "556 526 


sis me 
Proportion of employees considered satisfactory = 60 


Ee ea 
-600 -600 


628 an -600 -600 f 603 
i i sw] ` i s| 3 
oa SH) oe | ee | SA 639 | ‘eo | 912 | 609 | “605 

4151 742 | "723 | 700 f 625 


.606 

-684 | 671 59 ; 5 618 610 | “608 
| ae) | es | oe | on | | 2 | Se] oe 

4a) ‘824 | 798 | “toa | “rag | 995 0) sat | a | oe | Se) oe 

O| «859 | 835 | 795 | 768 743 | 721 | “609 ay | “ase “630 | -611 

35 | 891 | 86t | 826 | 706 768 | a2 | min | a | oe | ee 

| tS | 803 | ass || “aos 793 | 764 | 735 706 a | ca | oo 
45 | 942 | 920 | (393 | ‘ast al a a i oss | s46 | -624 
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TABLE 1—(Continued) 
Selection ratio 
Tpb 
.050 .100 .200 300 400 -500 -600 -700 800 -900 -950 
Proportion of employees considered satisfactory = .60 
.50 .961 .942 .910 878 846 -812 -776 737 695 650 626 
55 .976 961 934 -905 872 837 -197 .754 -706 655 -628 
.60 986 976 955 -930 -900 863 .820 771 .716 -659 -629 
.65 .993 -987 .972 .953 .926 .891 845 .789 727 -662 .630 
-70 .997 .994 .986 .972 .951 .919 .872 .808 .736 664 631 
15 999 -998 994 987 973 948 -900 826 743 .666 631 
.80 | 1.000 1.000 998 996 -990 974 930 843 748 -666 -632 
85 1.000 1.000 1.000 .999 998 992 -960 854 750 667 632 
.90 | 1,000 1.000 1.000 1,000 1.000 1.000 986 857 -750 -667 -632 
.95 | 1.000 1.000 1.000 1.000 1.000 1.000 -999 .857 .750 .667 .632 
100 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 1.000 | 1.000 .857 -750 667 -632 
Proportion of employees considered satisfactory = .70 
.00 -700 -700 -700 -700 .700 -700 -700 -700 -700 -700 -100 
.05 -746 .739 .732 726 .722 718 715 712 -708 705 .703 
10 787 .776 .762 752 744 737 -730 723 717 -109 .705 
AS -826 -810 791 TT -766 -155 TAS -736 725 714 -708 
.20 .860 842 819 802 787 774 -761 748 735 720 AiL 
125 890 871 -846 .826 809 -793 -177 -761 -144 -725 714 
30 916 897 871 849 -830 .812 .793 774 154 731 718 
oo 938 -920 894 872 852 831 810 788 -764 737 721 
AO .956 .941 .916 894 873 851 828 803 775 743 724 
AS 970 958 .936 .915 894 871 846 818 -786 749 at 
50 981 972 953 935 O14 891 865 834 798 755 -730 
55 .989 .982 .968 .952 934 912 885 851 811 761 732 
60 994 -990 .980 -968 .952 .932 -905 .870 .823 -166 -7134 
.65 .997 .995 .989 981 .969 952 .926 889 836 771 735 
-70 .999 .998 .995 .990 .982 .970 948 -909 849 774 .736 
.75 | 1.000 1.000 | 1.000 996 992 984 968 931 861 .176 737 
80 | 1.000 1.000 1.000 .999 998 994 .986 953 -870 778 737 
85 | 1,000 1.000 1.000 1.000 1.000 999 997 975 874 778 AOT 
.90 | 1.000 1.000 1.000 1.000 1.000 1.000 1.000 -992 .875 778 s187 
.95 | 1.000 1,000 1.000 1.000 1.000 1.000 1.000 .992 .875 778 Py fi 
1,00 | 1.000 1.000 1.000 1.000 1.000 1.000 1.000 1.000 875 778 737 
Proportion of employees considered satisfactory = .80 
00 800 .800 -800 -800 .800 -800 -800 -800 -800 -800 .800 
05 839 833 827 823 819 816 813 810 807 804 -802 
A0 872 864 852 844 838 832 .826 821 815 809 805 
AS 901 -890 876 865 856 848 840 832 +823 813 -808 
20 .926 .914 .898 .885 .874 -864 853 843 832 819 S11 
25 .946 -934 .917 .904 .891 -880 -867 855 841 824 814 
30 962 951 .935 .921 .908 .895 -882 867 .850 .830 818 
35 O74 965 950 937 924 911 -896 .879 -860 .836 .821 
“to | 983 | .976 | .964 | .952 | .939 | .926 | 910 | .892 871 "s43 | 1925 
45 .990 984 .975 964 953 940 925 -906 882 850 829 
‘50 | .994 991 984 975 966 954 .939 ‘920 ‘894 ‘857 A 
‘55 | .997 ‘995 | .990 | .984 | .977 .967 .953 .934 | .907 865 | 836 
60 999 -998 -995 .991 -985 978 966 949 920 ‘872 "838 
65 | 1,000 | 1.000 | 1.000 | 1.000 | .992 | .987 | .978 | 963 | 934 878 | .840 
70 | 1.000 1.000 1.000 1.000 1.000 994 -988 977 948 ‘884 ‘S41 


456 N. M. Asranams, E. F. ALF, anp J. H. WoLFE 
TABLE 1— (Continued) 
| Selection ratio 
Tp | = - — = —— 
| oso | .100 | .200 | 300 | 400 | .soo | .600 | .700 | soo | 900 | 950 
Proportion of employees considered satisfactory = .80 
i or eee Ls a. th eas i E mam or 
73 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | .995 | .oss | 63 | ss7 | 842 
.80 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1,000 999 | 996 977 „888 842 
85 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1000 | 999 | 990 | 889 | 842 
.90 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 .998 „889 | 842 
95 | 1.000 | 1.000 j 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 998 | `ss9 | 842 
1.00 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1000 | 1o00 | 1.000 | 889 | 842 
Proportion a of f employ ees considered sieb = .90 
00 | 900 -900 -900 | 900 .900 900 | 900 900 | .900 .900 900 
05 | 928 | «924 | 920 | 917 | Sis | i912 | ‘ono 908 | 906 | .903 | .902 
JO j 949 | 944 | 937 | 932 | 1928 | ‘924 | 920 | ‘016 | 912 | ao | 904 
IS | 1966 | 960 | .952 | 916 | (920 | [935 | (930 | ‘924 | “ois | Di | 907 
| ae ue | 265 | 958 | 952 | 946 | w40 | 1933 | 925 | 916 | 910 
‘ol 992 | og | go | 269 | 96 | 957 | 950 | ‘o42 | 933 | i921 | 913 
35 | as | ‘903 | ‘ong | 28 | 972 | 966 | 950 | ost | igar | 927 ay 
“0 | (908 | ‘ope | 2 | 985| 980] 975 | ‘968 | ‘sso | `osg | i933 | 92 
oe ae be | -990 | .987 | 982 | 976 | 968 | (958 | .940 a 
50 | 1.000 | 1.000 1.000 ‘000 | ‘gos | 988 | 983 | 976 | 966 | 987 985 
S toe | tag | Se 1.000 | 995 | 993 | asg | og4 | 974 | 955 eae 
60 | 1.000 | 1.000 | 1000 | roop | 1002 | 1000 | isoa | woo | ‘os2 | 962 943 
60 | 1 ‘ i | 1.000 | 1.000 | 1.000 | 1000 004 .989 910 | one 
65 | 1.000 | 1.000 | 1.000 | 1000 946 
70 | 1.000 | 1.000 | 1.000 | 1000 ‘ee me | ARO | ae ee 947 
0j 1. : A : í .000 | 1.000 | 1.000 .998 .985 ‘947 
75 | 1.000 | 1.000 | 1.000 | 1.000 | 1o00 SAT 
} i. i 1.000 | 1.000 | 1.000 999 | 991 7 
80 | 1.000 | 1,000 i à S O47 
85} 1000 | 000 | ie | to 100) 1 000 1.000 | 1.000 | 1.000 po 947 
90 | 1.000 | 1.000 | 1000 | 1000 | ro f ton | 100 | 1000 947 
: ; : J 000 | 1.000 | 1000 1,000 | 7" 
95 | 1.000 | 1.000 | 1.000 | 1000 | 10 ; 1.000 | 1.000 947 
| L . -000 | 1.000 | 1.000 | ooo | ar 
1.00 | 1.000 | 1.000 -000 1.000 1.000 1. € 
p | 1.000 1000 | 1.000 | 1.000 | 1.000 | 1ooo | 1000 | 1.000 | 9 
= ee Proportion of employees considered satisfactory = .95 
00) 950 | 950 | os sal aaa oe = 
05 | 969 | (967 on 950 | 950 | 950 | 950 | oso | oso | 950 oa 
10) 982 | ‘979 | ‘ors | ‘gon | 209 | 959 | 957 | ‘956 | ‘osa | 952 piers 
15| 990 | 987 | “ogy | “og ‘970 | 967 | .964 | ‘962 | ‘gs9 | 955 056 
20) 995 | (993 | ‘900 | “oer | 278 | 975 | 972 | iosa | ‘os | 950 | 956 
25 | 998 | 996 | ‘oog | “opp | 284 | 982 | 978 | oza | ‘oro | (963 | 959 
30| 999 | oog | ‘907 | ags | 229 | 987 | oga | ‘ogo | ‘ors | 908 | -962 
35| 1000 | 1000 | soon | 23 S94 | 992 | 980 | oge | ‘ogi | 973 | 966 
40 | 1.000 | 1.000 | tooo | co | 22% | 995 | isos | ‘ooo | ‘oss | cove | -970 
45) 1.000 | 11000 | 11000 | 1200 | 1000 | 997 | ‘906 | ‘ong | ‘oe | cogs | 975 
‘50 | 1.000 | iomo | 12 1.000 | 1.000 | 1.000 | 1000 997 | ‘904 ogg | .979 
55 | 1.000 | 1000 Lo ee 1.000 | 1.000 | 1.000 | 1o00 | ‘o97 | 1992 | -984 
63 | £002 | 1.000 | doo | Loo | 1000 | 1.000 | tooo | 1000 | io | -s96 | -988 
Soj pee | $600 | zooo; | To 1000 | 1.000 | 1.000 | 1000 | rooo | loos | -992 
ri TA 1000 | 1.000 | i000 | 1 nO 1.000 | 1.000 | 1.000 | 1.000 | 099 | .995 
75 | 1.000 | 1.000 | tooo | ! wog | 100 | 1.000 | 1.600 | 1000 | 1000 | .998 
80 | 1.000 | 1000 | 4 000 | 1.000 | 1000 | 4 000 | 4 999 
‘85 | 1.000 | 1.000 RR ton 1.000 | 1.000 | 1.000 1.000 1000 1.000 1,000 
90 | 1.000 | 1000 | i 000 | 1.000 | 1000 | i : ; $ 
g d 1000 i 1.000 | 1.000 | 1.000 | 1000 | 1.000 
35 | 1.000 | 1.000 | roo | poo | 1o00 | tooo | 1o00 í : m 
` 1.000 p 1.000 | 1.000 | 1.000 | 1.01 
Eai 4000 | 1.000: | 1-000 1.000 re om 1.000 | 1.000 | 1.000 | 1000 | 1.000 
I aa ala |, 2000 1.000 1.000 1.000 1.000 | 1.000 


NN 


TAYLOR- 


For the previous example where : SR = 60, 
BR = .50, point-biserial 7 = .55, and propor- 
tion of satisfactory selectees equaled .696, the 
proportion in each quadrant is: 


A = (.696)(.60) = -42 
B = (.60) — (42) = -18 
C = (1) — (50 + -18) = 32 
D = (.50) — (.42) = -08 
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With these values, various “hit rates” and 
other comparisons of particular interest may 


be made. 
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JOB ANALYSIS METHOD FOR HEALTH-RELATED 
PROFESSIONS: 


A PILOT STUDY OF PHYSICAL THERAPISTS ! 


NEIL S. DUMAS? 


National Center for Health Services Research and 
Development, United States Public Health Service 


A method for analyzing work of health personnel was devised 


AND 


JOHN E. MUTHARD 


Regional Rehabilitation Research Institute, 
University of Florida 


and applied in 


a physical therapy service. Procedures for developing the special language for 
describing the tasks performed by physical therapists and methods for training 
observers to prepare sequential reports of the ongoing work of staff are pre- 


sented. Observers were able to reliabl 


the tasks in a physical therapy service 
cations of the method are discussed. 


Personnel utilization studies of health and 
rehabilitation services are urgently needed 
since the gap between the supply and demand 
of fully qualified persons has not been nar- 
rowing. The Health Manpower Report (Na- 
tional Center for Health Statistics, 1968) re- 
vealed, for example, that 39% more physical 
therapists and 23% more physical therapy 
assistants were needed to meet the demand 
for services in 1968. In this light, therefore, 
it appears necessary to plan for the redistri- 
bution of the functions and responsibilities of 
physical therapists and other health profes- 
sions to achieve maximum efficiency in train- 
ing and utilizing all levels of personnel in- 
cluding assistants and aides. While the meth- 
odology described in the following pages was 

1 The complete version of the results and instru- 
ments will soon be available in the final report of 
the Social and Rehabilitation Service Project RD- 
2870 from the National Technical Information Ser- 
vice, United States Department of Commerce, Spring- 
field, Va., at $3.00 for hardcopy and 65¢ for micro- 
fiche. Free copies are available from Regional Reha- 


bilitation Research Institute, 901 Lakeshore Towers, 
Gainesville, Florida 32601 as long as the supply 
lasts. 

This research was supported in part by a Research 
Grant No. 22-P-55144/4-02 from the Social and 
Rehabilitation Service, Department of Health, Educa- 
tion, and Welfare, Washington, D, C., 

2 Formerly with the Regional Rehabilitation Re- 
search Institute, University of Florida. Requests for 
reprints should be sent to Neil S. Dumas, Regional 
Rehabilitation Research Institute, University of 


Florida, 901 Lakeshore Towers, Gainesville, Florida 
32601. 
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ly report the detailed characteristics of 
over an extended period of time. Impli- 


applied to a single health profession, it 
should be equally useful in other areas. 
Recently established programs for physical 
therapy assistants and in-service training for 
aides have been based on professional judg- 
ments regarding the relevance of curriculum 
content to available job description data. An 
outstanding example of thorough program 
planning and implementation is described by 
Fenderson and Larson (1968), Nevertheless, 
while this Program development was carefully 
planned, the curriculum was based ae 
available job descriptions and professional 
judgments. Thus, it lacks the clear objective 
linkage to the work of the physical therapy 
Service that intensive job analysis can provide. 


Other studies of the roles and functions of 
physical thera) 


McDaniel (19 


rate description 
staff. 

To initiate a Program of research on the use 
of Support personnel in the health-related 
professions, a More sophisticated method for 
describing the 


5 le elements of jobs was needed— 
a job analysis p 


1 S Procedure with several particu- 
ar characteristics: (a) The language of the 


data and technique should be understandable 
to the professionals in the field being exam- 
ined. (b) The meas 


urements should be reli- 


able and in interval Scales insofar as possible: 


b 
d 
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(c) Classification of tasks should be completed 
on-site to minimize the introduction of errors. 
(d) The resultant data should be directly 
recorded on worksheets specifically designed 
for direct processing input. (¢) The results 
should include not only the proportion of time 
each task occurred but also a record of the 
sequence of events. (f) The methodology 
should be compatible with appropriate sys- 
tems analytic and operations research tech- 


niques so that critical decisions made about 
specific aspects of the job can be simultane- 
ously related to other important data ele- 
ments. 

Standard job analysis procedures such as 
those developed for personnel selection and 
evaluation were reviewed and not deemed 
suitable. The task inventory approach devel- 
oped by the Personnel Research Laboratory 
of the United States Air Force (Morsh, Mad- 
den, & Christal, 1961) was considered be- 
cause it has proven useful, reliable, and is 
designed to permit ready analysis by com- 
puters. Although it does provide for determin- 
ing the relative frequency with which the task 
is done and the proportion of time that work- 
ers contribute to each task as part of their 
job, it does not provide a description of tasks 
as they occur sequentially. Thus, it would not 
enable the data to be used for such purposes 
as determining the extent to which specific 
tasks can be separated from the other work 
of highly trained professionals and reassigned 
to technicians. Further, it would not enable 
simulation models to be prepared that would 
provide administrators with information re- 
garding the impact of various changes on 
other aspects of the total system. 

The series of studies conducted by Bobbitt 
and her associates (Bobbitt, Gourevitch, 
Miller, & Jensen, 1969; Bobbitt, Jensen, & 
Kuehn, 1964) suggested a model for develop- 
ing a computerized procedure for analyzing 
the patterns and sequences of tasks in a job. 
They developed an approach for describing 
the social interaction of mother-infant mon- 
keys that provided “a relatively exhaustive 
description of interactive behavior as it oc- 
curs, aimed at discovering those patterns and 
sequences of behavior which seemed to char- 
acterize the interaction [p. 112].” Such a 
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description would enable an investigator to 
develop new hypotheses that could be put to 
critical and circumscribed tests. 

In their study of patient behavior within a 
hospital environment, Willems and Vineberg 
(1969) describe an observational procedure 
that meets several of the criteria for a suitable 
technique mentioned above. They obtained 
continuous descriptive protocols for complete 
days by having observers tape record their 
observations in everyday language. Subse- 
quently, they were able to reliably classify 
the observations into six categories with a high 
(84%) degree of concurrence between two 
raters. This procedure suggests methods that 
can meet many of the criteria set but lacks 
the precision and computer capability required 
for large scale studies. The pilot study con- 
ducted in the University of Florida physical 
therapy service demonstrated the character 
of a new method and its efficacy in meeting 
the criteria set for a sophisticated task analy- 
sis system. 

METHOD 
Instrumentation 


To collect data about the work demands within a 
clinic, a job-analysis tool called the Physical Therapy 
Observation Schedule (PTOS) was developed. This 
technique is a highly structured language for the 
detailed recording of events and is based upon sig- 
nificant tasks and modalities that were identifed 
through a content analysis of selected physical ther- 
apy and medical texts. In using the PTOS, a trained 
observer constructs a series of “sentences” to de- 
scribe the tasks performed by a clinic staff member. 

Encoding a task. For each task (which begins and 
ends at the observer’s judgment), the following data 
are recorded in sentence-like form: 

1. Action—what is being done. Twenty-eight terms 
were grouped in three categories, that is, people, 
data, and things (Fine & Heinz, 1958). From the 
perspective of the observer, a task statement might 
be “the therapist (I’m observing) is action (doing 
something). . . .” One term is chosen. 

2. Object—type of action in progress. Forty-seven 
terms were grouped but not fixed in parallel, people, 
data, and thing categories. These terms are used to 
expand a description of the action, for example, “the 
therapist (I’m observing) is preparing (action) a 
bandage (object)... .”’ One term is chosen. 

3, Person—individuals involved other than staff 
member being observed. Sixteen categories and com- 
binations of individuals “to, for, or with whom” the 
action is carried out, for example, “the therapist (I’m 
observing) is preparing (action) a bandage (object) 
for a patient (person)... .” This category enumerates 
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OBJECT (2) 


PERSON (3 
ACTION (1) = 


100-ACTION/TECHNIQUE 100-PATIENT 


110-EXERCISES 010-PT-AIDE 
PEPPER 111-BREATHING O11-AIDE 
112-MUSCLE RE EDUC. | 012-PT- 
121 PREPARES P 113-RELAXATION 013-PT 


122-TRANSPORTS P 
-HANDLE 

141 COMMUNICATES 
142-PARTICIPATES 
143-OVERSEES 
150-EVALUATES 
161-SUPERVISES 
162-TEACHES 
163-CONSULTS 
170-DIRECTS 
199-OTHER* 


114-STRENGTH. GEN'L 
115-BALANCING 
116-AMBULATION 
117-DEVELOPMENT AL 
118-POSTURE 
119-SPEC, STRENGTH 
120-FIRST AID 
130-POSITIONING 
140-BANDAGING 
150-TRANSPORTATION 
160-SELF CARE 
170-PATIENT RESPONSE 
180-PROFESSIONAL* 032-SECRETARY 
199-OTHER* 040-RELATIVE 
SS SSS = — 060-ALONE 
200-PROGRESS NOTES 099-OTHER* 
201-INITIAL 
202-FINAL 
210-REPORT 
211-STATISTICAL 
212-NARRATIVE 
220-RECORDS 
221-ATTENDANCE 
222-APPOINTMENT 


014-STUDENT 
015-AIDE-AIDE 
016-ASSISTANT 
017-PT-ASST 
018-ASST-ASST 
019-ASST-AIDE 
020-PHYSICIAN 
021-NURSE 
022-0T 
023-OTHER HRP 
030-CLERICAL 
031-RECEPTIONIST 


999-TRAVELS 


210-COPIES 
220-RECORDS 
230-READS 
240-WRITES 
250-PLANS 


01-THERMOTHERAPY 
02-CRYOTHERAPY 
03-DIATHERMY MW 
04-DIATHERMY SW 


05-HOT PACK 
299-OTHER* 223-MEDICAL CHART 06- INFRARED 
230-MESSAGE 07-PARAFFIN 


240-TREATMENT PROGRAM 
250-INVENTORY 


260-EVALUATION FORM 10- 
270- INSTRUCTIONS ERA 


11-CONTRAST BATH 
299-OTHER* 12-HUBBARD TANK 
--— -WHIRLP 
310-EQUIPMENT 20-EVALUATIOne 
320-PROSTHESIS 21-ROM 
330-ADJUNCT 22-M.M.T 
340-BANDAGE 23-ELECTRODIAG 
350-ASSISTIVE DEVICE | 39 : 
ASSISTI O-INSTRUCTION 


08-ULTRASOUND 
09-ULTRAVIOLET 


310-PREPARES T 
311-TRANSPORTS T 
320-OPERATES 


31-GEN'L EXER 
330-CONSTRUCTS 352-CRUTCH z : 
340-ADJUSTS 353-WALKER ae er ERER, 
350-SETS-UP 354-CANE 34-A.D.L ` 
360-APPLIES 355-WHEELCHAIR 35-AMPUTEE 
399-OTHER* 360-PHYSICAL PLANT 


40-OTHER MEDICAL 
41-TRACTION 
42-ELEC. STIM. 
43-POST. DRAIN, 
44-TILT-TABLE 


399-OTHER* 


*EXPLANATORY 


1-RPT 


45-MASSAGE 
2-ASST+ 
NOTE REQUIRED 3-ASST so-cien su 
4-AIDE+ 60-P 
PADE ROFESSIONAL* 


99-OTHER* 


Fic. 1. Physical Therapy Observation Schedule coding table 


the individuals involved in the task being observed, 
This does not include the staff member under direct 
observation who, of course, must always be present. 
One or more terms are chosen, 


4, Mode—Thirty-three basic divisions of treatment 
or activity. One term is chosen, 

5. Level—the staff member's judgment of the 
Complexity of the task just completed. That is, at 


$ 


a 2 
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HOSP. 1-RPT 5-AIDE OBSV DATE START TIME STAFF NAME 
3-ASST 9-OTHER NUMB MO DAY YR HOUR MINUTE Soc SEC # 
ol 1 121 UN 2j4] 19) 1113) 1510 2]2/2]-]1]71-101010}0 
PATIENT NUMBER AGE VISIT 1-MALE 1-INPAT REFER 1-0B/GYN 5-NEURO 


2-FEM. 2-OUTPAT DEPT 2-ORTHO 6-PEDIA 


23 4|5)6 7\6 on l 1 3| 3-MEDIC 7-SURG 
LL a B 4-E/N/T 8-PSYCH 


PATIENT NAME A. R. Jones pye : 9-OTHER 
DIAGNOSIS/HISTORY/OBJECTIVES Arthritis of Nipa & Spine 
Eva uate and Mobilize 


ACTION OBJECT PERSON MODE LEVEL DURATION COMMENTS PAGE SEQ 
Vlani 1710 {ifojo} 16101 11) 10ļ0| 18l0|-Medical History Johi 
eni nnie 110101 33L Lb 10101 15101 Jol2t 
njen; ppls, moo BBL NL Jou) 17191 10131 
jaji] 1117101 Jojo] 16101 LE 1010} [7l3|-Medical History 40151 
Lysol 11010) jaojo; J210) UL {oli | 10|7]-Gen'1. R.0.M.-not form J0]J6| 
14413} 1119191 110101 16101 uL oli Hl4l-waiting Jowl 
Janjo; 131m0) Jolelol Jz J5L Jaji 1al goniometer, forms, Jols] 
yuso] 11010 jijojo) Jai DL 1019] |8|4|-Goniometey 10191 
yuni DeL 11010 13131 VL Jou) 1514) {1 ol. 
jyan} Hilol 1110101 161o NL 1013} 1115l-Medical History ppp 
mem [M6 Lolo) Lata) LL 10111618L piel 
Į3l11o} 131510) 111010) 13131 15L 1011 10l6l-Parallel Bars L113) 
ws) Lye) mool nL ola uel 114 
llais) dojo) lolol lejo) Lat 1013] 1313 -Resting Ls} 
Ht Et Lei 2 be Le, 
LEL II PE EEL BIL 
L LLLE LMA e ALLA hist 
LLL LEED en Lt Shit joy 
iyidi iiia Lt AA LL 12101 
Fic, 2. Physical Therapy Observation Schedule coding form. 


the completion of cach task (i.e, when it is being rienced assistant, (3) assi 5 y 
Fi * 5 assista 3) assistant, (4 xper aide 
coded) the observer asks “What is the lowest level and (5) aide. j (4) experienced aide, 


of staff member you would have permitted to per- In addition to the above, the duration of each task 
form the task you just completed with the patient is recorded in minutes and hundredths of àmi “ 
(if applicable) you just helped. . . ?” This judg- The recording procedure also indicates Pree = 
ment is on a S-point seale: (1) therapist, (2) expe- in which it occurred via nanie aa Figure 
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1 is a copy of the coding table used by the observers 
as part of their equipment, while Figure 2 displays a 
completed coding form. 


Observer Ti raining 


Flash cards and content test. To field test this 
encoding technique, four licensed physical “ace” 
were recruited and trained as observers. Each ob- 
server was initially furnished with a copy of the 
PTOS Dictionary and a set of flash cards. The Dic- 
tionary provides a definition of each of the 129 terms 
used in the PTOS via narrative or enumeration of 
as many unique mples of cach as possible. The 
flash cards were used a training device to aid the 
observers in memorizing the literal language of the 
definitions, This proved to be an e: since the 
language of the Dictionary origina ly came from 
physical therapy and medical texts and was familiar 
to the observers. A test comprising 81 © 
selected as a stratified sample from the 
was administered to all four observers, 
to select the proper codes for each item (ie. examples 
selected from definitions), three observers scored 
96% (three wrong) and one scored 100%, 

Videotape training. In order to more adequately 
simulate actual field conditions and improve the reli- 
ability of the PTOS in the observers? hands, 11 l-hr. 
videotapes of actual therapists, aides, and patients 
were used for training and testing as follo (a) 
Each observer viewed one tape individually and 
encoded the tasks scen, (b) All observers convened as 
a group to review the same tape and adjudicate the 
coding differences until a consensus was derived, (c) 
Steps a and b above were repeated for eight tapes, 
(d) Each observer individually viewed all eight tapes 
for a second time in three consecutive sessions and 
once again coded the events, (e) The observer group 
convened to discuss this latest coding and compared 
it to the previous group consensus of Steps b and ¢ 


above. (f) Reliability tests were conducted using the 
remaining tapes, 


amples, 
Dictionary, 
Given 30 min. 


Reliability Tests 


Reliability measure, With the | 
Possibilities (over 43,000) 
scheme, any identical judgments on the part of the 
observers during a reliability test Would be signifi- 
cantly different statistically from chance expectation, 
A more practical measure, and the one employed 
here, is to ask whether four trained individuals can 
watch a continuous stream of events and (a) decide 
that a task is beginning at some point in time, (b) 
decide that a task has ended at some point in time, 
and (c) label or encode the task with the same or 
similar three-part identifier, Essentially, what was 
being questioned was whether these four observers 


could, at any moment in time, identically encode the 
event in front of them, 


The reliability measurement unit co; 
in force during time cut into discrete Units of 10 min 
That is, the first minute of the test Comprised 10 
intervals during which some task or tasks occurred, 


arger number of 
inherent in the coding 


nsisted of code 


If one observer reported that a single task penmi 
during these 10 intervals while a second manan 
that another task began in the last inter al, there 
would be 9 intervals of perfect agreement and 1 
interval of disagreement l.e., two observers agreeing 
90% of the time). For cach interval, we were able 
to ‘count five possible occurrences: (a) four agreeing, 
(b) three agreeing, (c) two versus two, (d) two 
Versus one versus one, and (e) four different codes. 
The only modification of this procedure was with 
respect to transition points, that is, moments at 
which one task ended and another began, For tran- 
sition points the time interval was enlarged to a 
maximum of .30 min. (18 sec.). It was felt that if 
the observers could make the decision that a task 
had ended within 18 sec. of each other, then this 
should be considered close enough to be called simul- 
taneous. Since this methodology was designed to aid 
in practical decision making, there is little reason to 
be concerned with events taking less than the time 
necessary for a more extensively trained professional 
to call an assistant to do the job for her, With this 
rationale, accuracy within +30 min. (18 sec.) in 
measuring the duration of a task was judged suf- 
ficient. The judgments made by the observers during 
this larger time unit (maximum of three intervals) 
were, therefore, recorded as simultancous and identi- 
cal for all 18 see. (if the same code was selected). 


Field Test Procedures 


Sampling. Four hours of observations (two a 
in the morning and afternoon) were made for sie 
of 25 consecutive weekdays for a total of 100 mi d 
data. An additional 3 days’ data were also tolles i 
at the start of the study, Without the knowledge f 
the observers, it was determined prior to data ils 
lection that these 3 days were designated as a 
shakedown Period and, as such, considered to be 
potentially Contaminated, that i by observer ner- 


i 
vousness and abnormal staff reactions until the nov- 
celty of the Presence of 


a new person in the clinic 
wore off, The observers, who were licensed physical 
therapists themselves, wore their uniforms jn order 
to seem as much a 


ds S Possible a natural part of the 
clinic and minimize this contamination, 

Each observer received only her own portion of 
the sampling schedule and kept this information con- 
cealed from the clinic staff for obvious ascii, In 
order to provide ‘ness or other problems that 
might prevent from locating a particular 
staff member, ation table was provided 
for the observers! use. Using this guide an observer 
Was able to randomly select another staff member to 
follow if the signed Person was una vailable or 
could not be found. In Point of fact, this device was 
used for 15 out of 50 2-hr, observation periods with- 
out upsetting the balance of the sampling plan. 

Data collection. Just Prior to each starting time, 
cach observer checked her equipment (clip board, stop 
Watches coding table, and forms, ete.) and located 
assigned staff person °F used the randomization 
table to select another $, At the scheduled time, the 


a randomiz 


Jos ANALYSIS FOR PHYSICAL THERAPISTS 463 


observer began following her assigned S, encoding 
the nature-duration-complesity and sequence of cach 
different task performed by that person during the 
next 2-hr. period. In addition, the following data 
were collected once (on the first coding form) for 
each 2-hr. observation period as a whole: (a) hos- 
pital identifier, (b) classification of person being ob- 
served (therapist or aide), (c) observer identifier, 
(d) date, (e) starting hour and minute, and (f) staff 
name and identifier, A se ate coding form was 
completed for each patient treated during the obser- 
vation period. In addition to the other identifying 
information, it included: (a) patient identifier, (b) 
age, (c) number of previous visits, (d) sex, (e) in- 
patient or outpatient status, (f) referring depart- 
ment, and (g) admission cause-history. 

Oversee. Because of its unusual meaning and effect 
on the data collection, the action oversee deserve 
few special comments. Oversee refers to those busy 
times when a therapist is working with one patient 
and is simultancously watching or super sing an- 
other, In this case, it is equally unreasonable to claim 
that the therapist: (a) spent 1 hr. with the first 
patient and no time with the second, (b) spent 4 hr. 
with each, or (c) spent 1 hr. with each, To describe 
this special phenomenon, therefore, the term oversee 
was devised, since in the above illustration a second 
staff member would have been required if the thera- 
pist had not been able to supervise the second pa- 
tient. 

A single example should sufficiently illustrate the 
way in which the term oversee was employed. Con- 
sider a therapist who has just completed all the steps 
leading to a whirlpool bath, including positioning of 
the patient, ete. The therapist now moves on to 
Patient B to provide some appropriate treatment but 
keeps an intermittent visual check on Patient A's 
status. The observer following this therapist would 
have first entered the oversee code as the last entry 
on the coding form that has Patient A’s identifying 
information on top. Next, the observer would have 
started a new coding form for Patient B by filling in 
his name and hospital identifier (to obtain other 
required information after the 2-hr. period) and 
encoding the first task. If, for example, the therapist 
completes Patient B’s treatment and returns to Pa- 
tient A, then the observer follows the procedure just 
described as if the latter had not been seen before. 
In this case, it is not uncommon to find two or 
three different coding forms completed for the same 
patient during a single 2-hr. observation period. By 
comparing the real time difference between the in- 
sertion of the oversee code and the therapist's return 
to that same patient, the duration of this subtle 
manpower phenomenon can be computed. 


RESULTS AND DISCUSSION 
Reliability Results 


A test of the four observers’ ability to reli- 
ably encode tasks were performed using previ- 
ously unseen videotapes of actual and hypo- 


thetical patients to simulate a sample of the 
same audiovisual stimuli that were likely to 
be encountered in clinics. The initial video- 
tape test comprised eight sequences (47.2 
min.) and included 35 tasks. During the test, 
each observer made four brief judgments and 
transferred three additional data to encode a 
single task. These seven items are: 


I. Judgments 


1. Choose a single code describing the 
action observed (V = 28). 

2. Choose a single code describing the 
object (i.e., type) of action observed (N 
= 47). 

3. Choose a single code describing the 
mode (sic modality) in which the action- 
object should be classified (N = 33). 
This three part judgment-encoding pro- 
cedure has V = 43,428 possibilities. 

4. Decide that the task has ended and 
the next event begun. 


II. Data transfer 


1. Having decided that a task has ended 
(1.4 above), record the actual duration 
in minutes and hundredths of a minute. 
2. Encode the people (staff and others) 
who participated in this task (person). 
3. Record the staff member’s judgment of 
the task’s level (rating of the minimum 
level of experience and training the 
observed staff member thinks is required 
to have performed the task just com- 
pleted with the particular patient [if 
any| and that was just treated). This 
was not part of the reliability tests. 


The results of four observers independently 
encoding the identical 47.2 min. (472 inter- 
vals) of videotape prior to the data collection 
period appear in Table 1. Entries in the cells 
are frequencies and cumulative row percent- 
ages reading left to right. For example, in 341 
of the 472 intervals (72% of the time) all 
four observers chose the same action code. In 
92 of the 472 intervals, three observers chose 
the same action code. Thus, three or more 
observers chose the same code 92% of the 
time. ‘ 

A second reliability test was performed im- 
mediately after the field test was conducted 
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sa TABLE 1 


OBSERVER AGREEMENT BEFORE DATA COLLECTION 


No. observers agreeing 
Code category* as i : 
4 3 | 2-2 | 2-1-1] None 

ig 341 | 92 23 9 7 

Cumulative % 72| 92| 97| 99 | 100 
M A 
. 450| 14 8 0 0 

Cumulative % 95| 98 | 100} 100 | 100 
ae al 322| 99 | 37 0 14 

Cumulative % os| s9 | 97| 97 | 100 
Eo 458] Sioa 9 0 

Cumulative % 97| 98 | 98| 100 | 100 

ition points 

Cumulative % 55} 85 |100| — = 
Object 

No. 129 | 116| 20| 197 10 

Cumulative % 21 521 56 98 | 100 
Action-Mode & Object 

No. 115] 99°} 9] 223 26 

Cumulative % 24| 45 | 47 95 | 100 


s The percentages rer 
the codes for which there w 


© Usable for explication purposes only, 


in the physical therapy clinic at the University 
of Florida, J. Hillis Miller Health Center, 
Again, all four observers independently en- 
coded identical videotape lasting 93.6 min, 
(936 reliability intervals). The results of this 
second reliability test appear in Table 2, 

For the purposes of future use, it was de- 
termined on the basis of both reliability tests 
that the action-mode combination (N = 924) 
would be the best basic unit of observation 
since it demonstrated a very good reliability 
(greater than 86% of the time, three or more 
observers agreeing) both before and after 
data collection. The observers? ability to 
enumerate the persons present at any time is 
almost beyond question (greater than 96% of 
the time, three or more observers agre 
As stated, the duration of any identifie 
is based upon the observers? judgment 
when it begins and when it ends, that is 
“transition points.” This measure also meets 
a relatively high standard of reliability with 


eing), 
d task 
S as to 
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three or more observers agreeing more than 
84% of the time. , 

The weak spot in the PTOS technique ap- 
pears to be the use of the object codes to ex- 
plicate the action-mode combinations. The 
pretest showed a 45% reliability (three or 
more observers agreeing) while the posttest 
resulted in a 67% agreement at that level. 
Since this would mean that the object code in 
combination with the action-mode unit is 
likely to be correct only 60% of the time (a 
weighted average of 45% and 67%), it was 
decided to reserve its use to descriptive rather 
than numerical reporting, that is, for explain- 
ing action-mode combinations that have high 
frequencies. If desirable, however, a reliable 
action-object-mode combination seems feasible 
if the circumstances are such that the utility 
of the resultant data would warrant the in- 
creased costs for modifying the Dictionary 
and extending the training of observers. 


Field Test Results and I. mplications 


The field test produced some interesting 
data that bear both on the methodology and 


TABLE 2 
OBSERVER AGREEMENT AFTER Data COLLECTION 


+ ing 
No. observers agreeing 
Code category 


4 3 |22 |2-1-1 None 
Action 
No. 5 5 7 0 
$ 656 | 225 | 18| 37 
Cumulative % 70| 94 | 96| 100 | 100 
Mode 
No. 562 |313 | 57] 2 | 2 
Cumulative % 60 | 93 100 
Action & Mode - $ 
No. 371 | 435 9 
X : ; 5 | 42) eo | 3 
Cumulative % 39 0 
ie o 86 | 90} 97 | 10 
No. 890| 1 
: 6] 2] 4| 0 
Cumulative % 95 | g 00 
Transition points "| | | ee | 
No. 36 ” 
. 13| o| a 
Cumulative % ee 
a A 62| 84 |100| — 
No. 7 
o 283 |379 | 110| 1600| 4 
Cumulative % 30} 70>) 82| 99 | 100 
Action-Mode & Object 
No 5 
o 245 |384 | o3| 183 | 31 
Cumulative % 26| 6| 77| 96 | 100 


* Nonapplicable, E 
able with reservations, 
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the utility of the information derived from it. 
Of the 29 action codes, only 28 were used by 
the observers. Similarly, only two-thirds (22) 
of the modality codes were employed. In all, 
41 different task statements (action-mode 
combinations) were used to encode more than 
80% of the 6,260 events observed over a 25- 
day period. This suggests that the problem of 
analyzing the work of the physical therapist 
relative to patient demands is not only sub- 
ject to solution but also involves only a lim- 
ited range of observable phenomena. 

Preliminary analysis of the data from this 
one-case study provided other insights into 
the clinic operation. While almost one-third of 
the work now performed by therapists was 
judged suitable for reallocation to lesser 
trained personnel, these observations also pro- 
duced a view of some of the subtleties in- 
volved. In very few cases did the therapists 
believe that a task or set of tasks should be 
totally delegated to some other level of per- 
sonnel. Rather, patterns of systematic judg- 
ments about treatments and services to par- 
ticular classes of patients have begun to 
emerge. In short, there is at least a good basis 
now for relating the effect of a task-patient 
interaction to the complexity of the service to 
be provided. 

Tf these preliminary data hold up in a 
larger study that is now being planned, the 
problem will be altered from one of trying to 
enumerate and understand a large number of 
tasks to one of working with a limited number 
of services provided to a wide variety of 
patients. The planned approach to optimizing 
the manpower, equipment, physical plant, and 
training programs relative to patient demands 
involves the development of a simulation 
model and stochastic linear program. Briefly, 
national and local health statistics provide 
usable forecasts of the probable number of 
patients in each admission (treatment) cate- 
gory. The job analysis data could translate 
these forecasts into amounts of manpower at 
different skill levels that will probably be re- 
quired to perform particular tasks with par- 
ticular classes of patients. This information, 


465 


in turn, can provide not only estimates of the 
probable quantities of equipment and physical 
plant required but also can suggest alterna- 
tive curricula for training programs that maxi- 
mize the utility of the personnel being edu- 
cated. Hopefully, the end result will not only 
be efficient and effective physical therapy 
services at the lowest possible cost, but also 
programs of higher education that provide 
preparation with increased relevance for pa- 
tient needs. 


REFERENCES 


Awpersox, H. E., Arprevce, H. P., Warre, B. C., & 
Wror, M. C. The roles of the physical therapist: 
Their importance to the patient. Journal of the 
American Physical Therapy Association, 1965, 45, 
768-772. 

Bossitt, R. A, Gourevircn, V. P., MILLER, L. E., & 
Jensen, G. D. Dynamics of social interactive be- 


N 
havior: A computerized procedure for analyzing 
trends, patterns, and sequences. Psychological 
Bulletin, 1969, 71, 110-121. 

Bossrtt, R. A, Jensen, G. D, & Kurns, R. E. 
Development and application of an observational 
method: A pilot study of the mother-infant rela- 
tionship in pigtail monkeys. Journal of Genetic 
Psychology, 1964, 105, 257-274. 

Bricut, C. P. Professional and non-professional 
activities of staff physical therapists. Journal of 
the American Physical Therapy Association, 1962, 
42, 769-773. 

Fenperson, D. A. & Larson, C. W. Planning and 
establishing a physical therapy assistant program. 
Physical Therapy, 1968, 48, 963-967. 

Fine, S. A., & HEINZ, C. A. The functional occupa- 
tion classification structure. Personnel and Guidance 
Journal, 1958, 180-192. 

McDaniet, L. V. The critical incident method in 
evaluation. Physical Therapy, 1964, 44, 235-242. 
Morsn, J. E., Mapex, J. M, & Curistat, R. E. 
Job analysis in the United States Air Force. 
(Tech. Rep. No. AD 259 389, WADD-TR-61-113) 
Lackland Air Force Base, Texas, Personnel Re- 

search Laboratory, 1961. 

NATIONAL CENTER FOR Heattn Statistics. Health 
resources statistics: Health manpower and health 
facilities. (Public Health Service Publication No. 
1509) United States Department of Health, Edu- 
cation, and Welfare, Washington, D. C., 1968. 

Wittems, E. P., & Viveperc, S. E. Direct observa- 
tions of patients: The interface of environment 
and behavior. Psychological Aspects of Disability, 
1969, 16, 74-88. j 


(Received May 25, 1970) 


Journal of Applied Psychology 
1, vel SSO RE, 5, 466-469 


IDENTIFYING PATTERNS 
FROM STUDENT 
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University of New Engla 


A number of studies have attempted to 
bility underlying students’ perceptions 


OF TEACHER BEHAVIOR 
EVALUATIONS ' 


R. K. BROWNE, asp P, RICH 


nd, Armidale, Australia 


identify the basic dimensions of varia- 
of teachers. It is possible that these 


perceptions reflect stylistic differences among teachers. Applying the method of 


discriminant analysis, it was shown th 
characteristic of students’ discrimination: 
representing important configurations of 


tion of behaviors for each teacher was viewed as 


teaching style. The teachers in the study 
these patterns of teaching behavior, 


Student evaluation of teaching is increas- 
ingly accepted as a normal part of higher edu- 
cation. It raises questions, however, concern- 
ing the basis of student judgments and the 
extent to which students discriminate between 
different patterns of teaching behavior., 

Some recent studies (Bendig, 1954; Isaac- 
son et al., 1964; Ryans, 1963; Smalzreid & 
Remmers, 1943) have factor analyzed stu- 
dents’ ratings of teachers and described be- 
havioral dimensions on which teachers vary, 
but the separate factors do not represent spe- 
cific kinds of teachers. As Morrison and Mc- 
Intyre (1969) assert “it is the patterns in 
which they are combined in individuals which 
characterize their teaching [p. 134].” 


analysis was applied ti 
teachers with the Purpose of (a) 
patterns of teaching i 
dents were able to differentiate mo, 


METHOD 


Teaching behavior was analyzed b 
18-item questionnaire comprising 
loading items from each of the six b. 

1 This study was supported by a research grant 
from the University of New England, Australia, 

? Requests for reprints should be sent to W, Ss. 
Simpkins, Faculty of Education, University of New 
England, Armidale, New South Wales, Australia 2351, 


yY means of an 
the three highest 
asic factor dimen- 
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at distinctive patterns of items were 
s. These patterns were interpreted as 
teaching behavior, and the combina- 
a description of his personal 
were clearly differentiated in terms of 


sions in the Isaacson et al. (1964) study. The factor 
dimensions are described in Table 1, 

The questionnaire of 18 items was applied to a 
group comprising 57 students undertaking an Arts 
degree at the University of New England, Australia, 
and who were enrolled in a course of studies pre- 
sented by five teachers, The students, with at least 
1 yrs academic experience at the University, e 
mainly in residence, and the course group was small 
enough to allow frequent personal contacts among 
the students and between students and teachers. Thus, 
they had an Opportunity to observe teachers closely 
and discuss among themselves their teachers ae 
their course experiences, Further, a course WES ia 
lected that Was presented by a panel of teachers. to 
students enrolled in this course were in a poole c- 
make considered judgments of and comparisons i 
tween teachers within a recognized subject field. ent 
_ The questionnaire was completed by cach studen 

i “irst, they were asked to describe their 
Conceptions of an ideal teacher in terms of the 18 
scale items, Then, on Successive uses of the question- 
naire, they were asked to describe cach of the five 
teachers who had been responsible for presenting the 
course, All SIX sets of responses were included in the 
ceptions of ta at ether wi uae Be 
Provide a broader’ i ‘ne A ‘oe eE iua ie 
tions of teaching 1 myi tanding of the configu ht 
en aching behavior on Which teachers migh 
vary, 

In a preliminary an 


alysis, 
correlations for the 18 


the matrix of inter- 
items was obtained from the 
ces and then factor analyze 
method. The obtained factor 
ere rotated to Kaiser’s (1959) Varimax 

as used to check the underlying 


€ 18 items y 


dimensions of th vith those of the original 


questionnaire, 
To determine tl 


he extent to whi ts dis- 
criminated the Sii © which studen 


teachers (including the ideal) the 

ere analyzed using the multiple 
i ysis technique (Cooley & Lohnes: 
1962; Veldman, 1967). The advantage of this method 
ed all 18 items in combination and 


B 


tuhe À 
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permitted a comparison of patterns of behavior re- 
flected in item groupings. Correlations between each 
of the items and the obtained discriminant functions 
Were used to interpret the patterns of teaching be- 
havior described by each discriminant function. 
Teacher centroids for each pattern in simulated 
three-dimensional space were used to indicate the 
relative standings of the teachers on each pattern. 


RESULTS 

The factor analysis of the item responses 
resulted in patterns of loadings very similar 
to those reported for the original, full ques- 
tionnaire. In each case, the three items se- 
lected to represent each of the original factors 
loaded highly together. The results showed, in 
fact, that the original factors adequately de- 
scribed the dimensions of student ratings of 
teachers as evident in the responses of Aus- 
tralian students to the selected 18 items. 

The test for significance of the differences 
among the six teachers on the 18 items gave 
rise to a Wilk’s lambda of .1835. The F ratio 
or this discriminant solution was 7.213 with 
df = 90, 1552. The probability of obtaining 
an F as large as this or larger is very much 
less than .001, and the hypothesis that these 
six teachers can be considered as random 
samples from a common population is rejected, 

The discriminant analysis gave rise to five 
discriminant functions, the latent roots for 
which are shown in Table 2 together with de- 
tails of chi-square tests of significance. 

While all five roots were significant at bet- 
ter than the .05 level it can be seen that about 
nine-tenths of the information concerning 
teacher separation is associated with the first 
three roots,* and since the fourth and fifth 
roots were more inaccurate, only the first three 
functions are used in describing the discrimi- 
nant space. Table 3 shows those correlations 
of discriminant and original variables, greater 
than .30, used in interpreting the three-dimen- 
sional space. 

An examination of the correlations of the 
original variables with the discriminant func- 
tions strongly suggests that each of the three 


functions defined meaningful patterns of 
teaching behavior. The configurations of 


teaching behavior represented by the three 


3 That is, about 91% of the total discriminating 
variance is accounted for by the first three functions. 
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TABLE 1 


Six DIMENSIONS OF STUDENT EVALUATION 


Factor Description (behaviors defining) 


E 
| 
Skill | General instructional skills 
Overload | Amount and diffculty of work ex- 
| pected by teacher 
| Teacher's course organization and 
planning 
Teacher's expressed evaluations of 
student performance 
The free interchange 
teacher and group 
Friendly interest on the part of the 
teacher 


Structure 
Feedback 


Group interaction between 


Rapport 


Note. See Isaacson et al, 1964, 
major discriminant functiens are described 
below. 


Discriminant Function 1 


This function accounted for the greatest 
amount of the variance (50.7% ) with 12 of 
the items in the inventory highly correlated 
with the discriminant function. It appears to 
describe an interrelated pattern of teaching 
behavior that draws elements from a number 
of the basic factors, rapport, interaction, feed- 
back, overload, and structure, the latter being 
negatively correlated. 

A tentative interpretation of the number 
and types of scale items suggests a configura- 
tion of teaching behavior based on the readi- 
ness of staff to establish a personal relation- 
ship with students based on student partici- 
pation and, within this form of staff-student 
interaction, to praise student efforts where 


TABLE 2 


r Roots AND T: 
THEIR SIGNIFICANCE 


Lat 


S OF 


Percentage 
e of trace 3 
Wo: | Root | Secma a af 
fors 
1 1.139 50.7 | 250.94 22 001 
2 546 24.3 | 143.83 20 O01 
3 | .355 13.7 | 100.21 | 18 001 
4 | -126 | 5.6 | 39.13] 16 | .O1 
5 | 080 | 3.5 25.32) 14 | 05 
| 


a Trace of WA = 2.2459 (total discriminating variance 
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TABLE 3 


CORRELATIONS or DISCRIMINANT FUNCTIONS AND 
ORIGINAL VARIABLES 


| Correlation with dis- 
Item criminant function 
Factor | nos n 

1 2 E 

Ski —_ -76 — 
Skill 1(19) 7 

7 (40) = -80 — 

| 1345) — -66 | = 

Overload 2( 8) 35 3 | = 

8(24) =) ss = 

14(31) — 34 — 

5 : 3(15) | —.35 == = 

Structure 922) z5 = = 

1539) | =41 | — = 

Feedback 4(27) 35 33 | 176 

10(38) | 40] — | 32 

16(44) —! 39 44 

Group interaction 5( 7) Al 36 = 

11 (10) -42 | 40 — 

17 (36) 53 | 152 is 

Rapport 6(16) 53 40 ps 

12(21) 56) = = 

18(32) .63 da p 


à Item numbers referred to were those u 


sed in the question- 
naire in Isaacson et al, (1964), 


merited, to set stimulating 


goals, and to avoid 
Overcontrol of the learnin 


g situation, 
t these behaviors 
entiation between 


ed for 24.3% of the 
e items highly corre- 
ant function, 


and rapport. 


A possible interpretation suggests 


; l a con- 
figuration of teaching behavior hased 


Ton a 
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skillful and stimulating instructional tech- 
nique, in association with active student par- 
ticipation in instructional activities and the 
teacher’s readiness to make positive and nega- 
tive evaluations of student work while setting 
demanding course goals. As for Discriminant 
Function 1, the significance of any element in 
this configuration cannot be seen in isolation 
from other behaviors in the pattern. For ex- 
ample, the importance of the teacher making 
both positive and negative assessments of stu- 
dents’ work cannot be assessed without refer- 
ence to the need for a skillful and stimulating 


teaching technique, positive student participa- 
tion, etc. 


Discriminant Function 3 


This function accounted for 15.7¢; of the 
variance with three items highly loaded on 
the function, In contrast to the two previous 
discriminant functions, all the items were asso- 
ciated with a single basic dimension of student 
evaluations as identified by factor analysis 
(i.e., feedback). This pattern is defined by the 
continuous appraisal of students’ work on the 
Part of the teacher and by the clear communi: 
cation of teacher assessments to the student, 
whether positive or negative, be 

The three discriminant functions ma t 
viewed as patterns of teaching behavior n 
constitute important facets of perceived teac - 
ing style, and it has been shown that they 


provided a highly significant differentiation in 
students’ percepti 


of teachers ( 

It then be 
relative positi 
Patterns and x 


Purpose, the respective 
ere calculated and are 


Some indication of the pat- 
ehavior that distinguished 
>e obtained by noting his 
each of the three discrimi- 
r example, Table 4 shows 


ant Function 2 


ant F » and high on Discrimi- 
nant Function 3. 


Reference to the patterns 
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of teaching behavior associated with each dis-) 


criminant function gives a general indication 
of the orientation of his perceived style of 
teaching relative to the other teachers. 

The data as represented in Table 4° show 
that the respective teacher centroids are quite 
widely dispersed in discriminant space. The 
evidence indicates that students discriminated 
quite finely between six teachers including 
both the perceived and the student ideal. A 
separate discriminant analysis undertaken on 
the five actual teachers obtained a similar 
degree of separation. 


DISCUSSION 


The considerable degree of differentiation 
among teachers on each of the patterns sug- 
gests that students discriminated quite clearly 
between teachers in their perceptions. 

The patterns of mutually interrelated items 
indicate that students differentiated between 
teachers on the basis of distinctive configura; 
tions of teaching behaviors. This suggests the 
possible shortcomings of considering sepa- 
rately various elements and dimensions of 
teaching behavior. For example, in the second 
discriminant function, the value of active stu- 
dent participation achieves proper significance 
only when seen in conjunction with skillful 
teaching and not as an isolated element of 
instructional practice. i 

The relative standings of teachers on the 
three discriminant functions helped to de- 
scribe the sets of interrelated behaviors that 
defined their teaching styles. Also, each dis- 
criminant function identified a cluster of inter- 
related behavioral items that) could differ in 
kind, that is, could be drawn, from different 
dimensions of variability as defined in Isaac- 
son’s (1964) study. Discriminant analysis ap- 
peared to identify patterns, of, teaching be- 


TABLE,4 


TEACHER CENTROIDS 


| Coordinates on discriminant function 


Teacher | eet 
1 | 2 3 
Ideal 1.25 5.55 2.04 
T 08 | 348 2.18 
T: | 55 3.62 1.21 
Ts |" 2.55 3.27 1.90 
Ts | 9.33. 4 odan | 1.79 
Ts | 1.46 | gati | 92 
| 


havior in student evaluations and thus, merits 
further consideration as a research tool in the 
study of teaching styles. i 
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DIMENSIONS OF VISUAL INFORMATION DISPLAYS? 


ARTHUR I. SIEGEL? ann M. A. FISCHL 


Applied Psychological Services, Inc., Wayne, Pennsylvania 


Twelve air-defense-oriented visual information displays, representing all com- 
binations of three densities of air threat, two different primary display formats, and 
two conditions of information coding, were exposed to three S groups in order to 


determine the nature and structure of the display-observ: 
possessed different degrees of familiarity with the dis 
involved in their use. Independent multidimensio 


er interface. The S groups 
plays and the tactical concepts 
nal scaling analyses were per- 


formed on the data derived from each group. Seven dimensions, which were con- 


gruent across S groups, were isolated. The 
interpretation in terms of: (a) stimulus nu 
textual discrimination, (d) structure scanning, 
integration, and (g) cognitive processing. 


As systems unfold from the conceptual and 
definition stages to breadboard models, proto- 
types, and the first production model the 
engineering psychologist is called on with ever- 
increasing frequency to provide estimates of 
the probable effectiveness of alternative 
configurations. Quite commonly, the alterna- 
tive design configurations involve various 
means of presenting information to the human 
operators in the system. 

Concern for display observers’ 
structuring their percepts has receiv 
tion in several recent studies. Silver, Jones, and 
Landis (1965), in investigating radar training 
requirements, indicated a pronounced tendency 
toward stimulus orientation, as opposed to 
response orientation, on the part of relatively 
unsophisticated display viewers; viz. “it is 
clear from the data that the unskilled observers 
tend to make their judgments in terms of 
stimulus characteristics rather than in terms 
of what the situation requires of them [p. 17].” 
Although framed in a slightly different way, the 
Same concern was expressed by Rigney and 
DeBow (1966), who directed an investigation 
into “how the decision-maker arrives at a 
weighted combination of the different factors 
upon which an action decision depends [p. 1J.” 


design 


Ways of 
ed atten- 
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Research, under Contract N00014-66-C0183, Apprecia- 
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patterns of factor loadings suggested 
merosity, (6) primary coding, (c) con- 
(e) critical relationships, (f) cue 


These Es investigated the hypothesis that in- 
tegration of percepts takes place through a 
process of simplification; that is, selectively 
ignoring certain characteristics of a multiple- 
stimulus situation so as to concentrate on only 
one or two characteristics (Rigney & DeBow, 
1966, p. 5). The results of their study seem to 
have supported the hypothesis tested. La 
et al., (1967) examined experiential bac 
grounds and display user percepts from ie 
somewhat unique point of view. These Es on 
ployed Ss, who characteristically use ae 
Perceptual dimensions, to investigate the re n 
tionship between multidimensional rere 
measures and Ss’ “cognitive-perceptual sty i 
Lp. 4-7J.” Their results indicated that “char 
acteristics of the observer of the display itore 
act significantly with the actual display charac- 
teristics in determining the effectiveness ° 
display design Cp. 6-1.” 

© purpose of the present research was tO 
achieve some level of 
nature of the 
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techniques, and (e) testing the resulting factors for 
congruence among groups. 


Visual Displays 


The visual information displays employed were static 
displays of air-defense command-and-control situations. 
Two major east coast air-defense sectors (Philadelphia 
and Washington-Baltimore) were represented. The air 
threats against the two sectors, intelligence information 
about the threats, relevant meteorological information, 
and status information concerning the defensive re- 
sources were displayed. 

In the preparation of these displays, three indepen- 
dent variables were systematically manipulated: threat 
density (three levels), format (two levels), and color 
(two levels). The three threat-density levels (number 
of attacking aircraft) employed were: 5 threats (lowest 
threat density), 10 threats (intermediate threat den- 
sity), and 15 threats (highest threat density). 

Two completely different display formats were used. 
One format was completely pictorial. This format pre- 
sented a map of the area with superimposed range rings, 
battery locations, and threat and environmental in- 
formation. Surrounding the geographic presentation 
were pictorial and graphic summary presentations of 
related and required defensive information. The alter- 
nate display format investigated was the tabular dis- 
play. Each tabular display presented the identical con- 
tents as the pictorial, but presented all information in a 
digital, alphanumeric fashion by means of a large, frag- 
mented billboard. 

The third independent display variable that was 
manipulated was color application. The same pictorial 
and tabular displays were prepared in alternative 
versions, with and without the color coding. 

Accordingly, the full set of stimuli consisted of 12 
displays, that is, each of the 3 threat densities displayed 
in both tabular and pictorial format, and each of the 
2 formats appearing with and without color application. 
A more complete description of these visual information 
displays appears in Siegel and Federman (1967). 


Subjects 


One S group consisted of 17 United States Air Force 
Reserve Officers of the rank of captain and above who 
possessed extensive experience in operational command- 
and-control contexts. A second S group consisted of 
15 male college students who had previously been 
trained to a high degree of proficiency in the interpreta- 
tion and use of these displays. The third S group was 
completely naive as far as these displays and the con- 
cepts involved in their use were concerned. This group 
consisted of 21 male college students who had never 
seen these displays prior to the study and who, with 
one exception, had no prior military experience. 


Procedure 


Each group was convened and was introduced to the 
study. Then, four slides, which had been prepared for 
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training purposes and which were similar to those to be 
employed in the study, were presented and described. 
The four training slides included: (a) the pictorial for- 
mat in black and white, (b) the pictorial format in color, 
(c) the tabular format in black and white, and (d) the 
tabular format in color. All of these were at an inter- 
mediate threat-density level. The Æ explained and gave 
examples of the decisions that could be made on the 
basis of these displays. Next, experience was given in 
the type of scaling that Ss were required to perform 
during the experimental data collection. A part of the 
instructions given at this juncture is presented below: 


You are going to be shown some material on slides, 
and you will be asked to make judgments about them. 
The materials will be presented in pairs, and for each 
pair there is a line in this booklet that looks like this: 


Very Moderately 
Similar Similar 
1 2 3 4 5 
Comparison # — Á — — aM 
Moderately Very 
Different Different 
6 7 8 9 10 11 


As you can see, this is an 11-point scale ranging from 
1 for “Very Similar,” through less and less similarity, 
to 71 for “Very Different.” For each pair of materials 
that is shown to you, you are to put a check at the 
place on the scale which comes closest to representing 
your judgment of the materials. REMEMBER, 
THERE ARE NO RIGHT OR WRONG AN- 
SWERS—THE BEST ANSWER IS THE JUDG- 
MENT OF HOW THE MATERIALS SEEM TO 
YOU. 


‘The formal data collection was conducted in the same 
way as the practice scaling except that there were more 
stimuli (12 instead of 4), and in order to obtain esti- 
mates of intrajudge reliability, stimuli were pair-com- 
pared twice. The procedure was identical in each of the 
three S groups. The pairs were presented by projecting 
1 of the 12 displays as a quasi-standard stimulus against 
which each of the other 11 was successively compared 
in terms of similarity. A second display then served as 
quasi-standard for comparison against each of the re- 
maining 11, and so forth until every display had served 
once as a standard and 11 times as a comparator. In 
order to minimize the risk of inadvertently forcing the 
emergence of one of the independent variables as a 
dimension, and to control for position and sequence 
effects, the following controls were instituted: 


1. With respect to the three independent variables, 
the order of appearance of each of the 12 displays as 
a quasi-standard was: (a) simple alternation of low 
through high density; (b) random choice of tabular 
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format to begin, with subsequent simple alternation of 
the two formats; and (c) random choice of color display 
to begin, with subsequent ABBA sequence. 

2. Coin toss determined that the quasi-standard 
stimulus would appear as the left-hand member of all 
pairs for the first 66 comparisons and as the right-hand 
member for the last 66 comparisons. 

3. For each of the first six sets of comparisons (11 
comparisons per set), a different random sequence of 
the 11-comparison stimuli was employed. These six 
sequences were exactly reversed for Sets 7-12, with the 
quasi-standard stimulus from the corresponding initial 
set occupying the position vacated by the new quasi- 
standard stimulus. 


The mean and standard deviation of the distribution 
of similarity judgments of each pair of displays was cal- 
culated in each S group. The former of these statistics 
gave the interstimulus distances between the pairs of 
displays for insertion into the multidimensional anal- 
ysis; the latter statistic afforded appraisal of the homo- 
geneity of each of the groups’ judgments. In addition, 
since each pair of displays was judged twice by each S, 
the data for an intra-S reliability estimate were avail- 
able. A different one-third of the 66 display pairs was 
randomly selected for investigation in each S group, and 
the correlation between first judgment and second judg- 
ment of these pairs was obtained over all Ss in the par- 
ticular groups. 

Three independent multidimensional analyses of the 
interstimulus distances were performed, one for each S 
group. Each analysis began with the original inter- 
stimulus distances for the group concerned. The first 
step was estimation of the “additive constant” (Torger- 
son, 1958, pp. 268ff) required for converting the ob- 
tained comparative distances to absolute distances. 
Here, the procedure employed was the general solution 
described by Messick and Abelson (1956). The ensuing 
matrix of absolute distances was then employed in 
deriving the matrix of scalar products, which was then 
factored by the method of principal components with 
rotation by the Equamax approximation to orthogonal 
simple structure. The end result was three multidimen- 
sional scaling analyses, each representing the structure 
of the percepts of an individual S group. 


RESULTS 


Intra-S reliability was given by the correla- 
tion between S’s first judgment of a pair of 
displays and his second judgment of that pair.* 
The resulting standard deviations were of 
modest size, suggesting reasonable inter-S 

#The mean similarity judgments for each pair of 
displays, along with the respective standard deviations 
have been deposited with the National Auxiliary Pub- 
lications Service. Order Document No. 01473 from the 
National Auxiliary Publications Service of the American 
Society for Information Science, c/o CCM Information 
Sciences, Inc., 909 3rd Avenue, New York, New York 
10022. Remit in advance $5.00 for photocopies or $2.00 
for microfiche and make checks payable to: Research 
and Microfilm Publications, Inc. 
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agreement. The median of this set of 66 correla- 
tion coefficients was between .62 and .63, 
which, with zs of the sizes involved, is signifi- 
cantly greater than zero (p < 01). The S 
groups’ experience and background were ap- 
parently major influences on the reliability of 
the judgment process. Of the 33 reliability 
estimates larger than the median, 17 were de- 
rived from the Air Force S group, 11 from the 
trained college student group, and only 5 
emanated in the naive college student group. 

The additive constants required to convert 
the comparative distances (the mean similarity 
estimates) into absolute distances were 3.09 
for the Air Force S group, 2.66 for the trained 
college student group, and 2.51 for the naive 
college student group. Each mean was accord- 
ingly increased by the magnitude of its re- 
spective constant. Factor analysis of the 
matrices of scalar products derived from the 
matrices of interstimulus distance judgments, 
with rotation to orthogonal simple structure, 
yielded seven dimensions for the Air Force Ss) 
percepts, five dimensions for the trained college 
students’ percepts, and six dimensions for the 
naive college students’ percepts. Displays 
loaded most heavily on the respective dimen- 
sions are shown in Table 1. Three criteria were 
invoked in order to judge the adequacy of each 
solution : (a) sharpness of break in Eigenvalues, 
(b) amount of predictable variance encom- 
passed, and (c) apparent freedom from over- 
or underspecification. 

Examination of Table 1 indicates a high 
degree of apparent similarity, across S groups, 
in the loading of a number of the dimensions. 
For example, the first dimension in each S 
group appears to be similarly loaded, the 
second dimension in the Air Force group ap- 
pears to be similar to the fifth dimension in the 
naive college student group, and the forth 
dimension in each of the naive and trained 
college student groups appears to be similar to 
the fifth dimension in the Air Force S group. 
To assess these and other apparent similarities, 
tests of factorial congruence (Harman, 1960, 
pp. 256-259) were performed. The results of 
these tests are shown in Table 2. This table 
shows seven sets of congruence coefficients. 
Each set, except the second and the last, pre- 
sents congruence coefficients across all three S 
groups. For the second set there was no dimen- 


sion in the trained college student group sufh- 
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TABLE 2 
CONGRUENCE OF DISPLAY DIMENSIONS Across THREE SUBJECT GROUPS 
i i Dimension | Congruence 
Set | S group — S group no. coefficient 
g š 
i 2 
ir F 1 Trained college students 1 -92 
1 Ai Tate S il Naive college students 1 | 95 
Trained college students | 1 Naive college students 1 .94 
2 Air Force Ss 2 Naive college students $ 84 
N | 

Air Force Ss 3 Trained college students 2 90 
3 Air Force Ss 3 Naive college students 6 -65 
Trained college students 2 Naive college students 6 55 
Air Force Ss 4 Trained college students 5 88 
4 Air Force Ss 4 Naive college students 2 93 
Trained college students 5 Naive college students 2 7 
Air Force Ss 5 Trained college students 4 .92 
§ Air Force Ss 5 Naive college students 4 88 
Trained college students 4 Naive college students 4 92 
Air Force Ss 6 Trained college students 3 86 
6 Air Force Ss 6 Naive college students 3 88 
Trained college students 3 Naive college students 3 91 
7 Air Force Ss 7 Trained college students 3 E 

Air Force Ss 7 Naive college students 3 8 


ciently similar in appearance with the pair 
shown to warrant quantitative congruence in- 
vestigation. The last set is merely an elabora- 
tion of the sixth set, with Dimension 3 of both 
college groups being compared for congruence 
against a different dimension of the Air Force 


S group. Thus, the congruence coefficient across 
Dimension 3 of the college student groups was 
not repeated in Table 2, 


Disctsston AND INTERPRETATION 


The seven sets of dimensions reconcile, to 
greater or lesser degree, all 18 of the factored 
dimensions obtained from the three S groups. 
That is, the seven dimensions of the Air Force § 
group’s percepts, the five dimensions of the 


trained college students? percepts, and the six 


dimensions of the naive college students? per- 


cepts all seem to be manifestations of the same 
common core consisting of seven dimensions of 
the display-observer interface. 

Dimension 1 was the first dimension to 
emerge from the analysis of the data of each 
of the three S groups. In all three groups, the 
displays with the heaviest Positive loadings on 
this dimension were the low-threat-density pic- 


torial displays. The displays with the prasi 
negative loadings were the high-threat-dens y 
tabular displays. While the density bipolarity 
has interpretative appeal, another inet 
exists. With respect to the way the two formats 
Present information, it is apparent that to pre 
Sent the same information the tabular format 
requires considerably more stimuli than the 
Pictorial. Extending this thinking about num 
ber of stimuli, it follows that: (a) the fewest 
stimulus elements should be found in the low 
threat-density Pictorial display; the largest 
number of stimulus elements should be foun 
in the high-density tabular display; (b) ae 
second fewest stimulus elements should be 
found in either the low-threat-density tabular 
display or the intermediate-density pictori@ 
display; and (c) rising toward the larger num- 
ber of stimulus elements should be the high 
threat-density pictorial display and the inte- 
mediate-density tabular display, This w2 
virtually the exact pattern of loadings °” 
Dimension 1. The dimension, accordingly, W2 
interpreted to reflect stimulus numerosily. 

_ Dimension 2 emerged as the second dime 
sion in the Air Force S group and the fifth 
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dimension in the naive college student group. 
Within these dimensions, the displays with 
heaviest positive loadings were the tabular dis- 
plays of both high- and low-threat density. 
The displays with the heaviest negative load- 
ings were pictorial displays, primarily of inter- 
mediate density. This interactive pattern is 
suggestive of a process in which a display 
viewer, in seeking to organize the perceptual 
field, scans for the emergence of a structure. 
The structure may be given in part by the 
literal stimuli he sees, but the structure is also 
given by the meaning he attaches to those 
stimuli. Accordingly, the dimension was inter- 
preted to represent structure scanning, which 
is believed to be partly perceptual and partly 
cognitive, and to be manifest during early dis- 
play involvement but to dissipate with experi- 
ence with a display. 

Dimension 3 appeared third in the Air Force 
S group, second in the trained college student 
group, and sixth in the naive college student 
group. This dimension appears cognitive in 
nature and to be associated with the number of 
mental acts to be performed with the dis- 
played information. The evidence for inter- 
preting this dimension as being involved with 
cognitive activity is drawn from the near- 
perfect bipolar assignment of the extreme 
threat densities. For all three groups, displays 
of high-threat density possessed maximal posi- 
tive loadings on this dimension, irrespective of 
color coding or, in the trained group, display 
format. For the trained college student and the 
Air Force S groups, displays of lowest threat 
density yielded maximal negative loadings on 
this dimension, again irrespective of color cod- 
ing or display format. Thus, over all three S 
groups, the two groups with the greatest 
familiarity with the displays rendered a perfect 
density dichotomy. The available evidence 
seems disposed toward the increase in the num- 
ber of cognitive acts associated with the com- 
prehension of threat density as the principal pro- 
cess involved. Accordingly, the dimension was 
interpreted as one of cognitive processing activity. 

Dimension 4, somewhat like Dimension 2, 
seems to be associated with the display viewer’s 
organization of his perceptuocognitive field. 
Unlike Dimension 2, however, the apparent 
organization seems to be a more sophisticated 
one involving relationships among a larger 
number of variables. This dimension emerged 
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as the fourth in the Air Force S group, the 
fifth in trained college student group, and the 
second in the naïve college student group. With 
the exception that the low-threat-density 
tabular displays, in both color and black and 
white, carry the maximum positive loadings in 
all three groups, the remainder of the factor 
loading patterns appear, on the surface, to 
involve all combinations of density, format, 
color application, and numerosity. Actually, 
there seems to be an underlying ordering of 
these variables, namely, the display viewer’s 
identification of what he perceives as critical 
relationships. The display viewers apparently 
simultaneously dealt with critical relationships 
among numerous perceptuocognitive variables. 
The particular variables of choice seem a func- 
tion of observer experience with the displays 
and the information content, but the organizing 
probe for critical relationships seems present 
regardless. Accordingly, Dimension 4 has been 
named critical relationships. 

Dimension 5 emerged as the fifth dimension 
in the Air Force § group and as the fourth 
dimension in each of the student groups. Dis- 
plays with the heaviest positive loadings on 
this dimension were pictorial displays of inter- 
mediate threat density, with the addition of a 
tabular display of intermediate threat density 
in the trained S group. The pattern of negative 
loadings was consistent in the extremes for 
tabular displays, and reasonably consistent in 
the extremes for high density. Displays with 
heavy, but less than extreme, negative loadings 
showed ordering based on both density and for- 
mat. Several of the previous dimensions also 
seemed characterized by an interaction of con- 
structs associated with the displays. One im- 
portant interpretative difference between the 
present dimension and any of the others, how- 
ever, seems to be in the relative weight assigned 
to cognition in the interactions. Dimension 2 
involved a Format X Content interaction, but 
seemed to be primarily a perceptual dimension. 
Dimension 3 seemed exclusively cognitive. 
Dimension 4 involved the interaction of all 
variables but seemed more heavily weighted 
toward the sensory and the perceptual ac- 
tivities of relating numerosity (sensory), color 
(sensory/ perceptual), and format (perceptual) 
to each other and to the one cognitive variable, 
density. In Dimension 5, the emphasis seems 
to be on a cognitive integration of cues and 
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their meanings; the dimension was interpreted 

ue integration. . p 
4 Dimension 6 emerged as the sixth dimension 
for the Air Force group and as the piren 
sion for the other groups. There wasa o con: 
gruence between the Air Force § Dimension 
and Dimension 3 in each of the student groups. 
To which of the two Air Force S dimensions 
the student Dimension 3 corresponds saan 
unknown. Inasmuch as the lowest BoT 
coefficient (.80) in this set of five involv ed Air 
Force S Dimension 7, Air Force S Dimension J 
has been tentatively held out as a unique 
dimension. Air Force S$ Dimension 6 was 
interpreted as congruent with the student 
Dimension 3. Display Dimension 6 bears heavy 
positive loadings from displays of intermediate 
threat density, primarily tabular in format, and 
relatively high stimulus humerosity. Heavy 
negative loadings are contributed Principally 
by pictorial displays of lower stimulus nu 
merosity. The dimension seems best described 
as a Numerosity X Format interactive one, 
involving perceptual discriminations on the 
basis of physical stimulus contexts. To reflect 
this, the dimension was named contextual 
discrimination, 

Display Dimension 7 bears he 
loadings from tabular displays 
tive loadings from pictorial di 
there appear to be some density groupings, 
the rationale is not clear, Accordingly, it was 
interpreted as reflecting the two primary for- 
mats in which the sets of 


displays were encoded 
and named primary coding. 


This dimensional structure, along with the 
interpretation Suggested by 
Seems to possess a number of i 
both the human factors 
display designer, F 
found in other contexts (e.g., 
that a factorial structure wil 
perience level of the persons sampled varies, 
such longitudinal and developmental effects 
have not generally been considered at display- 
observer interfaces. The finding tends to sug- 
gest, however, that the optimal display design 
will vary with the experience level of the user, 
Thus, the display which js optimal for the 
user possessing one level of experience may not 
be optimal for a user who Possesses a greater or 
lesser experience level, Second, we note that 


avy positive 
and heavy nega- 
splays. Although 


the structure, 
mplications for 
practitioner and the 
irst, although others have 
Fleishman, 1954) 
l vary as the ex- 
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many of the dimensions seemed to be based a 
an interaction of constructs. Dimensions 2, 4, 
5, and 6 seemed to be based on a rather Gr 
plex interaction between display variables zi j 
display-observer variables. T hese Po PER = 
gest that typical human engineering chec 1s A 
which consider hardware/display variables a 
a rather simplistic cue manner, can at best, 
account for only a small part of the variance 
at the display-observer interface. Such check- 
lists seem to be most concerned with our pri 
mary coding factor and seem to have little or 
no concern with many of our other factors. 
Extending such checklists to the other con- 
structs indicated by the present analysis may 
be indicated. Finally, we note that the dimen- 
sions seem to fall into three logical groups. 
Stimulus numerosily and primary coding seem 
largely display-characteristic-oriented. re 
textual discrimination, structure SCANNING, ae 
crilical relationships seem perceptual in nature, 
while cue integration and cognilive processing 
appear largely cognitive in nature. This ae 
ing may help to define the nature of the so 
that occur in the display reading-informat 
processing situation and to provide insight T 
variables underlying effective display desig 
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RECOGNITION OF HUMAN FACES: 


EFFECTS OF TARGET EXPOSURE TIME, TARGET POSITION, 
POSE POSITION, AND TYPE OF PHOTOGRAPH * 


KENNETH R. LAUGHERY,* JUDITH F. ALEXANDER, asn ALAN B. LANE 


State University of New York at Buffalo 


‘Three experiments investigated Ss’ ability to recognize a target person (whom they 
had previously seen) in a test series of 150 pictures of faces. The longer the original 


exposure to the target and the earlier the t 


arget picture appeared in the test series, 


the greater the probability of recognition. The pose position in the test series (front, 
profile, or portrait view) and the type of photograph (color or black and white) did 
not affect recognition. These results have implications for procedures used by law 
enforcement agencies in the process of a witness attempting to identify a criminal 


from a set of mug shots. 


The last 10 yr. have witnessed a tremendous 
upsurge in psychological research on human 
memory. More recently, a great deal of this re- 
search has focused on visual memory or 
imagery. A general finding is that when people 
do form visual images of things to be remem- 
bered, retention is remarkably better than 
when images are not formed (Bower, 1970; 
Paivio, 1969). 

Several visual memory studies have used 
experimental paradigms that resemble the 
situation where a witness who has seen a 
criminal is trying to identify that person’s 
picture from a set of alternatives. Shepard 
(1967) reported an experiment in which Ss 
looked through approximately 600 pictures of 
“things” and “scenes.” The Ss were then pre- 
sented with pairs of pictures where one of each 
pair had been in the original set of 600, and 
the other had not previously been seen. Each S 
saw 60 such pairs and his task was to note 
which item in each pair had been seen before. 
The overall performance was 97% correct. 
Hochberg and Galper (1967) performed a 
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similar study using pictures of female faces, 
(taken from a college yearbook). Their Ss 
looked at 35 or 60 faces and then 15 pairs. 
Although recognition was better with 35 faces 
in the original set, performance was high (about 
90%), even with the larger set. In still a third 
study with this technique Yin (1969) reported 
06% correct identifications with pictures of 
adult males. 

In another experiment using a slightly dif- 
ferent procedure, Nickerson (1965) showed Ss 
200 pictures of “things.” In a second sequence 
of 400 pictures, Ss had to say whether each 
picture had appeared in the original 200. 
Nickerson systematically varied the lag (num- 
ber of intervening pictures) between the first 
occurrence (in the 200 set) and the second 
occurrence (in the 400 set). With lags of 40 
and 200, performance was 97% and 87% 
correct, respectively. 

From the results of these studies one might 
expect witnesses to be extremely good at 
identifying criminals from a set of mug shots. 
Yet, this is clearly not the case; at least not at 
the 90% and higher levels reported in the 
aforementioned experiments. Of course, there 
are many reasons why the identification of 
criminals from law enforcement files would not 
be expected to be at this high level. The experi- 
mental situations described above represent 
ideal conditions by comparison to the real- 
world situation. The trauma of being present 
when a crime is committed, differences between 
the criminal’s appearance during the crime and 
in the mug file (disguises, out-of-date photo- 
graphs, etc.), and the fact that a witness may 
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indeed look through thousands of —- 
before seeing the criminal’s picture are on za 
few of the factors that could reduce the pon : 
bility of a correct identification. On the o a i 
hand, there is little in the way of canoe 
evidence to indicate just what are the hum > 
abilities and limitations in identifving people 
from a set of photographic records. i 

With these considerations in mind, u mee 
experiments were designed to explore the 4 
of several variables on the identification task. 
The procedure in these studies was intended to 
resemble the task of a witness attempting to 
identify a criminal from a set of mug shots. All 
three studies were essentially exploratory in 
that they represented an attempt to isolate 
some of the important factors that determine 
identification performance. 


EXPERIMENT I 


The first experiment was undertaken to 
determine the effects of three task variables 
on the ability to recognize a face. The variables 
were target exposure time, target position, 
and observation instructions. 


Method 


Subjects. The Ss were 128 students from an intro- 
ductory psychology course at the State University of 
New York at Buffalo. There were 53 males and 75 
females. Participation in the experiment partially 
fulfilled a course requirement, 

Task. After viewing four sequentially projected slides 
showing different candid positions of a particular hum 
face, hereafter referred to as the target, Ss viewed 
series of 150 projected slides of human faces, T) 


task was to indicate on an answer sheet whethe 
slide in the test series was or w. 


tion, Ss indicated a confidence: 
decision on a 3-point scale 
certain. 

Design. The design of the experiment was a 
2K 2K DK 2 factorial with all factors manipulated 
as between-S variables. There were eight Ss in each of 
the 16 treatment groups. The variables were manipu- 
lated as follows: observation instructions included prior 
knowledge of the task or no knowledge of the task ; 
target exposure time was 10 sec. or 32 sec. (viewing the 
four candid pictures for 2.5 Sec. or 8.0 sec. each) ; target 
position in the 150-item test series was either 40 or 140; 
and two targets—a Caucasian male with fair colored 
hair and complexion wearing glasses, or a Caucasian 
male with darker colored hair and complexion without 


an 
a test 
he Ss’ 


r each 
as not the target. In addi- 


e level in their (Yes/No) 
—possibly, probably, or 


glasses. 
Apparatus. Instructions were 


Presented with a 
Sony TC 102 tape recorder. Slides 


Were presented with 
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a Kodak Carousel AV 900 projector with a age 
F3.5 Zoom Ektamar lens onto a Da-Lite screen. he: 
the four initial candid-position slides, the aes Bie 
operated manually. During the test series, the Biel 
was set to change automatically at 5.0-sec. ee Naa 
Actually, in the test series every other slot in ange 
trays was empty. With the automatic 5.0-sec. Si E 
this procedure resulted in each slide being R 
the screen for 4.0 sec. with 6.0 sec. between pa am 
during which Ss recorded their responses on the ans 
sheets. ee 
nek, The slides were half frame, 35-mm. ap 
chrome transparencies showing front bust ae 
Caucasian males within an age range of 17-23 yr. a 
four candid-position slides were of this same es 
except, of course, that instead of a front ee ‘ef 
were candid positions and ranged from left-si i 
right-side views. The candid positions were obi re 
by taking a large number of pictures of the wa aE 
casual positions and selecting those that seemed lea 
sed. 
K test series consisted of 149 decoys and the bate 
all front bust views. The decoys were randomly selec 
from a film library of 544 different faces. ‘The pia 
parameters of all slides were constant (sharpness, jE 
lighting, etc.). Faces of fair and darker coloring i ia 
with and without glasses were approximately Paes 
number and randomly distributed throughout hena k 
The target shown with glasses in the four pe aii 
position slides also wore glasses in the test pa teries 
the target without glasses appeared in the a to, be 
without them. The slides were projected so as 
life-size on the screen. sheet con- 
Procedure. Each S was given an answer 5 s on each 
sisting of four pages with two 25-space Cy from 
page. These spaces were numbered consecuti »sponded 
1-200. Each space on the answer sheets corre P slide, 
to the one of the test series slides. After viewing @ yel of 
S marked either yes or no and indicated @ HEN the 
confidence (possibly, probably, or certain) He test 
decision. Although Ss viewed only 150 slides in t der 
series, the answer sheet was numbered to 200 in oF 


peas A icipating 
to eliminate any bias that might result from anticipat p 
the end of the s 


eries. The instructions, incidentally, 
indicated that the target’s picture might appear 10 
mes, only once, or not at all. en 
of the test Series, each § was jh 
which he was to record any pene 
d in making his decisions and nis 
hat were most important 1? 


test series several ti 

At the conclusion 
a protocol sheet on 
ular technique use 
facial features t 
decisions. sons 

The eight Ss in each of the 16 experimental condition” 
were run as a group; the groups of Ss were random 
assigned to the conditions. Four Ss were seated at sa x 
of two long tables, one behind the other, in a nor iga 
sized classroom. Dividers were placed on the t3? 
between the Ss, T 


i ont 
he screen was located in the ff 
center of the room 


the screen. The projector was located at the rear eei e 
of the room, and the Projected light extended across 
center of the two tables onto the screen. The room be 
darkened to assure good vision of the slides but W! 
Sufficient light to read and mark the answer sheet. 


er 


l 
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The first part of the instructions was played to Ss. 
In the prior-knowledge conditions the instructions make 
it clear that Ss would later be looking for a picture of 
the person whom they were about to see (in the four 
candid-position slides). In the no-knowledge conditions, 
Ss were simply told that they would be looking at 
slides containing faces and that the following four 
slides (the candid-position slides) were examples. If an 
S knew the target, he was (to be) excused from the 
experiment. 

The four candid-position slides were shown for a total 
of 10 or 32 sec., depending on the experimental condi- 
tion. Next, the answer sheets were passed out, and the 
remainder of the instructions, which was the same for 
all Ss, was played. Two sample slides were projected at 
the automatic setting of 5.0 sec. to provide an example 
of the task and the pacing. The test series was then 
begun. The interval of time from the end of the observa- 
tion period to the beginning of the test series was 
approximately 8 min. As noted earlier, after each slide 
Ss marked a yes or no and a level of confidence on the 
answer sheet. Also, if an S knew the person in a par- 
ticular slide he so indicated on his answer sheet (a 
known response was given in less than 1% of the 
responses in the entire experiment). In order to make 
sure the test series slides and the numbers on the answer 
sheets corresponded, Æ called out the number of the 
slide coming up next when that slide designated either 
a new column on the answer sheet or the first slide 


after a tray change. p 
At the end of the session, Ss filled in the protocol 


sheet described above. 


Results 


The yes and no responses to the target pic- 
ture in the test series are referred to as hits 
and misses, respectively. Similarly, the yes 
and no responses to the 149 decoys are false 
alarms (FA) and correct rejections (CR). 

In order to take into account the confidence 
levels, two scales were developed and used in 
evaluating performance. The first scale was 
based on hits and misses and consisted of 1 for 
certain miss, 2 for probably miss, 3 for possibly 
miss, 4 for possibly hit, 5 for probably hit, 
and 6 for certain hit. The second scale was 
based on FAs and CRs and consisted of 1 for 
certain FA, 2 for probably FA, 3 for possibly 
FA, 4 for possibly CR, 5 for probably CR, 
and 6 for certain CR. The higher the score, the 
more accurate the response for either scale. 
The maximum hit-miss (H-M) score was 6, 
and the maximum FA-CR score (for 149 
decoys) was 894. 

The mean H-M score for the eight treatment 
conditions (collapsed across targets) is pre- 
sented in Table 1. An analysis of variance 
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TABLE 1 
Mean Hit-Miss Scores 


Position 40 Position 140 


Observation 
instructions | 
| 10sec. | 32 sec. | 10sec. | 32 sec. 
Prior knowledge | 3-6 4.9 3.0 3.0 
No knowledge $5: | 44 2.6 Ai 


| | 


indicated a significant main effect for target 
position (F = 10.80, df=1, 112, p< .005) 
with performance better when the target 
appeared in Position 40. The effect of target 
exposure duration was slightly less than the 
usually accepted .05 level (F = 3.30, df = 1, 
112, 05 < p <.10) with a higher level of 
performance at the longer exposure duration. 
For practical purposes, however, this effect 
may be worth noting. 

The mean FA-CR scores are presented in 
Table 2. The effect of observation instructions 
was significant (F = 3.92, df = 1, 112, p < .05) 
with better performance in the prior knowledge 
condition. The Observation Instructions 
X Target Exposure Duration X Target Posi- 
tion interaction was also significant (F = 8.10, 
df =1, 112, p<.005). The data underlying 
this interaction indicate that prior knowledge 
improves the FA-CR score only under those 
conditions of the other variables where 
the task is relatively easy (32-sec. exposure, 
Position 40). 

Two interactions involving the target vari- 
able were also significant: With the H-M score 
the Target Exposure Duration X Target Posi- 
tion X Target interaction (F = 5.06, df = 1, 
112, p < .05), and with the FA-CR score the 
Observation Instructions X Target Position 
X Target interaction (F = 4.72, df = 1, 112, 
p < .05). While it is not surprising that the 


TABLE 2 


MEAN FALSE-ALARM-CORRECT-REJECTION SCORES 


Position 40 Position 140 


| 
I 


Observation 
instructions | 7 
10 sec. 32sec. | 10sec. | 32 sec. 
Prior knowledge 818 | 871 844 812 
No knowledge 823 | 758 806 834 
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TABLE 3 


Hır RATE: PERCENTAGE OF SS Wuo RESPONDED 
“Yes” TO TARGET PICTURE 


| Position 140 


| 
| 


10 sec. | 32 sec. | 10sec. | 32 sec. 


| 
| Position 40 
Observation 


instructions 


si | 44 | 38 
6O | 31 | # 


Prior knowledge 56 | 
No knowledge 56 


results of a facial recognition experiment should 
be a function of the idiosyncratic features of 
the target, the data underlying these two 
interactions were not readily interpretable. 

While the above analyses take into account 
the confidence levels of the responses, a more 
direct measure (for purposes of quick inter- 
pretation) is simply the percentage of Ss who 
had hits (i-e., said yes when the target picture 
appeared in the test series). The overall hit 
rate was 52.2%. The hit rate for the eight 
conditions of observation instructions, target 
exposure duration, and target position are 
shown in Table 3. 

The total number of FAs was 1,672 or 
approximately 9% of the decoys. There was a 
tendency (but only a tendency, since the effect 
did not approach significance) for FAs to be 
correlated with hits (i.e., those Ss who had hits 
tended to have more FAs). Only four Ss had 
perfect H-M and FA-CR scores. 

Although no attempt was made to assign 
male and female Ss equally to the various 
treatment conditions, Ss of each sex were 
fairly evenly distributed across the conditions. 
In order to get a rough idea of possible S sex 
differences, a simple £ test was carried out on 
the H-M scores. The difference just missed 
being significant at the .05 level, {= 1.62, 
df = 112, 05 < p < 10. The actual hit rates 
were 57% and 45% for women and men, 
respectively. 

The protocols were analyzed to determine 
which facial feature Ss said they used in the 
recognition process. A frequency count was 
made of the number of times various features 
were mentioned by the different Ss (i.e., the 
number of different S: 


c s who mentioned the 
feature). The results of the frequency distribu- 


tion, collapsed across all experimental factors, 
were general structure, 73; eyes, 66; nose, 53; 
2 2 l 


skin, 42; mouth, 33; lips-chin, 31; hair, 28; 
and ears, 19. 


Discussion 


The effects of target exposure time (which 
was near significance at the .05 level) and target 
position were as expected. The longer time to 
study the four candid-position slides of the 
target probably leads to a better representation 
in memory that in turn results in better 
recognition. This finding, incidentally, is con- 
sistent with results reported by Leckart (1966) 
and Leckart, Keeling, and Bakan (1966). This 
improved memory representation may take 
several forms such as a stronger memory for 
particular features or a greater variety of 
features. 

The effect of target position is certainly cone 
sistent with most of the psychological Jireratara 
on memory. The present study does not, a 
course, separate the two most likely explana- 
tions for the effect, namely, decay due to the 
passage of time and interference due to more 
intervening items between the initial espot 
to the target and the appearance of the targ 
in the sequence. yai 

Although the observation instructions, sdge 
able, prior knowledge or no know’ zu 
affected the FA-CR score, it did not signi : 
cantly influence the hit rate. The lack of he 
effect on hit rate may simply be due to $ 
fact that the intent of this experimenti i 
manipulation was not achieved, The laborato!) 


ading 
effect on the H-M score (which should be the 


interpret. The significant Observation Instore 
lions X Target Exposure Duration X ca 
Position interaction seems to indicate that pa 
knowledge is useful only if the exposure © in- 
tion is long enough (32 sec.) to acquire more. 
formation about the target and if the a 
position (Position 40) is early enough t° : 


: ; ‘ HE cJeat 
the information. Again, however, it is not R 


why the effect should be reflected in the rA- 
score and not the H-Miscore. 


urar 


RECOGNITION oF HUMAN FACES 


EXPERIMENT IT 


‘The second experiment was designed to ex- 
plore the effects of pose position on facial 
recognition performance. Pose position refers 
to the view of the person in the test-series 
photographs. 


Method 


Subjects. The Ss were 128 students from the same 
introductory psychology course described in Exp. I. 
There were 55 males and 73 females. 

Design. A 4 X 4 factorial design was employed with 
both factors manipulated as between-S variables. The 
first factor, pose position, represented the view of the 
faces in the test series slides. The four levels were front, 
profile (left side), right portrait, and left portrait. The 
portrait positions were taken with the camera at an 
angle of 45° from the front. The second variable, target, 
simply consisted of four different targets, all 17 or 18 yr. 
old. There were eight Ss in cach of the 16 treatment 
conditions. 

Procedure. The task (with prior-knowledge instruc- 
tions), materials, and apparatus were essentially the 
same as Exp. I. After being told the nature of the task, 
Ss were shown four candid pictures of one of the targets 
for a total time of 32 sec. Answer sheets were handed 
out and the remaining instructions presented. The Ss 
then saw the 150-slide sequence. The slides were drawn 
from the same library and presented at the same rate 
described in Exp. I. Of course, in one condition all 150 
views were front bust, in another condition all were 
profile, and in the other two conditions views were left 
portrait or right portrait. The target’s picture appeared 
only once, in the Position 104. Two of the targets were 
the same as those used in Exp. I. A protocol was ob- 
tained at the end of the test series. 


Resulls 


The H-M and FA-CR scores were again 
calculated. The mean scores, are presented in 
Table 4. Analyses of variance with these two 
scores revealed no significant effects for pose 
position, target, or their interaction. Looking 
at the hit rates, the percentage of Ss who said 
yes when the target picture appeared was: 
front, 53%; profile, 33%; left portrait, 59%; 


and right portrait, 41%. The overall hit rate 
was 48%. 

A t test on H-M scores (collapsed across pose 
position and target) was carried out to compare 
performance of male and female Ss. The dif- 
ference was significant (¢ = 2.5, df = 126, 
p < 05) with males performing better than 
females; means were 3.7 and 2.8 for males and 
females, respectively. The hit rates were 56% 
and 42%, respectively. It should be noted that 


481 
TABLE 4 
Mean H-M anp FA-CR SCORES FOR 
Various Pose POSITIONS 
f 

Pose position H-M FA-CR 
Front 3.6 844 
Profile 29 836 
Left portrait 3.6 827 

2.8 | 853 


Right portrait | 


Note,—H-M = hit-miss, FA-CR = false alarm-correct re- 
jection. 


this difference is in the opposite direction of 
Exp. I, which was not quite significant at the 
.05 level. 

A frequency count was made on the number 
of times different facial features were men- 
tioned in the protocols. The distribution results 
were: eyes, 87; nose, 77; lips-chin, 54; general 
structure, 51; hair, 43; skin, 41; mouth, 25; 
and ears, 12. 

It is also possible to make a comparison be- 
tween Exp. I and Exp. I with respect to target 
position in the test series. Taking the front 
position and appropriate two targets from 
Exp. I and the 32-sec. exposure time and prior- 
knowledge conditions from Exp. 1, there were 
three conditions that were alike except for the 
position of the target picture in the test series. 
The three were Positions 40, 104, and 140, and 
the hit rates were 75%, 53%, and 41%, 
respectively. 


Discussion 


When this experiment was originally con- 
ceived, it was anticipated that the portrait 
positions might lead to better recognition per- 
formance than either the front or profile views. 
The reasoning was simple; in the portrait pose 
a combination of features could be observed 
that was not available in the front or profile 
poses. It was further speculated that the front 
pose would be better than the profile for the 
same reason. Obviously, the data do not sup- 
port the first hypothesis, although the hit rates 
do indicate that recognition is poorer with the 
profile pose (not at the .05 significance level, 
however). It is possible, of course, to suggest 
reasons why the different poses do not produce 
differences in recognition performance. It may 
be that Ss use different facial features with the 
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different pose positions, but they may be ie 
strained by some overall limitation on ie 
much information they can handle. One = 
of support for this reasoning is that in S 
protocols different features were mention 
with different frequencies in the saone ff 
conditions. For example, nose was mentione 
more frequently in the profile conditions. 


EXPERIMENT Il 


Experiment III was a straightforward study 
in which color slides were compared to black 
and white (B & W) slides. It seemed reasonable 
to speculate that the identification process 
would be enhanced by the use of color as co- 
pared to B & W, because color slides provide a 
dimension of information nonexistent in B & W. 


Method 


Subjects. The Ss were 32 students at the State 
University of New York at Buffalo. Each S was paid 
$2.00 for participating. There were 17 males and 15 
females. ` 

Procedure. With two exceptions, the procedure was 
the same as Exp. I and II. Instead of looking at four 
candid-position slides, Ss exposure to the target con- 
sisted of the target person entering the room (live) and 
handing out the answer sheets, Prior to observing the 
target, Ss were instructed as to the nature of the task 
(i.e., they were told that they would later be 
to identify the target in a series of pictures). 
stayed in the room for approximately 30 sec. 
difference was that in this stud: 
appeared twice in the 150-item t 
40 and 140. All test series slide 
length. 

The type of photograph variable (color or B & W) 
was manipulated between-Ss, while the target position 
(Position 40 or 140) was within Ss. There were two 
targets (one with glasses and one without); this variable 
was also manipulated between Ss. 

The color and B & W slides were exactly the same. 
This identity was accomplished when the 
library of slides was made by 

mounted side by side (one with 
B & W film) and taking the pictur 


attempting 
The target 
The second 
ly the target’s picture 
est series—in Positions 
s were front view, bust 


original 
using two cameras 
color and one with 
es simultaneously. 


Results 


the same, 4.9. A cl 


e oser look at the target posi- 
tion variable reve 


aled that most Ss gave the 
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same response when the target goatee 
second time (Position 140) as they had ae 
on its first occurrence. Only six Ss responded 
differently to the two occurrences of the target, 
and only one of these differences was more miar 
one point on the H-M scale (it was a differenc 

2 
p- analysis of the FA-CR scores also orni 
no significant effects. The means for color an 
B &W were 872 and 856, respectively. ; 

The overall hit rate was 84%. ‘The hit my 
with color and B&W were 87% and 81%, 
respectively. A comparison can be drawn ber 
tween the hit rates in this study when n 
target picture was in Position 40 (84%) an€ 
the hit rate in Exp. I in the prior-knowledge 
condition, Position 40, and the 32-sec. exposure- 
duration condition (81%). 

An analysis was carried out comparing H-M 
scores for male and female Ss. The means were 
males, 4.7 and females, 5.2, a difference that 
did not approach significance. 7 

As in Exp. I and II, the protocols bisa 
analyzed to determine which facial features 
were most frequently mentioned. The soe 
and their frequencies were: eyes, 24; hair, cd 
nose, 11; general structure, 10; skin, 8i p 
mouth, 8. There were no obvious agen 
in the feature frequencies between the € 
and B & W conditions. 


Discussion 


If color pictures do contain more information 
than B & W pictures (and they surely must), 
this difference in information does not lead t° 
better recognition performance,’ It ie 
reasonable to speculate that the failure to fin’ 
a difference, as with Pose position in Exp. *"? 
can be attributed to the information processing 
limitations of Ss, (They can only deal with @ 
certain amount of information in a give” 
period of time.) 

The almost identical re 
occurrences of the ta: 
in that once S 
does not make 
hand, if 5 misse 
rence, he is not 


sponses to the tw? 
rget picture is interesting 
identifies the target picture, = 
a mistake later. On the othe! 
s the target on the first ocCUl™ 
likely to recognize it later- 


*The color versus B & W slide variable has ve 
manipulated as part of another experiment and TeP 
cated the no-difference finding of this study. 


RECOGNITION OF HUMAN FACES 


GENERAL DISCUSSION 


tt is interesting to note that even in the most 
ideal conditions of the present experiments, 
the hit rates (81% and 84% in Exp. I and III, 
respectively) were lower than the 90% and 
higher rates previously reported. Granted, the 
task was different from some of the earlier 
studies where Ss were choosing between 2 pair 
of pictures instead of making an absolute 
judgment on individual pictures. Although, in 
the Nickerson (1965) experiment, a similar 
absolute judgment was made with higher 
performance results. However, Nickerson used 
pictures of “things,” which indicates that un- 
familiar faces may be more difficult to recognize 
than other types of visual stimuli. Taking into 
account these procedural differences the 
earlier studies may be less relevant to the 
criminal identification problem than previously 
thought. 

Two of the findings in the present experi- 
ments do seem reasonably clear. The more 
pictures an S (or witness) looks at before seeing 
the target picture, the less likely he is to 
recognize the target. Also, the better the 
opportunity to study the target, the greater 
the probability of a correct identification. 
While the significant interactions involving 
these variables indicate some qualifications 
regarding their effects, the findings are im- 
portant. If law enforcement agencies can find 
ways of organizing files so that the witness has 
fewer alternatives to search, the result could be 
improved identification rates. 

The fact that neither pose position nor type 
of photograph (color or B&W) affected 
performance was somewhat unexpected, since 
it was originally thought that the additional 
information in color slides and portrait poses 
would enhance recognition performance. As 
already mentioned, the negative findings with 
these variables may be the result of Ss’ inability 
to make use of additional information. In any 
case, the outcome suggests that the present 
B&W, front and profile views used by law 
enforcement agencies need not be changed to 
color or a portrait pose (at least not for reasons 
of improving identification rates). 
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The conflicting results of S sex in these ex- 
periments is currently a mystery. Perhaps 
further research will define which sex, if either, 
makes the better witness. 

The facial features mentioned in the pro- 
tocols were reasonably consistent across the 
three experiments. Although these feature data 
are interesting, they should only be considered 
tentative or exploratory. The question of which 
features Ss use in the identification process is 
an important issue, however, and deserves 
more extensive and elaborate investigation. 

Finally, a comment on the generality of the 
results seems in order. The null finding with 
regard to target differences in Exp. IL would 
seem to argue that the results do apply to some 
portion of the population. Most likely, how- 
ever, the finding depends on the fact that the 
targets and decoys represent a sampling from 
the middle range with respect to general 
appearance and specific physiognomic traits. 
Surely, Ss can be expected to recognize any 
highly distinctive face. However, in the actual 
identification of criminals it is the non- 
distinctive face that presents the problem, and 
these studies were planned accordingly. 
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J ILIATION, AND RESEARCH 
N IN AUTHORSHIP, AFF , ; 
ia ces PATTERNS OF JOURNAL OF APPLIED 
PSYCHOLOGY ARTICLES, 1917-1969 


GARY N. CHAISON ! 


State University of New York at Buffalo 


i i i Applied Psychology was made in 
ysis of articles from the Journal oj Appl g i 
aa egos of multiple authorship, funding sources cited by neg 
a thor affiliations. The sample of 729 articles consisted of every fi : 
oN epee Re fa in the Journal from 1917 to 1969. Results indicate trends 
a 


toward increased multiple authorship, more extensive and diverse funding, 
changes in the nature of author affiliations. 


The scientific journal as a primary method 
of communication between researchers ma 
provide a source of information on identifiable 
trends in the research process as well as the 
changing attributes of researchers. The pur- 
pose of this article is to examine some author 
and funding characteristics over a long time 
span for a particular journal, the Journal of 
Applied Psychology. Specifically, changes in 
authorship and coauthorship patterns, sources 
of funding for the research reported, and in- 
stitutional affiliation of the researchers were 
examined. 


The study of psychological journals seems 
to have received considerable research atten- 
tion. Walsh, Feeney, and Resnick (1969) in- 


vestigated the graduate school origins of 
authors of articles fr 


om a psychological jour- 
nal, while Smith (1958) and Mitchell (1961) 
focused upon the increasing trend toward 
multiple authorship, Xhignesse and Osgood 
(1967) studied the use of biographical cita- 
tion as an indication of the nature of the com- 
munication network among psychological re- 
searchers. Woods (1961) studied the APA 
divisional membership of article authors, 
while author instit 


utional affiliations were ex- 
amined by Goo 


dstein (1963), Golightly, 
Byrne, and Capaldi (1963), and Bohn (1966). 


The preceding studies have been strongly 
empirical in nature, basically extracting data 
from sampled articles or volumes for the se- 
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and 


lected variables. In contrast to this orienta- 
tion, Darley (1968) examined the Journal of 
Applied Psychology and presented an histori- 
cal overview of the development of the 
Journal as well as the field of applied psy- 
chology, Writing his article to note the fifti- 
eth anniversary of the founding of the Jons 
nal, Darley read every fifth volume in its en- 
tirety, recording the early emergence of spei 
ingly contemporary methodology and _ ia 
often well-deserved fall of certain tests iad 
obscurity. In his conclusion he suggests ak 
applied psychologists have a — sles 
tude” toward their own history ( eile 
1968, p. 9). The purpose of the presei a 
is to provide some additional historica Pad 
spectives on the sources of authorship & 
research reported in the Journal. 


METHOD 


The study uses a sample of every fifth article, 
selected after a random start, from the first 53 vol- 
umes of the Journal of Applied Psychology. This 
covers Volumes 1-53 for the years 1917-1969. ES- 


says, comments, and letters have been excluded fro™ 
the sample, 


Designating time as the independent variable, a 
were tabulated for the three following dependen 
variables: (a) number of authors per article; b) 
Source of funding, if any, cited by the author(s) ° 
theiarticless (e) type of institniionn! ampation e 
by the authors of the articles, 5 

articles were sampled, including 
the works of 824 individual authors, The data were 
but then combined into periods of 
an initial time period of 3 yr. for 1917- 
was done because of the small number of 


Š yr. (after 
1919). This 


JouRNAL ARTICLES IN 


articles sampled during some years (often less than 
10 articles per year). 


RESULTS 
Number of Authors per Article 


Figure 1 indicates a trend toward collabora- 
tion in applied psychological research. Dur- 
ing the latest time period almost half of the 
sampled articles were written by two or more 
persons.? The causes of this noticeable trend 
may be numerous. Smith (1958) has sug- 
gested that factors affecting collaboration in 
published research might be: (a) the type of 
problem entailed in the research, that is, ap- 
plied research usually involves group efforts 
as opposed to more individualistic theoretical 
works; (b) method of data collection and 
analysis, that is, at times the burdensomeness 
of the data-collection task or the method of 
analysis applied by the researcher may re- 
quire collaboration; (c) affiliations of authors, 
that is, authors in large institutions may be 
more likely to collaborate because of the near- 
ness of colleagues with similar research in- 
terests; and (d) support by government 
funds or foundations, that is, the use of proj- 
ect or team organization—often necessary to 
utilize substantial funding—may affect the 
extent of collaboration. 

Furthermore, Mitchell (1961) has sug- 
gested that there may be a trend, reinforced 
by psychology’s code of ethics, toward giving 
students and assistants credit for their work 
in published form. 


Types of Funding Cited by Authors 


The emergence of funding for research 
from both governmental and private sources 
may be seen in Figure 2. The percentage of 
articles in which there is no citation of 
funding source has rapidly and steadily de- 
creased since the early 1940s. However, the 
outstanding feature of this figure is the large 
proportion of research in the Journal for 
which no financial support is cited. Despite 
the large outlay of government and private 
SS 

2 More complete, tabular breakdowns on which 


this and subsequent figures are based may be ob- 
tained from the author, (See address in Footnote 1.) 
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Fıc. 1. Multiple authorship of sampled articles. 


support, most of the published work in the 
Journal does not receive outside support. The 
curves show the increase Over the years in 
both DOD and HEW agency support and re- 
fect the substantial fluctuations in govern- 
ment funding policies. 


Author Institutional A filiation 


Figure 3 indicates the types of institu- 
tional affiliations of the authors of the articles. 
In the sample of 729 articles, 1,046 author 
affiliations were recorded because of multiple 
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Fic. 3. Institutional affiliations of authors of 
sampled articles. 


authorship. However, because some author’s 
works appeared more than once in the sample, 
the number of individual authors was actually 
864. In the tabulation, the affiliations of all 
authors of sampled articles were recorded, 
hence, the total of 1,046 affiliations, 

As Figure 3 illustrates, the proportion of 
articles from academic institutions exceeded 
80% in the early 30s. The percentage in the 
academic category has remained relatively 
stable at around 60%-70% since 1940. Other 
types of affiliations cited have fluctuated more 
with some trend of increasing contributions 
from industrial affiliations, This source 
reached a high point during the 1960-1964 
period when more than one-fifth of the authors 
cited industrial affiliations. This category in- 
cludes industry as well as private research in- 
stitutions, both nonprofit and profit. Figure 3 
also indicates that governmental nonmilitary 
affiliations of authors have declined since the 
peak in the early 1940s, while there is an in- 
crease in authors citing military affiliations 
since those years, at least until the mid 60s. 
It should be pointed out that much of the 
government-supported research reflected 
Figure 2 is carried out under grants and co! 
tracts with individuals affili 
research and academic institutions, Hence, the 
differences between Figures 2 and 3. 

The Journal of Applied Psychology has had 
only six different editorial administrations in 


in 
n- 
ated with private 


Gary N 


. CHAISON 


its long history. Changes in Editor — = 
1917, 1921, 1943, 1955, 1961, and 1971. i 
would be difficult to relate these trends an 
fluctuations to changes in editorial policies. 


CONCLUSION 


The preceding descriptive analysis of arti- 
cles ara pea of Applied Psychology 
from 1917 to 1969 indicates the following 
trends: (@) a greater tendency toward collab- 
oration in applied psychological research, (b) 
increased diversity in the sources of funding 
cited by article authors. There still appears fo 
be, however, a small amount of funded me 
search relative to the amount of unsupporte 
research reported. (c) An increased diversity 
in types of author affiliations as well as a 
greater role for private organizations as @ 
source of applied psychological research were 
also apparent. A large majority of authors, 
however, still come from academic institutions. 
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SHORT NOTES 


THE VALIDITY OF IMPORTANCE * 


MILTON R. BLOOD * 


University of California at Berkeley 


Previous investigators have questioned the usefulness of weighting by impor- 
tance when combining measures of satisfaction with job aspects into a measure 


of overall satisfaction. Data are presen 


ted that call into question the validity 


of the information that is used for importance weighting. The model of cogni- 
tive behavior that relates the importance of job aspects to the evaluation of 


overall job attitude should be revised. 


The importance of various job aspects to workers 
has been considered by some investigators to be an 
influential variable in the determination of overall 
job satisfaction (Friedlander, 1965; Glennon, Owens, 
Smith, & Albright, 1960; Weiss, Dawis, England, & 
Lofquist, 1964a, 1964b). On the other hand, others 
have shown that weighting satisfaction components 
by their importance ratings did not increase the fit 
of component measurements to measurements of 
overall job satisfaction (Ewen, 1967; Schafer, 1953) 
or to turnover (Mikes & Hulin, 1968). 

The general model that has been used to relate 
importance measures to satisfaction measures is So= 
= ISa, where So = overall job satisfaction, Za = im- 
portance of a job aspect, and Sa = satisfaction with 
a job aspect. Such a model allows predictions of 
overall satisfaction to be based on the combination 
of component satisfactions with the components 
being idiosyncratically weighted by individual work- 
ers. The general advantage of using such a model has 
been discussed by Vroom (1964, PP- 163-165). 

If the information given by workers in ratings of 
the importance of various job aspects is the same as 
that used by the workers in producing a rating of 
overall satisfaction, then that information should be 
discernible in the relationships between satisfaction 
with the job aspects and overall job satisfaction. The 
aspect a worker says is most important to him should 
be the aspect most highly related to his overall job 
satisfaction; that aspect ranked second in importance 
should be second most related, and so on to the 
aspect with the least importance, which should have 


the smallest relationship with his overall job satis- 


faction. 

Method. The Ss for this study were 380 clerical 
workers from the home offices of a large corporation. 
Among the measures that Ss completed as part of a 
larger study were the Job Descriptive Index (JDIs 
Smith, Kendall, & Hulin, 1969), which measures satis- 
faction with five aspects of the job (work itself, co- 


ieee eg 
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workers, supervision, pay, and promotions); @ 
measure of overall job satisfaction on a Faces scale 
(Kunin, 1955) ; and a ranking of the importance of 
the five job aspects. 

For the entire group, the satisfaction scores on 
each of the five JDI job aspects were standardized. 
Then, for each S, the standardized aspect scores were 
reordered so that the first variable for each indi- 
vidual was the standardized satisfaction score on the 
aspect that he ranked as most important; the second 
variable was the standardized satisfaction score on 
the aspect that he ranked second most important; 
and so on to the last of the aspect variables that 
was the standardized satisfaction score on the aspect 
ranked fifth in importance. The satisfaction variables 
from the JDI were thus no longer designated by 
content areas. Instead, the variables were defined by 
the idiosyncratically assigned importance levels. For 
each S, the first variable was his standardized satis- 
faction score for the job aspect that he ranked as 
most important; his second variable was the stan- 
dardized satisfaction score for the job aspect that he 
ranked second in importance, etc., through all five 
job aspects. 

Very little distortion in scale units resulted from 
the reordering. Since the original scales were stan- 
dardized before the reordering, they all had a mean 
of O anda standard deviation of 1. The means of the 
reordered scales ranged from —.04 to +.04, and 
their standard deviations ranged from .98 to 1.03.8 

In determining the relative influence of multiple 
predictors, Darlington (1968) suggests consideration 
of three different sets of information: (a) the multi- 
ple-correlation beta weights; (b) the zero-order cor- 
relation coefficients; and (c) the “usefulness” of the 
predictors, that is “the amount that the squared 
multiple correlation would drop if the variable were 
removed .. . (p. 180).” All three indexes (of the 
relative influence of the aspect satisfaction measures) 
were computed with the reordered measures of satis- 
faction with job aspects as predictors and the measure 
of overall job satisfaction as the criterion. 


2 Equivalence of variance among the scales is an im- 
portant characteristic since a Jack of variance would reduce 
the correlation involving even a variable with high causal 
relevance. 
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Results. The multiple-correlation coefficients with 
the job aspect satisfaction variables (as reordered for 
importance) as predictors and overall job satisfac- 
tion as the criterion was .43 (p < 001). The veii 
weights of the measures of satisfaction with gon 
aspects in descending order from most ee 
least important were .16, .10, .16, .03, and .18. I e 
zero-order correlations of the job aspects with over- 
all job satisfaction in descending order from mat 
important to least important were 31, 2%, 30, i ee 
and .32. The measure of “usefulness suggested by 
Darlington (1968) is proportional to the squared 
partial correlations between each predictor and the 
criterion, holding all other predictors constant. The 
squared partial correlation coefficients for the aspect 
variables in descending order of importance were 
02, .01, .03, .00, and .03. For the sake of comparison, 
it is interesting to note that predicting overall satis- 
faction in this sample from the job aspect variables 
defined in the usual manner (by their content names) 
gave a multiple-correlation coefficient of .45 (p< 
.001), almost exactly the same as the reordered varia- 
bles. 

Discussion. If the importance rankings given by 
Ss are related to the actual importance of the aspects 
as they enter into Ss’ considerations in rating overall 
satisfaction, then the validity coefficients, beta 
weights, and usefulness indexes for the aspect varia- 
bles all should have been in descending order from 
most important (first variable) to least important 
(fifth variable). None of the indexes achieved the 
appropriate order. 

In Schafer’s (1953) sample, overall satisfaction had 
a significantly higher correlation with the aspect 
ranked idiosyncratically most important than with 
the aspect ranked least important (twelfth). The 
relationship was not regular, however, for Variables 
1-12, The rank-order correlation coefficient between 
importance and correlation with overall satisfaction 
was not significant. The variables ranked first, second, 
third, ninth, and eleventh in importance were signifi- 
cantly related to overall satisfaction, while the varia- 
bles ranked sixth and seventh had the lowest rela- 
tionships. Ewen (1967) found that the correlation 
between overall satisfaction and satisfaction with 
the most important aspect was significantly higher 
than the correlation of overall satisfaction and satis- 
faction with the least im ect in only one 
of three samples, 


five job 
measured by the JDI. (In all three ¢ con 


direction.) 
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cluded such dimensions, but the question of rele- 
vance can be answered empirically in the case of 
importance. We should consider any aspect of oe 
job as unimportant in the determination of overa 
satisfaction if satisfaction with that aspect does not 
correlate at some predetermined level with overall 
satisfaction. Such variables should be given a weight 
of zero (in effect dropping them from the sum of 
aspects considered). Use of measurements of impor- 
tance for differential weighting is not warranted 
since these measures are not related to the strength 
of the relationship between the aspects and overall 
satisfaction. A possible manner of operation is. to 
treat importance as a binary property with all “im- 
portant” aspects getting equal weight. The general 
formula, S,= 2 J,S,, would be retained, but the 
variable J, could take on only the value of O or 1. 

Though this is the same conclusion to which Ewen 
(1967) came (to use unit weights rather than weight- 
ing by importance), the present argument is based 
on an additional premise. Ewen argued that differen- 
tial weights did not make a difference (or, if you 
prefer, that “importance” is not important), The 
present argument adds the consideration that when 
making judgments about the importance of job as- 
pects, Ss provide information that has little relation- 
ship to the actual influence of those aspects on an 
overall evaluation of the job. It is unwarranted S 
expect people to behave in the methodical an 
orderly manner necessary to provide data that va 
support the model of behavior implied by the original 
properties of the formula. That conception of im 
portance is empirically invalid. 
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RELATIONSHIP OF INITIATING STRUCTURE AND JOB 
PERFORMANCE AS MODERATED BY CONSIDERATION 


ROBERT C. CUMMINS? 


Department of Business Administration, University of Texas at Arlington 


Many researchers have suggested that 
understand the relationship of leader 


additional variables are necessary to 
behavior and group performance. The 


present study investigates the hypothesis that the effect of initiating structure 


on the rated productivity and quality 


of work-group performance in an indus- 


trial manufacturing plant is moderated by considerate leader behavior (rated 


and seli-report). The results support 


terion measure but do not support it when producti 


The results are interpreted as being 


the hypothesis when quality is the cri- 
y is the criterion measure. 
consistent with Fleishman and Harris’ 


finding that grievances and turnover are less affected by initiation of structure 
when foremen are rated high on consideration. 


_Consideration and initiating structure, two of the 
dimensions of leadership behavior identified by the 
Ohio State University research programs on leader- 
ship (cf. Halpin & Winer, 1957), have received con- 
siderable attention in the literature. Despite the 
amount of research on these two variables, Korman 
(1966) concludes a literature review by stating 
that “very little is known as to how these vari- 
ables may predict work group performance and 
the conditions which affect such predictions [p- 
360].” Fleishman and Harris (1962) suggest that the 
failure to consider the interaction effects of con- 
sideration and initiating structure may account for 
the inconsistent research findings. The results of 
their study indicate that consideration moderates the 
effect of initiating structure on grievances and turn- 
over, More specifically, their data suggest that high- 
consideration foremen can increase structure with 
very little increase in grievances and no increase in 
turnover, whereas, structure is directly related to 
these indexes for low-consideration foremen. 

The purpose of the present note is to determine 
whether consideration moderates the effect of initi- 
ating structure when productivity and quality are 
the criterion measures. It is predicted that initiating 
structure will be more closely related to measures of 
productivity and quality for high-consideration fore- 
men as compared to low-consideration foremen- This 
hypothesis is based on the Fleishman and Harris 
(1962, pp. 53-54) reasoning that high-consideration 
foremen may establish a climate of mutual trust and 
that in such a climate, workers are more likely to 
accept (and implement) structure initiated by the 
leader. 

Method. The Ss were 133 foremen at a large 
manufacturing plant in the midwest employing about 
3,500 nonsalaried workers. The average work group 
size was 21 and supervised activities ranged from 
unskilled to highly skilled engineering and machining 
tasks. 
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The Ss were administered the Leadership Opinion 
Questionnaire (LOQ) (Fleishman, 1969) as part ofa 
large battery of tests aimed at providing information 
for manpower planning. The LOQ measures self- 
reported supervisory attitudes scored on considera- 
tion and structure. They were informed that this 
test information would not be available to their 
immediate superiors. 

Ratings of work-group productivity (overall pro- 
ductivity of the work group), quality (amount of 
scrap, rework, and errors kept to a minimum), 
leader’s initiation of structure (supervisor directs the 
group, plans and explains things in detail, figures out 
what to do in new situations), and consideration 
shown by the leader (supervisor tries to get along 
with his men, talks with them, shows respect for 
their ideas and feelings) were obtained. 

Each foreman was rated on 7-point Likert scales 
by at least four managers who were above the 
ratee in the organizational hierarchy and had 
worked with the ratee for a long period of time, 
usually several years. The ratings were administered 
to each rater separately in a face-to-face interview. 
This administration allowed the researcher to check 
each rater’s understanding of the dimensions and 
purpose of ratings. On the assumption that halo 
effect would be minimized, managers were instructed 
to rate all foremen on one scale before rating any 
foremen on the second scale. 

Results, The interrater reliability of the ratings 
was estimated by use of the Spearman-Brown 
prophecy formula correction of the split-half cor- 
relation. The split-half correlation was obtained by 
dividing the ratings concerning a particular foreman 
into two sets of ratings and averaging; these two 
averages were obtained for all foremen rated, and 
the correlation between the averages was computed 
for each scale. The interrater reliability coefficients 
for the four rated dimensions ranged from .59 to .63. 

The correlation between ratings of leader behavior 
and self-reported supervisory attitudes of the same 
dimensions were low; therefore, ratings and self- 
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TABLE 1 
ic i Y SD QUALITY 
YEE SCTURE AND RATED Propvctivity AND Ova . 
GOnREEATIONS Hanae Siac BY CONSIDERATION 
| 
High consideration | Low consideration High coia gration | Low qai 
Correlation (Rated) | (Rated) he Í = . 
6 
ivi 62 59 E 
Rating) r Productivity "04 -26 03, 
Sucre ôo r Productivity 594+ De E 
Structure (Rating) r Quality 07+ 36+ "32 
Structure (LOQ) r Quality | 


Note.—LOQ = Leadership Opinion Questionnaire. 
aN = 62, 
Ti. 


ions differ a e .05 level of confidence. 

se twi elations differ at the . í 
Bae kese tivo epean Ona A REE ae the OL level of cutidemse 
f these two correlations differ at the .01 level of confidence. 


; same behavioral dimensions 
ied bi Beg ses variables in the analysis. 

As can be noted in Table 1, the results support 
the hypothesis that rated initiation of structure 
would be more closely related to quality of perform- 
ance for high-considerate foremen than for low- 
consideration foremen; this is true where consideration 
is measured by either the LOQ or by management 
ratings of behavior. With respect to behavior ratings, 
high correlations showed that supervisors who also 
had higher structure had higher quality. There is 
also some indication in Table 1 that supervisors with 
attitudes low in consideration had quality go down 
when more structure was exerted. Neither measure of 
consideration moderates the relationship between 
structure and productivity. 

This pattern of results may be explained by exam- 
ining the criterion measures. In most nonsalaried 
jobs in the surveyed plant, productivity 
be influenced by conditions that are n 
related to leader-member interaction, F 
the production foreman’s relation with 


may influence the speed and/or quality 
nance on critical m; 


tivity, whereas, in 


are 


is likely to 
ot directly 
or example, 
maintenance 
of mainte- 
achines and thus affect produc- 
the opinion of many supervisors, 
quality of performance depends more on worker 


motivation and, therefore, may be influenced by 
considerate leader-member interaction. g 

If this interpretation is correct, then the finding 
that the Consideration X Structure interaction affects 
quality but not productivity appears to be con- 
sistent with the Fleishman and Harris (1962) finding 
that this interaction affects grievances and turnover, 
In this situation, quality, grievances, and turnover 
may all be related to the workers’ evaluation of his 
situation, whereas, achieving an acceptable level of 


productivity may be less dependent on Positive 
attitudes toward work, 


It is interesting to note that despite the small 
negative correlation between questionnaire results 
and ratings of consideration, both measurements act 
as moderators for initiation of structure. Thus, 
both considerate attitudes and considerate behavior 
make other leader behaviors more effective. 

A methodological note is in order, Although much 
effort was expended to reduce the possibility of 
halo, the ratings may be correlated partly because 
of method specificity rather than because of some 
underlying causal relationship. The present anes 
that compares correlations at two levels of a thir 
variable should not be influenced by halo. That ee 
there is no a priori reason to believe that halo has 
any effect on the differences between correlations. — o 

Conclusion. The structuring behavior of antim 
supervisors has a greater effect on rated quality 1 
performance when the supervisor believes he eee 
show consideration and/or is rated high On ton 
sideration. Neither attitudes endorsing congas the 
nor ratings of consideration are moderators © tiv- 
relation between initiating structure and produc Its 
ity. These conclusions seem consistent with rest 
of Fleishman and Harris (1962). 
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THE APPLICATION BLANK: 
HOW HONEST ARE THE RESPONSES? * 


IRWIN L. GOLDSTEIN ° 


University of Maryland 


‘An examination of the application blank responses provided by a perspective 
applicant and by the applicant’s previous employer indicated that there was 
substantial disagreement between the two sources. In two categories, duration 
of previous employment and previous salary earned, the two sources disagreed 
on 57% of the cases. In these categories, the typical applicant overestimated 
his duration and his previous salary. Also, there was substantial disagreement 
on the reason for leaving the previous job. The data appear to support the 


hypothesis that inaccurate information is likely to appear on the application 
blank as well as in the interview situation. 


Many recent studies (Fleishman & Berniger, 1960; 
Scott & Johnson, 1967) have demonstrated that the 
application blank can be used to successfully predict 
tenure on the job. Scott and Johnson have especially 
noted the feasibility of employing the application 
blank in low-skilled occupations. Recently, a firm that 
Was cooperating with the investigator on a research 
Project questioned the honesty of the responses given 
by the applicants applying for jobs as nurses aides. 
The honesty of the replies is an issue aside from the 
question of validity since it is possible to predict from 
most any set of data. In any case, a search of the 
literature did not disclose any studies that had col- 
lected data reflecting on the honesty of the applica- 
tion blank responses. 

Several studies, however, have questioned the accu- 
racy of data collected in the interview. A study by 
Keating, Paterson, and Stone (1950) found that the 
correlations between reports on wages, and duration 
of employment were above 90. Another study by 
Weiss and Dawis (1960) discovered poorer agreement 
for a sample of 91 physically handicapped persons. 
The information varied in accuracy from 0% invalid 
information for reports on sex to 55% on the ques- 
tions of receiving assistance. Essentially, invalid in- 
formation was found for substantial numbers of 
persons (over 20% of the sample) on questions re- 
lated to previous education, age at disablement, re- 
ceived assistance, job title, pay, and length of em- 
ployment, 

_ Blum and Naylor (1968) have suggested that the 
interview results in a face-to-face pressure that might 
cause a respondent to stretch the truth. In addition, 
Prospective employees might be more likely to re- 
spond honestly on the application blank when their 
responses are actually recorded in their own hand- 
writing. The present study examined the data re- 


a from the 
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corded on the application blank for a sample of 111 
applicants. 

Method. The original sample consisted of 173 
persons with the final sample equalling 111 persons 
for whom matching application blank data and previ- 
ous employer responses were available. The applica- 
tion blanks were filled out by persons applying for 
positions as nurses aides in a group of nursing homes 
owned by the same corporation in a large east coast 
city. The blanks were collected during the period 
from September 1, 1968, to June 1, 1969. A form was 
filled out for each of these individuals with the fol- 
lowing items listed: name, address, dates employed 
by last employer, and position previously held. The 
information for these items was obtained from the 
completed application blank. This form was sent to 
the employer listed by the applicant as the last 
place they were employed. The following informa- 
tion was requested from the previous employer: 
marital status, position held, dates of employment, 
salary of employee, and why the employee left. The 
previous employer was requested to draw a line 
through the item if he did not have the requested 
information. He was also asked to return the ques- 
tionnaire and indicate “not employed by us” if he 
had never employed the applicant, The previous em- 
ployers were assured of the confidentiality of the 
individual pieces of information as well as their firms’ 
anonymity. They were also assured that their infor- 
mation would not affect the employment of the indi- 
viduals concerned. 

Results and discussion. The data generally indi- 
cate a substantial number of discrepancies between 
the employee’s application blank responses and the 
responses of the previous employer. The interpreta- 
tion of these data do require a note of caution. In 
this study, two sets of responses are being com- 
pared, and the true state of the world is not avail- 
able. It is possible that some of the previous em- 
ployer’s responses are in error. As the discussion 
proceeds, however, it will become apparent from 
some of the directions of the differences that the most 
parsimonious explanation seems to point to a bias on 
the part of the applicant. 
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For the 111 employers who responded to the sur- 
vey, 15% (17 respondents) indicated that the appli- 
cant had never worked for them. Married females 
who may have been previously employed under their 
maiden name were excluded from this count. Those 
applicants (N =94) whom the employer indicated 
had worked for their firm were included in the rest 
of the analyses. Information that was left blank by 
the applicant or the employer (e.g., in some cases 
salary disclosure was against company policy) re- 
duced the V for the various categories. Also, in some 
cases, the respondent used completely different scales 
(ie., pay per week versus pay per hour without the 
number of hours per week being stated). In these 
cases, or in any other case in which the responses 
could not be reconciled, the data were not used in 
these analyses. This was done in order to avoid, wher- 
ever possible, reported differences where none might 
exist. 

The largest amount of agreement between the 
applicant and the previous employer occurred on the 
topic of the previous position held. There wi 
five clear discrepancies, Considering that the 
was applying for a position as an aide in 
home, which is an unskilled low 


ere only 
applicant 


The categories ancies were 
largest were the employment 
‘arned. As far as duration of 


7% disagreement. Since this 
salary 


In the 


» the applicant’s re- 


ob by one year. For 
erage overestimation 
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was 16 months. An inspection of the data indicate 
that the average overestimation was not raised sub- 
stantially by a few applicants but, rather, occurred 
across most of the applicants. g 

It was difficult to analyze the degree of estimation 
data for salaries because the types of differences were 
more complicated. In the 43 cases in which there was 
disagreement, there were 18 instances where the ap- 
Plicant and previous employer disagreed on the mode 
of payment. Most often, the previous employer indi- 
cated that the applicant worked on a piece-rate 
basis while the employees indicated they worked on 
a dollar-per-hour basis. For the 25 instances in which 
the data could be compared, the applicant over- 
stated his salary in 18 instances (z = 4.56, p < .05, 
two-tailed test). 

In summary, there were 
discrepancies between the 
those answers provided by 
It is interesting to note that 
applicants. Of the 94 employees that were reported 
by the previous employer as having been employed, 
31 sets of data were in complete agreement. Thus, 
the remaining 63 employees disagreed with the 
Previous employers on two of the four categories 
(previous position held, duration of employment, 
Previous salary earned, reason for leaving) that 
Were examined. 

These data suggest th 
likely to appear on the 
in the interview, The 
that Blum and 


a substantial number of 
applicant’s responses and 
the previous employers. 
this was not true of all 


at inaccurate information be 
application blank as well as 
same face-to-face pressure 
Naylor (1968) discuss in ph 
analysis of the interview situation probably does 
exist in the context of the application blank. Le: 
Pressure to obtain a job, however, may be a “i 
demanding jn both situations, The interview ¢@ 5 
indicate that two of the responses with the mos 
inaccurate information are duration of previous 
employment and Previous salary earned. The data In 


this note on “pplication blanks indicate that the 
largest discrepancies = 
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FACTORS RELATED TO ACCEPTANCE AND REFUSAL OF 
EMPLOYMENT AMONG TEMPORARY HELP APPLICANTS? 


MARTIN J. GANNON axb URI BRAININ 


Behavioral Science Division, Department of Business, University of Maryland 


Although the temporary help service industry has been described historically, 
there are no published studies that have tested specific hypotheses relative to 
the type of individual who is attracted to this kind of work. The present study 
analyzed two types of applicants, 299 in all, who applied to a temporary help 
firm during 1968. Although both types of applicants were interviewed and 
were hired, one group refused while the other accepted employment. Of 18 
personal characteristics, 5 were predictive of acceptance, namely, type of job, 
number of skills, availability for work in terms of days per week, expressed 
interest in a permanent job as a temporary worker, and past experience in 
terms of months. The analysis delineates several disadvantages in the use of 
temporary workers, especially when the labor market tightens. 


Temporary help firms, which hire individuals and 
then send them out to do temporary work for any 
company that has need for such intermittent em- 
ployment, are crucial in any country that has a 
chronic labor shortage or a tight labor market. In 
the United States the total volume of business in 
this industry is approximately $1.5 billion per year 
and, recently, has been increasing at the rate of 20% 
per year (Sloane, 1970). Cosgrove (1970) states that 
2% of the workers in the United States are now in 
the temporary help field. 

In light of this growth and importance, it is in- 
teresting to note that there have been few studies 
of this industry. In fact, a review of the literature 
suggests that only one major study has been pub- 
lished, namely, Moore’s (1965) historical description 
of the development of the industry in the United 
States. 

The present study is an analysis of some of the 
personal characteristics that are related to acceptance 
and refusal of employment among temporary help 
applicants. Specifically, the data pertain to the per- 
sonal characteristics of two groups of temporary help 
applicants who were interviewed and hired by a 
temporary help firm during 1968. The first group 
consisted of applicants who, although they eventually 
left the firm, accepted employment during 1968. The 
second group was composed of applicants who were 
hired during 1968 but refused employment when the 
firm was able to place them. 

There are at least two relevant reasons for such 
an analysis. First, it should be possible to reduce 
interviewing costs by pinpointing personal character- 
istics that are related to refusals of offers of employ- 
ment. If such characteristics can be delineated, the 
firm can develop a short questionnaire and screen out 
those applicants not intrinsically interested in this 
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employment, so as to eliminate the waste of valuable 
interviewing time. This procedure would be espe- 
cially helpful to firms in this industry since they 
are bombarded with applicants. For example, Kelly 
Girls alone recruits 200,000 individuals each year 
(Cosgrove, 1970). 

Second, the analysis of refusals should help in the 
evaluation of the directions and strategies that 
temporary help firms should pursue, especially since 
the number of refusals will probably increase as the 
labor market tightens and permanent jobs become 
available. 

Method. The study was undertaken in a temporary 
help service company that has 21 offices located in 
New York, Connecticut, New Jersey, and Pennsyl- 
vania. Only the central office, however, which is 
located in New York City and is responsible for 
hiring within this locality, was used as the source of 
data in order to control for variations in labor 
market conditions. 

The sample consisted of 299 individuals. Of the 299 
individuals, 149 were temporary help applicants who 
accepted a job during 1968. The remaining 150 indi- 
viduals were applicants who were interviewed during 
1968, hired, but refused acceptance of a job. Both 
sets of applicants completed the employment inter- 
view. 

Eighteen variables were examined relative to the 
two types of applicants, namely: age; years of 
school; sex; marital status; number of children ; 
days available per week; expressed interest in a 
permanent job as a temporary employee; past expe- 
rience in months; numbers of previous jobs; number 
of skills; type of job; current attendance at school; 
and the interviewer’s perceptions as to the applicant’s 
alertness, appearance, manner, attitude, voice, and 
grooming. The interviewer's perceptions were noted 
in terms of a 14 (excellent-poor) scale. Chi-square 
was employed as the methodological tool of analysis. 

Results and discussion. Of the 18 variables, 5 
were predictive of acceptance and refusal among 
applicants who were offered jobs by the temporary 
help firm. These were: days available per week, ex- 
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TABLE 1 


Nt EFUSAL OF JOB 
PREDICTORS OF ACCEPTANCE AND REFUSAL OF 
OFFER AMONG TEMPORARY HELP APPLICANTS 


7 % 
Variable cepted | fused | N 2 
job job 
offer | offer 

Type of job | 

General secretary (lowest | arol ia | z 

skill level) PASA 

General clerk a8 oF |e 

Typist A é x à 

Sea (highest 31.8 | 68.2 22| 11.65** 
poses 38.8 | 61.2 | 49 

Mf 51.0 | 49.0 | 96 

2 41.1 | 58.9 | 73 

3 54.3 | 45.7 | 46 

4and above 74.3 | 25.7 | 35| 13.43** 
Daysavailable per week sss lasla 

ae onal 47.6 | 52.4 |252| 4.88* 
Interested in eana EDE jobas | 

oyee 

a 56.2 | 43.7 | 144 

No 43.1 | 56.9 | 153] 5.10% 
Past experience (in mo.) 

60 and under 

61-80 

81-100 

Over 100 


Note.—N = 299 (N in each subtable does not total ex- 
actly 299 due to incomplete information.) 
* p< OS. 


at 4 201. 
* pZ 001. 


pressed interest in a permanent job with the firm, 
past experience in months prior to the employment 
interview, number of skills, and type of job. 

As shown in Table 1, two measures of skill level 
were associated with the rate of acceptance, namely, 
type of job and number of skills. In both cases, an 
increase in skill level was related to a decline in the 
rate of acceptance. This finding is important in view 
of the fact that the temporary 
is currently expanding rapidly into areas of employ- 
ment that require skilled and professional workers 
(Sloane, 1970). As the labor market tightens, skilled 
employees will generally find permanent jobs quickly, 
Hence, temporary help firms that specialize in skilled 
and professional services may very well find them- 
selves in a position where they h 
applicants who desire placement. 

Another consideration in the acceptance of a job 
offer concerns the applicant’s availability for place- 
ment. The applicants who indicated that they were 
available for placement more than 3 days a week 
tended to have a low rate of acceptance. Such a find- 
ing suggests that temporary help firms have difficulty 
in the procurement of workers who are available on 
a full-time basis. In view of the fact that business 
firms frequently desire and need temporary employ- 
ces on a full-time basis for a short period of time, it 
is plausible that they may become disenchanted with 
temporary help agencies as sourci 


ancies | es of effective place- 
ments. This proposition is supported by the fact that 
agencies that place permanent workers generally are 


less effective than referrals by high schools, current 
employees, reemployment, and even walk-ins (Gan- 


help service industry 


ave few, if any, 
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non, 1971). Further support for this proposition is 
noted in the fact that, as the applicant indicated his 
lack of desire for a permanent job as a temporary 
worker, his rate of refusal increased. 

Finally, the type of individual who is attracted to 
the temporary help service industry is one who has 
the least or the most past experience measured in 
terms of months. Of the 22 applicants who had over 
100 months’ experience, not one refused the offer of 
employment. This finding suggests that the tempo- 
rary help applicant who has a long history of past 
experience is serious in his desire to seek work as a 
temporary worker. 

Conclusion. The analysis indicated that temporary 
help agencies may have difficulties in finding appli- 
cants who are available on a full-time basis, espe- 
cially when the labor market tightens and/or the 
skill level is high. 

The typical characterization of the temporary help 
industry has been very positive in the sense that its 
advantages are highlighted (Anonymous, 1963; 
Moore, 1965; Sloane, 1970). The present analysis, 
however, has demonstrated that many disadvantages 
and weaknesses, both actual and potential, have not 
been properly assessed. For example, temporary work- 
ers are most crucial when the labor market is tight, 
since most business firms are understaffed during 
this period. Yet, it is precisely during these periods 
that temporary help firms have much difficulty in 
providing reliable workers, especially in the skilled 
and professional areas. 

Another potential difficulty with this type of ser- 
vice concerns the psychological set of the applicant 
who come to the temporary help firm. As indicated 
above, applicants who expressed a lack of interest Me 
& permanent job as a temporary worker had a higher 
rate of refusal of job offers, 


Finally, the type of individual who is attracted to 
this industry t 


ends to be available for work only 3 
days a week or less, Such a constricted amount of 
availability can become a source of difficulty, espe- 
cially when the labor market is tight and business 
firms are interested in procuring workers on a full- 
time basis for a short period of time 

While further research into the subject of tem 
Í is obviously necessary, the present analy- 
we eas suggested particular difficulties and dist- 
at indicate that temporary help service 15 


not a complete panacea fi F rfec 
labor market, or the ills of an impe 
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FACE-TO-FACE INTERACTION IN THE PEER-NOMINATION PROCESS 


ARIE Y. LEWIN, PETER DUBNO, anp WILLIAM G. AKULA * 


Graduate School oj Business Administration, New York University 


In this note, the importance of face-to-face interaction in the peer-rating 


process was tested. Peer ratings were 0 


btained from interacting group members 


participating in a management game and were compared with ratings obtained 
from observers who viewed 15-min. videotapes of these groups participating in 
a game decision-making session. There was no difference between the two sets 
of peer ratings leading to the conclusion that face-to-face interaction is not a 
critical variable in this judgmental process. 


The validity of peer ratings in the prediction of 
various future performance criteria—for example, 
supervisors of life insurance agents (Weitz, 1958), 
managers (Roadman, 1964), and naval officers (Hol- 
lander, 1965)—has been shown to be remarkably 
high. Indeed, in a recent comprehensive review of the 
literature, Korman (1968) concluded that “peer 
ratings are better predictors than psychometric pro- 
cedures [p, 319]” and better than most objective 
tests, Similar conclusions were reached by Lindzey 
and Byrne (1968) and Miner (1968). Al of these 
writers, however, concluded by noting the lack of 
theory to explain the advantage of peer ratings and 
their validity in predicting future managerial per- 
formance or leadership. 

Miner (1968), in commenting on the predictive 
potential of peer ratings over other selection or 
predictive methods, concluded that the explanation 
for this advantage at present was “conjectural.” 
Lindzey and Byrne (1968), who conducted a survey 
of studies dealing with sociometric status (inter- 
changeable with peer ratings), concluded that work 
in this area is almost totally devoid of theory. 
Furthermore, they concluded that after three decades 
of research on sociometric status, the typical investi- 
gation still is one in which regularly available mea- 
sures are correlated with some index of choice status. 
Their criticism is particularly focused on the lack of a 
coherent theoretic structure to integrate the nu- 
merous findings to guide subsequent research toward 
critical questions. Without such a structure, work on 
sociometric status will remain as clear examples of 
limited returns to be expected from theoretic em- 
piricism. 

Finally, Korman (1968) acknowledged the explana- 
tory problem by suggesting “systematic testing rather 
than a scattered, buckshot approach [p. 318]” as one 
kind of reorientation that was needed in predictive 
research. 

Thus, it appears that in peer ratings we have a 
judgmental process that has high validity in the 
prediction of managerial performance but that at 
this time lacks a theoretical framework for satis- 
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factory explanation of this process. In this note, the 
authors report on an experiment designed to test the 
importance of face-to-face interaction in the peer- 
rating process in an attempt to provide one small, but 
perhaps essential, portion of a theoretical formulation 
for peer ratings. 

Statement of problem. The problem dealt with here 
is concerned with the well-known phenomenon that 
group members form judgments about one another 
as a result of interacting. Stated differently, it is the 
outcome of face-to-face social encounters that an 
individual interprets and applies to the judgmental 
process about the other participants of his group. 
Thus, Bales (1950) used face-to-face interaction as 
the means with which to define the basic concept 
and boundaries of any “small group.” Bales thus 
assumes that interaction is a necessary ingredient for 
people to make judgments about others. That is, face- 
to-face interaction serves as a mediator to facilitate 
the learning process of humans and helps test their 
impressions Or perceptions of each other. Similar 
assumptions about face-to-face interaction appear as 
a common thread in the studies of Homans (1950, 
1961), Hare (1962), and Gibb (1969). 

Gibb (1969) used “interaction” in both an ab- 
stract and physical sense. When referring to the 
simultaneous relations between independent variables, 
for example, Gibb is describing interaction in an 
abstract or quasi-mathematical form not relevant to 
the current study. When Gibb refers to (a) “inter- 
action amongst constituent members,” or (b) that 
members of a group should be seen, and see them- 
selves, as having shared or compatible goals, and (c) 
that they be understood as satisfying individual 
needs through interactions with others, then he is 
referring to a physical or eye-to-eye interaction as a 
necessary part of the group process. 

In this note, interaction is operationally defined as 
either social interaction or face-to-face interaction 
denoting the reciprocal influencing of the acts and 
feelings of persons. Consequently, the following gen- 
eral hypothesis is tested here, based on the Bales 
(1950), Hare (1962), and Gibb (1969) rationale: 
Assuming that face-to-face interaction is a determin- 
ing factor in the process of individuals learning about 
each other within a small group, then peer ratings 
obtained from such an interacting group would be 
different from the ratings obtained about the same 
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TABLE 1 


Proportions OF POOLED OBSERVER AGREEM 
QUESTIONAIRE ITEMS FOR DIFFERENTLY R 
INTERACTING GROUP MEMBERS 


| | Question 3: 
| | Question 2: 


Who con- 
Who con- tributed 
| Question 1: | ‘tributed most to 
1 | With whom most to the analysis 
äjä KEE OR achieving | and solving 
| work best? | the goals of | of day-to- 
| the team? day prob- 
| blems? 
| | f = 
> | | 
Interacting | 
group Ss a | P D 
Tanked first o | o | am 
Interacting Ss 
ranked Sines 
second, third, 
or fourth & 31 -31 | „34 
Interacting Ss E is 
ked last 85 87 .89 
ranked last 6 aa | Rei 
a l 
Note.—N 97 for four observer groups. Interacting group, 
N = 14,df = 13, ns. 


group from a similar but external group of non- 
interacting observers. 

Method. The research was conducted at the Col- 
lege of Business Administration of a large eastern 
university. A total of 95 evening students partici- 
pated in a management game simulation. The business 
simulation used the JAN Lee’s Executive Game for 
the IBM/1620 computer. 

The 95 students were randomly allocated into 19 
teams of five members each. All participants were 
exposed to a 1-hr. business game indoctrination and 
one practice session. Thereafter, they played three 
20-min. sessions of the game. 

The third 20-min. game session was recorded on 
videotapes. These recordings made up 15-min. samples 
of group decision making in the game. Although all 
19 teams were videotaped, two teams were eliminated 
for viewing for external observers because of tech- 
nical difficulties in the recording. Furthermore, three 
other teams had to be eliminated because they had 
one or more members absent at the 
session. Consequently, 14 teams comprised 
for the present study. 

After having played the game for three 20-min. 
sessions, peer ratings were obtained from each par- 
ticipant with respect to his other group members in 
response to three questions covering the dimensions 
of: (a) With whom can you work best?, (b) Who 
contributed most to achieving the goals of the team? 
and (c) Who contributed most to the analysis mid 


solving of day-to-day problems? The sociometric 
questionnaire regarding 


from Fiedler (1967). 

The same questionnaire was then given to 97 ex- 
ternal observers. These observers were asked to make 
the same peer ratings after observing 15-min. video- 
tapes of each of the teams participating in the man- 
agement game. In other words, the videotapes con- 
sisted of visual records of short time spans in the 
decision-making sessions of students in teams playing 
the game. The 97 external observers were divided into 


peer-rating 
the sample 


these questions was adapted 


four groups of 20, 22, 25, and 30 members each. 
After a standardized practice session, each group was 
shown a randomly selected sequence of five manage- 
ment-game team videotapes. The 15-min, sequences 
of videotapes were shown one at a time and the 
observers were asked to make the peer ratings after 
cach showing. 

Analysis of results. The general hypothesis was 
tested within teams and across all Ss. To obtain an 
overall impression for the degree of agreement be- 
tween the peer nominations of the interacting groups 
and the external observer groups, the percentage of 
times observers in the four groups agreed with the 
ranked (1-5) interacting members was computed for 
each of the three criterion questions. The rankings 
were then pooled, according to the method outlined 
by Guilford (1954, pp. 181-188), to allow for a com- 
parative analysis of the rankings obtained from the 
observer and interacting groups. Table 1 shows the 
pooled rankings of the observer groups for those in- 
teracting members ranked first and last; members 
ranked second, third, and fourth were combined into 
a single pooled ranking. The results presented show 
how the observers ranked the stimulus Ss across the 
three criterion questions. 

The obtained pooled rankings were then subjected 
to chi-square analysis; a theoretical frequency of 
1.00 was employed for the first and fifth rank posi- 
tions representing perfect similarity to the interacting 
groups’ peer ratings. The middle ratings were Te- 
moved from the analysis, however, because no sensi- 
ble theoretical frequency value could be assigned. 
Indeed, the table shows that there was considerable 
disagreement between observers and interacters for 
these three positions, as might well be expected due 
to the well-known observation that Ss in the middle 
Tanges are often difficult to rank. y 

The extent to which the observers and interacting 
members’ peer nominations agree can be tested bY 
comparing the individual ranks that game participants 
received from their group members on each question 
with the ranks they received from the external ob- 
servers. Another procedure employed for testing 
agreement of ratings involved the computation ° 


a total rank for each group member by group fro™ 
the peer ratings made by this group, and from te 
average of the external observers. Then, the dif- 
o ranks was computed for 
student on the 14 teams, The Wilcoxo? 
Matched-Pair Signed Rank Test was used to de- 


termine whether these differences were significant. 5 
Ek be recalled that the hypothesis stated a difference 
etween the peer ratings obtained from interacting 
groups and from observers, 

be Z i for the difference in ranks were .036 
~.068, and —.118 for Questi resper 
tively. Th aa ater 


he results reject the hypothesis that the dif- 
ferences in ranks are significant. 
Discussion. In con 


these results clearly 
interaction does not 
vening v. 
that Jimi 


ference between the tw 
each 


trast to theoretical expectations: 

demonstrate that face-to-face 
3 appear to be a critical inter- 
ariable in the judgmental process. It appears 
ted vicarious interaction is sufficient for indi- 
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viduals to make judgments about other humans in- 
teracting in groups and that these judgments would 
be in agreement with those made by members of the 
interacting groups. Clearly, this raises the intriguing 
question that perhaps the judgmental process depends 
on the situation and is general to all people rather 
than being individualistic and complex. Furthermore, 
it suggests the additional possibility that viewers (our 
external observers watching the videotapes) engage 
spontaneously in a form of role playing that vi- 
cariously makes them members of the group they 
are watching for purposes of evaluating the real 
members of the group. This being the case, it would 
appear that “interaction” has psychological properties 
(social and projective) beyond those implied in the 
face-to-face notion, and that an explanation of the 
high level of predictive validity associated with peer 
ratings is far more complex than heretofore sug- 
gested. 

Apparently, the face-to-face aspect of interaction is 
only one small physical ingredient in the process 
that does not even appear to be necessary for ade- 
quate peer ratings to Occur. What seems to be much 
more probable is that some aspect of role behavior 
lies at the core of the judgment-interaction process. 
Some support for this point of view stems from 
Bem’s (1965) replication of three dissonance experi- 


ments in which Ss were prevented from experiencing 
the dissonance. He reported that his S’s verbal re- 
sponses were essentially similar to those in the origi- 
nal dissonance experiments when they were given 
only a verbal description of the original dissonance 
situation. He concluded that S does not have to be 
physically in contact with a dissonance situation, Or 
to actually experience any dissonance of his own, to 
respond similarly to the dissonance-generating treat- 
ments given to others. As a consequence of the find- 
ings presented here, further research employing role 
concepts may be warranted. 
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Furthermore, similarity in the ranking procedures 
oi the internal group members and external observers 
may have been a function of their similarity with 
respect to values, attitudes, and culture. The external 
group members were selected from the same popula- 
tion of Ss as the internal group members, that is, a 
population of business administration students. 
Further research should test the hypothesis with 
internal and external Ss of different social, cultural, 
and even sexual characteristics. 
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y i i ationship between scores on the Personal 
t study investigated the relations! S ( i 
ns Inventory (POI) and success in Peace Corps training programs in 
AE The POI was administered to 92 Peace Corps trainees prior to the 
beginning of training and the results correlated with the final selection board 
g 


ratings upon the completion of training 
for the total group with considerable sex 


Attempts to predict trainee success in Peace Corps 
training programs and subsequent volunteer assign- 
ments have produced mixed results, Psychiatric in- 
terviews have been found to be ineffective (Fisher, 
Epstein, & Harris, 1967) as have “sickness”-oriented 
tests (Smith, 1965) and measures of intellectual 
ability (Dicken, 1969). Assuming that the crucial 
variables for Peace Corps trainee success could be 
best described in terms of positive mental health, 
Smith (1966) and Ezekiel (1968) found significant 
correlation coefficients (r= .41 and 34, respectively) 
between estimates of psychological effectiveness and 
volunteer success. 

In recent years, Maslow (1962) has adv: 
idea of the self-actualizing person as on 
developing and using all of his unique 
and potentialities. The Personal Orientat 
tory (POI) was developed by Shostrom 
provide a measure of values and behavio. 
be of importance in self-actualization an 
mental health as opposed to Psychopathol 
values and behaviors are similar to those described 
elsewhere as providing a coherent core of psycho- 
logical attributes necessary for competent, effective 
Peace Corps functioning (Guthrie & McKendry, 
1963; Smith, 1966). 


The present note investigated the relationship be- 
tween scores on the POT and trainee success in 
Peace Corps training programs in Micronesia. 

Method. The Ss for this study were 92 Peace 
Corps trainees (53 males, 39 females) participating 
in an in-country training program in Micronesia. 
The mean age of the sample was 22.2 yr. All but 
four of the trainees were college graduates, Each 
trainee had been screened for Physical and psychi- 
atric symptomatology prior to arriving at his re- 
spective training site, 

The POI consists of 150 two-choice. 
value and behavior judgments. The 
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. Significant relationships were 


found 
differences for several scales, 


scored twice. First, for two basic scales of personal 
orientation, Inner Directedness (127 items) and 
Time Competence (23 items) and, second, for 10 
subscales each of which measure a conceptually im- 
portant element of self-actualization, Several studies 
have supported the construct validity (Fox, Knapp, 
& Michael, 1968; Graff, Bradshaw, Danish, Austin, 
& Altekruse, 1970; McClain, 1970; Shostrom, 1964) 
and the reliability (Iardi & May, 1968; Klavetter 
& Mogar, 1967) of the POI. 

The criterion of success in the training programs 
was the Final Selection Board (FSB) rating. Upon 
completion of a 10-wk. training program, each 
trainee was assigned a composite rating of either 3 
(upper quartile), 8 (third quartile), 7 (secon 
quartile), or 6 (lower quartile) by the final selection 
board composed of the core training staff members. 
Each rating incorporated a trainee’s performance i 
language training; technical studies; cross-cultural 
studies; and a general staff evaluation of inter 
Personal skills, motivation, and cultural adaptation. 
The composition of the final selection boards was 
not constant for all Ss with relatively equal numbers 
being judged by four separate selection boards ap- 
Propriate to the island-group to which the trainees 
Were assigned. High interjudge agreement (r= 87) 
has been reported for FSB ratings (Goldberg, 
1966), and such ratings have been found to be 
significantly related to a field performance criteria- 

The POI was administered to all Ss prior tO 
arriving in Micronesia for training. Instructions at 
the time of administration indicated that the P 


strument that would be scored On 
e training program, Quadriserial ” 


ship between POI scores and FSB 
ratings. 


Results and 
the POT and th 
1. For the total 


discussion, The 


or beyond. For male Ss, 4 of 
05 level or better; while for 
Scales reached statistical sig- 
Of particular note are the correlation C0- 
efficients between the FSB ratings and two major 
scales, Time Competence and Inner Directedness. Tn- 


pe 


. 


TABLE 1 


CORRELATIONS BETWEEN PERSONAL ORIENTATION INVENTORY 
SCALES AND FINAL SELECTION BOARD RATINGS 


| Final selection board rating 
aa 
POI scale 
ha Females 
Total ra | Males 7> s 


Self-Actualizing Value 

tentiality 

eling Reactivity 

Spontaneity i 

Self-Regard 

Self-Acceptance 

Nature of Man Constructive 
ynergy 

Acceptance of Aggression 

Capacity for Intimate Contact 


Note.—POI = Personal Orientation Inventory, 


4k? <05. 
p? <.0L. 
"ep <.001, 


ner Directedness reflects the individual’s autonomy, 
self-support, and, in general, freedom from rigid 
adherence to the social pressures and expectations. 
Time Competence may be viewed as an expression of 
use as opposed to misuse of time (Shostrom, 1966). 
Relatively high and significant correlations for Inner 
Directedness and FSB ratings suggest that this type 
of personal orientation favors adjustment in new 
Situations, especially ones demanding personal ini- 
tiative and relatively high degrees of self-support so 
characteristic of Peace Corps assignments. i 

The use of time, or Time Competence is also sig- 
nificantly related to the FSB ratings. An appreciable 
sex difference is noted for this scale. The difference 
is too great to be explained on the basis of higher 
scores reported for females in general (Shostrom, 
1966). It is possible, however, that the difference 
may be largely due to the different concept of time 
and the male-female role designations in the Micro- 
nesian society as compared with the American cul- 
ture. Thus, a transplanted American male who 
prides himself on being efficient in the use of time 
may be perceived by the Micronesians as an im- 
patient individual and rated poorly on his adjust- 
ment to a relatively easy-going, nonschedulized 
daily approach to living. In contrast to males, 
behavior of females within the Micronesian culture 
seems to be more regulated, thus imposing on an 
outsider a clearly structured demand to conform to 
the existing sociotemporal orientation. 

Of the remaining scales, Existentiality, one’s 
flexibility in applying self-actualizing values and 
Principles; Self-Acceptance, relative acceptance of 
one’s self in spite of one’s weaknesses and deficien- 
cies; and Capacity for Intimate Contact, one’s abil- 
ity to develop meaningful relationships with others 
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seem to be the most consistent across sexes in terms 
of validity. These personal orientations appear to 
be in line with the resourcefulness generally expected 
of the Peace Corps volunteers. With the exception 
of Feeling Reactivity and Acceptance of Aggression 
scales, which are not significant, the remaining five 
scales show at least one significant correlation with 
the FSB ratings. Lack of empirical data on which 
to base an explanation of the observed sex differences 


invites further exploration of this dimension. 

While the need to test the effectiveness of the POI 
in identifying characteristics of successful performers 
in other populations is clear, these results tend to 
support Smith’s (1966) hypothesis that indexes of 
psychological effectiveness (or positive mental 
health) provide the best predictors of competent 
Peace Corps functioning. 
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{ ving records of penalties for violating 
oe ee ae nonviolators. Differences in 
Tasma D pest sae were found. For example, violators scored lower on the 
secon = sness, vigor, responsibility, and emotional stability. Work groups 
al oe a leadership style of their supervisors to be social and well 
tc ood a than instrumental receive fewer punishment letters than groups 
oe the opposite pattern of opinion about their supervisors. 


Every organization makes rules that are considered 
necessary for its effectiveness and continued ie one 
Each employee agrees to meet certain standards o 
task performance, cooperation with other mem Hers 
of the organization, secrecy concerning company 
secrets, and respect for company Property. 

Wherever people work, however, deviations from 
these rules occur. Sometimes they occur to such a 
degree that it hampers the daily routine of the or- 
ganization or threatens the well being of its members. 
Violators of formal company rules could be punished 
by means of a reprimand, a written warning, a trans- 
fer to another department, suspension, or discharge. 
Each year, at Hoogovens, an integrated iron and 
steel plant situated at Ijmuiden, the Netherlands, 
about 130 out of the 10,000 male Dutch employees 
receive a punishment letter (ie., a written statement 
concerning a penalty). (Officials, women, Italian and 
Spanish workers were not included in this study.) In 
this article, violators of company rules are defined as 
people whose personnel record contains at least one 
punishment letter. Those workers whose record is 
“clean” are called nonviolators, 

Sinha and Wherry (1965) have pointed out that 
despite the growing concern over increasingly fre- 
quent norm-violating behavior in the industrial 
world, very few systematic attempts have been made 
to investigate the determining or concomitant varia- 
bles of deviants. Yet, “the tremendous number of 
reprimands, probations, temporary layoffs, demotions, 
and discharges that are invoked y 


yearly by manage- 
ment indicate the perceived importance of discipline 
in business [Booker, 1969] .” 

The central theme in the present study is: What is 
the difference between violators and nonviolators in 
industry? Few studies have been published on this 
topic. Specifically, 


the author examined background, 
personality and ability differences, 


and character- 
istics of crews and their supervisors as possible 
sources of variance, Earlier work had suggested some 
personality correlates for possible 


study. For example, 
Cooper and Payne (1967) found correlations between 


1 During the time this research was done, the author wis 
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versity Twente, Schubertlaan 6, Ensch 
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Requests for reprints should be sent to this address, 


extraversion and neuroticism on one side and be 
frequency of nonpermitted absence and frequency o 
lateness on the other side. In a Navy study, Knapp 
(1963, 1964) found the Socialization scale of the 
California Psychological Inventory and the Conform- 
ity scale of the Survey of Interpersonal Values cor 
related significantly with a delinquency criterion. 
Navy offenders were found to have attitudes favor- 
able toward escapism and toward nonconformity to 
rules and regulations and were lower on smania 
Willingham and Ambler (1963) also found significant 
but low correlation between violating and the Gor 


; Jigor 
don Personality Inventory Cautiousness and Vig 
scales. 


Method. The Ss were as follows: 


Group A: 83 violators who received letters during 
1961, 1962, and 1963; kers 
Group B: a group of 62 unpunished ase 
matched with Group A on the variab s3 
marital status, and length of service; 2 
Group C: 622 persons, hired in 1964, hose Prs 
sonnel record did not contain a punishmen! 
ter at the end of 1966; per- 
Group D: 125 persons, hired in 1964, aR, PET 
sonnel record did contain a punishment i the 
at the end of 1966 (Groups C+D equa 
total of those hired in 1964) ; on- 
Group E: 266 Violators and nonviolators who © 
stituted 6 different crews; 


Group F: 19 Psychopaths and 28 criminals. 


For Groups A-E the following data were collected: 
the antecedents of punishment letters, if any SY es 
letters could be found in the company files; eas ; 
on the General Aptitude Test Battery (GATB)} 
scores on the Kuder Vocational Preference Re 
and scores on the Gordon Personal Profile and st 
ventory. The tests were administered during the f" 
workday after hiring. The attitudes of GrouP d 
about their own group and their leader were collecte 
during a survey not related to this project. Only Ha 
members of Group F were specially tested for g 
study. 


Results. The violators and nonviolators could not 
be discriminated upon by age, percentage married aaa 
single, migration from other parts of the Nether” 
lands, and length of service. The workers who wet 
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punished differed from those who were not punished 
on some variables. The violators were more likely 
to be divorced, to have very small (0 or 1 child) or 
very large families (5 or more children), to be more 
unskilled (fewer mechanical and electrical craftsmen, 
fewer operators and inspectors, more from the lower 
wage classes, more with an elementary school edu- 
cation), and to be less healthy (many at the medical 
examination before entering service declared that 
they suffered from diseases and mentioned a large 
variety of diseases). 

No significant differences were found between 
Groups A and B for any of the GATB scores. From 
Groups C and D multiple-correlation and discrimi- 
nant function analyses indicated that abilities did not 
discriminate violators from nonviolators.* 

The results consistently confirmed only one hy- 
pothesis concerning the ability of violators: the 
standard deviation of the ability scores of violators 
tended to be significantly higher than that of non- 
Violators. 

_ Violators scored significantly higher than non- 
Violators on the persuasive and literary factors and 
significantly lower on the factor measuring pref- 
erence for mechanical activities of the Kuder Voca- 
tional Preference Record. For the scores on compu- 
tational and artistic activities, the expected simi- 
larity between the groups was confirmed. The vio- 
lators showed a lower preference for outdoor work 
and social service and a higher preference for mu- 
sical jobs and office jobs. To summarize: norm vio- 
lators in this industry prefer light jobs involving 
little physical exertion. 

In agreement with findings of Cooper and Payne 
(1967), Knapp (1963, 1964, 1965), and Willingham 
and Ambler (1963) and with the fact that many 
authors reported inferiority feelings and bad ad- 
justment in criminals, a factor analogous wi 
low score on the Gordon Personal Inventory 
found that violators scored lower than nonviolators 
on cautiousness, vigor, emotional stability (on the 
Gordon Personal Profile), responsibility, and ad- 
justment. For these factors, the product-moment 
correlation coefficients with violations were —.46, 
—.26, —.36, —.44, and —.49, respectively. A multiple- 
correlation coefficient of —.59 was obtained when 
predicting violations from both Gordon and Kuder 
variables, 

The multiple-regression analysis was Cro: 
dated, but it proved impossible to discriminate the 
lightly and severely punished Ss using this technique. 
A discriminant function analysis proved to be more 
successful, It was also found that the more serious 
the violation, the lower were the scores on the 
Personality traits relevant to norm violating. The 
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scores of recidivists in light violating were similar 
to those of heavily punished first offenders and of 
criminal recidivists. 

A Group Dimension Scale developed by Hemphill 
(1956) and adapted for the Netherlands by H. J. 
van Dongen (1969), was administered to the six 
crews included in Group E. The scale measures vari- 
ous aspects of group cohesion. The major hypothesis 
here was: The higher the scores on variables char- 
acterizing aspects of the cohesion in a group, the 
lower the number of punishment letters received by 
members of this group. 

A group index was calculated from the sum of the 
group-cohesion variable scores divided by the average 
score for all groups. This resulted in a scale ranging 
from one to six. The rank correlation coefficient be- 
tween punishment index (number of punishment 
letters of the group, divided by the number of mem- 
bers of the group) and this group index was rho 
=—.83 (p < 01). 

A scale developed by H. J. van Dongen (1969) 
was used to measure the relation between leadership 
style and probability of violation. This scale mea- 
sures social, instrumental, balanced, and dynamic as- 
pects of leadership as seen by the group members. 
Social aspects include good relations with subordi- 
nates, tolerance to violation of rules, and applies 
more positive than negative sanctions. Instrumental 
aspects include giving priority to a quick working 
rate, high production, and strict observation of job 
instructions. Balance aspects include well-balanced, 
predictable, and stable characteristics. Dynamic as- 
pects include readily receiving and implementing new 
ideas. 

Two hypotheses were tested here: (a) the more 
social and well balanced a supervisor (as perceived 
by his subordinates), and (b) the less instrumental 
they think he is, the lower the punishment index of 
members of his group. 

These hypotheses were confirmed in that crews 
with the lowest punishment letters per crew consid- 
ered their bosses more social and well balanced and 
Jess instrumental than the crews with the highest 
punishment index. 

The sum of the social and balance scores minus 
the instrumental scores divided by the mean scores 
of all the crews on the leadership variables is called 
here the leadership index. The rank order of this 
index correlated significantly with the punishment 
index (Violations), rho=—.93 (p < 01). 

A multiple-regression analysis on the relative con- 
tribution of personality traits (P) and characteristics 
of groups (Gr) and leaders (L) gave the following 
results: 


Violations = 1,714 — 4P — 4.6 Gr — 7.2 L 
where: 


Violations = 1,000 X number of punishment letters 
per group divided by the number of 
group members, 

P=100 X the result of substitution of 
scores in a formula with three Gordon 


variables divided by the average score 
of all groups for this formula, 

Gr= 100 X score on group cohesion scales 
divided by the average score of all 
groups, and 

L= 100 X score on the leadership scales 
divided by the average score of all 
groups. 


Discussion. It appears possible to discriminate 
norm violators from nonviolators from personality 
scores. The variance accounted for by personality, 
however, is low and may be of small practical utility. 
Leadership style and group cohesion appear to be 
more important variables than personality as predic- 
tors of norm violations. The correlation between 
leadership and punishment cannot be a result of 
selection of workmen by their leader because the 
correlation of the leadership index with the person- 
ality index is very low. 

The correlation between group cohesion and norm 
violation is analogous with the results obtained by 
other investigators who studied delinquency, The 
results in these studies indicated that those parts of 
towns that are characterized by a low cohesion in 
community life showed delinquency rates eight to 
nine times higher than the cohesive parts of these 
communities. These results also Suggest that norm 
violation is closely related to leadership style. Al- 
though norm violation and leadership style are m 


i: ea- 
sured independently, it 


is possible that the high 


SHORT NOTES 


correlation found here is partly caused by contamina- 
tion because the boss judges his personnel and 
nominates violators for punishment. This comment, 
however, refers only to the “sanctions” aspects of 
social leadership style and not to other style aspects. 
There is, of course, a need to cross validate these 


suggestive results and replicate them in other indus- 
trial situations. 
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PORNOGRAPHY AND THE SEX OFFENDER: 


PATTERNS OF PREVIOUS EXPOSURE AND AROUSAL EFFECTS 


OF PORNOGRAPHIC STIMULI’ 
ROYER F. COOK,? ROBERT H. FOSEN* 
American Institutes for Research, Washington, D. C. 
anp ASHER PACHT 


Wisconsin Division of Corrections 


One hundred twenty-nine inmates (63 sex offenders, 66 criminal code offenders) 
were shown a series of 26 slides depicting different sexual behaviors. The Ss were 
also interviewed about their past exposure to pornography. No differences were 
found between the two groups on the measure of rated overall sexual arou to the 
nerous differences were found between the two groups in their past ex- 
sex offenders generally experienced less frequent and milder 
than did the criminal code offenders. Factor analyses of 


slides. Num 
posure to pornography ; 


exposure to pornography 
arousal ratings to the di 


fferent slides identified several arousal factors and sug- 


gested possible differential relations to type of sex offense. 


With the current proliferation of depictions 
of sexual behavior in books, magazines, films, 
and theater, a larger segment of society appears 
to be engaging in vicarious sexual experience— 
viewing and reading about sexual behavior. 
The trend has been accompanied by renewed 
interest on the part of both psychology and law 
in the relationship between exposure to por- 
nography and the commission of a sexual 
offense (Cairns, 1971). 

Only during the past decade have researchers 
begun systematically to investigate this rela- 
tionship, typically employing survey-interview 
methods to determine the type and frequency 
of exposure to pornography (e.g. Gebhard, 
Gagnon, Pomeroy, & Christenson, 1965; 
Thorne & Haupt, 1966). These studies com- 
pared the exposure experiences of sex offenders 
to those of nonsex offenders and/or groups 
from society at large. Results have been largely 
negative, indicating that exposure rates of sex 


1 This study was supported by a grant from the 


Federal Commission on Obscenity and Pornography. 
The authors would like to thank Albert S. Glickman 
for his helpful guidance throughout the research and 
Raymond Anderson for his assistance in collecting 
the data. Í 

? Requests for reprints should be sent to Royer F. 
Cook, who is now affiliated with the Department of 
the Army, Motivation and Training Laboratory, Com- 
monwealth Building, 1300 Wilson Boulevard, Arling- 
ton, Virginia 22209. _ 

3 Now Assistant Commissione 
ment of Correctional Services, Albany, 


r, New York Depart- 
New York. 


offenders have not significantly differed from 
their control groups. 

More recently, several survey-interview 
studies supported by the Commission on 
Obscenity and Pornography have generally 
shown that sex offenders had significantly less 
exposure to pornography as children and 
adults than comparable control groups (Gold- 
stein, Kant, Judd, Rice, & Green, 1970; 
Johnson, Kupperstein, & Peters, 1970; Walker, 
1970). However, none of these studies has 
measured the immediate effects of porno- 
graphic material on the arousal level of sex 
offenders. 

The present research was designed to iden- 
tify differences in immediate and subsequent 
arousal levels between individuals in prison 
for sex crimes and those individuals in prison 
for crimes unrelated to sexual behavior when 
these individuals are exposed to pornographic 
stimuli depicting different types of sexual 
behavior. Also investigated were differences 
between these groups in the type and frequency 
of the individual’s prior history of exposure to 
pornography: (a) during preadolescence and 
adolescence, and (b) more immediately prior 
to his crime. Finally, an exploratory study was 
performed on the existence of general or 
specific arousal patterns to different types of 
pornographic stimuli and possible relationships 
of these more specific patterns to sexual 
offenses. 
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METHOD 
Subjects 


One hundred twenty-nine male inmates at the ve 
consin State Prison served as Ss. All Ss ne bo 
beforehand that participation was voluntary an ra 
their responses were ig agp oe Mones 

to participate; they were ra y. dl 
hated eos aens types. T he most prominent 
reason for refusal was an unwillingness to cooperate 
with the institutional authorities. Despite assurances, 
a few of the sex offenders felt that the experiment 
might have a bearing on their chances for parole. 
The Ss were 66 persons convicted of criminal code 
offenses unrelated to sexual behavior (CC) and 63 sex 
offenders (S0). g 

The CC sample was representatively selected from 
a population of 850 inmates; the SO sample from a 
population of 114. The CC sample contained inmates 
convicted of burglary (V = 17), homicide N= 18), 
forgery (N = 14), and robbery W = 17). The SO 
sample contained inmates convicted of indecent 
liberties with females (V = 21), indecent liberties with 
males (V = 16), attempted rape (V = 10), and rape 
(N = 16). The two samples were reasonably com- 
parable in average education (SOs = 8.7 yr., CCs = 8.9 
yr); average general abilities (SOs = 89.7 on Cattell 
Culture Fair Test, CCs = 92.2 on Cattell Culture Fair 
Test), and average age (SOs = 28.6 yr., CCs = 29.7 
yr.). The only practical restrictions placed on selection 
were that all Ss were to be functionally literate and 
that only the most recently incarcerated were chösen 
to minimize deprivation effects. Thus, the large 
majority of Ss in both samples had been incarcerated 
for less than 3 yr. 


Apparatus and Materials 


The stimulus materials w 
from four 10-min, films. Tabl: 
content of the slides. A Kodak 
automatically presented a 223 in. x 15} in. 
a Screen approximately 7 ft, from S. Data 
lected in one of two rooms located in the clinical 
services area of the prison, 


Pretests 


A pretest was conducted wi 


Design 


Three measures* were used: (a) a set of ratings by 
means of which S rated his level of sexual arousal to 

+A fourth measure, a mood descriptor given to § 
before and after slide viewing, is not Presented here 
Many 5s had difficulty understanding the scales and 
many of the data appeared invalid, 
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each slide as each was presented, (b) a maroon 
eliciting S's responses to 22 questions about his betes 
experiences with pornography, and (c) a Der een 
given to S the morning after the experimental se z iha 

With regard to the immediate arousal eae ‘ier 
slides, the two variables of major intere t were of ai, 
type and presentation order (story line or an a 
In the story-line presentation the slides were arrang E 
in a realistic sequence from fully clothed, throni 
petting, to intercourse; in the random presentation, 4 
received the 26 slides in one of three tandon ie 
sequences. These variables (two presentation FE 
and eight types of offenses) were arranged in a 2 > ed 
factorial design to permit an andependentig oR 
analysis of variance on the ratings data (Winer, 1 en 

With Tespect to the pornographic experience Hie 
tionnaire, the 22 questions were categorized as fol ae 
(a) type and frequency of S's experience with atte 
raphy, during year before imprisonment, (b) type aa 
frequency of experience with pornography onna 
preadolescence and adolescence, (c) S's Nagy of ae 
educational function of pornography, (d) S’s ree 
tion of the adjustment function of pornography, (e) ‘S's 
motivations for experiencing pornography, and (f) S5 


š È 4 % a jmi- 
view of the immediate effects of pornography on cr 
nal behavior. 


The questionnaire 
the experimental 
questions: 


R; 


so . after 
administered the morning a 
session asked the following 


How often did you think about the slides after 
you left the experiment yesterday? 
2. When you left the experiment yesterday, 
excited did you feel? +1 you 
3. When you left the experiment yesterday, did Y 
feel “turned off,” disgusted? ng 
+. Did you have any sexual fantasies after the © 
periment (daydreams, etc.) ? day 
id you engage in any sexual behavior yestere* 
or last night (e.g., masturbated) ? 


how 


Procedure 


The experimental session 
uring the first half of t 
Presented at 15-sec, 


r j arts. 
consisted of two ides 
he session, § viewed the S 4 
Intervals and rated his level , 


: > 4 in- 
sexual arousal to each slide. In the second part, ae 
terviewed § about his pr 


Pornography usin 
he S reported to the clinical services area and W 
escorted to the exp 


€ was told the following: 

The Commission 
has asked us 
study of rea 


We simply Want to fin 
Pornography, Pictures o p 
look at some pornographic slides and make ae 
Judgments about them. We would like you to ser” 
as a subject. Read over this r 
and if you would like to pa 
at the bottom, 


After fi 


; at 
lling out the mood descriptor, § was told co 
he would see a series of 26 slides which he should ra! 
according to a 6 


; <cit- 
“point scale (1 = neutral, not exe 
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ing; 2 = slightly exciting; 3 = moderately exciting; 
4 = very exciting; 5 = highly exciting; 6 = extremely 
exciting). It was emphasized that he was to rate the 
slides according to Wis level of arousal, not as he 
thought someone else might rate them. The S was 
shown five preview slides with a representative range 
of arousal values (as obtained in the pretests) to esta- 
blish a frame of reference, then was given a pad with 
a separate page for each slide rating. The Æ set the 
projector so that each slide would be presented auto- 
matically for 15 sec. The Æ then left the room while S 
viewed and rated the slides. The Æ returned after the 
last slide was presented, gave S another mood de- 
scriptor to fill out, then began the exposure interview. 


RESULTS 
Immediate Arousal Effects 
Mean ratings of individual slides for all Ss 
are given in Table 1. In terms of thematic 


TABLE 1 


MEAN AROUSAL RATINGS OF SLIDES 


Content X SD 

1. Intercourse, ventral-ven tral, 
F over M 3.26 1.63 

2. Intercourse, ventral-dorsal 

F over M 3.23 1.63 

3. Nude petting, M hand on F 
genitals 3.14 1.65 

4. Nude petting, M hand on T 
genitals, kissing breast 3.04 LH 

5. Intercourse, ventral-dorsal, | 
F over M 3.02 | 1.61 
6. F masturbating 3.01 | 1.57 
7. M approaching cunnilingus 2.90 | 1.62 
8. Cunnilingus | 2.89 | 1.73 
9a. Cunnilingus 2.86 | 1.72 
9b. M and F in underclothes, hands 
on genitals 2.86 | 1.72 
10. Fellatio, M over F 2.85 | 1.66 
11. Fellatio, F over M 2.84 | 1.72 
12. Intercourse, ventral-dorsal, each ig 

on side 83 | 1.56 


13. M over F, genitals on breasts 
14. Intercourse, ventral-ventral, 
M over F ‘i 1 
15. Fellatio, M over F Lo 
16. Nude F reclined on back 1 
17. Petting, partially clothed, 
F hand on M genitals -09 $, 
18. F partially clothed, undressing M 126 


19. F partially clothed, undressing M | 2.19 | 1.30 
20. F and M undressing each other 2.13 | 1.21 
21. F and M with blanket over them | 1.83 | 1-14 
22. Fand M dressed, about to kiss | 1.66 | 1.00 
23. F and M dressed, about to kiss r A 


24. Nude M ‘ f 
25. Fully clothed couple 1.12 A3 


Note, —Abbreviations: F = female, M = male. 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF 
AVERAGE AROUSAL RATINGS 


Presentation 
Offense 
Story | Random | Total 
Indecent liberties with 
females 
X 2.32 2.06 2.18 
SD | 1.28 1.53 1.27 
Indecent liberties with 
males 
x 2.62 2.54 ASI 
SD 1.59 1.52 1.55 
Attempted rape 
xX 3.28 2.91 3.10 
SD 1.51 1.43 1.48 
Rape 
x 3.08 2.75 2.94 
SD 1.57 1.63 1.60 
Burglary 
X 2.64 | 2.49 LSI 
SD 1.60 1.63 1.61 
Homicide 
x 2.52 2.78 2.63 
SD 1.68 1.54 1.62 
Forgery 
x 2. 1.93 2.21 
SD 1.52 1.20 1.40 
Robbery 
Y 2.03 88 2.43 
SD 1.45 1.76 1.60 


content, most highly rated are those slides 
showing intercourse in which the female is 
over the male. Next are the nude petting 
scenes, followed by cunnilingus, then fellatio 
slides. Toward the bottom of the ranking are 
the partially clothed petting and undressing 
scenes. 

Mean rankings by SOs and CCs were nearly 
identical. The / tests for individual slides 
showed significant differences between SOs 
and CCs on only two of the slides (SOs having 
higher ratings), both of which were relatively 
low in arousal level (Slide 21, £ = 2.51, df 
= 127, p < 01; Slide 25, £ = 1.97, df = 127, 
p < .05). 

Table 2 displays the means and standard 
deviations of overall arousal ratings (average 
across all slides), categorized according to 
method of presentation (story line or random) 
and type of offense. An analysis of variance 
revealed no significant efect of either pre- 
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mtation (F = .15 df = 1,113) or offense 
P= 1:40 df= 7,113). The Presentation 
‘ Offense interaction was also nonsignificant 
? = 77 df = 7,113). The overwhelming im- 
ession is one of similarity rather than dif- 
rences between overall arousal ratings of 
Os and criminals who are not sex offenders. 


revious Exposure Patterns 


Most of the data were analyzed by chi- 
quare. With 2 X 2 contingency tables with 
‘above 40, Yates’ Correction for Continuity 
ras used. Fisher’s Exact Test was used on 
X 2 tables with N < 20 (Siegel, 1956). Most 
f the data are simple counts of particular 
juestionnaire item responses. The one excep- 
ion was the use of a combined severity-fre- 
juency score. This was an effort to obtain 
ndex of intensity of exposure th 
unction of frequency and degree 
value of the material. The severity-frequency- 
core was derived by first assigning ratings ac- 
cording to the severity of the material, A rating 
of 1 was assigned if S had viewed no pornog- 
raphy. A 2 rating was given for exposure to live 
entertainment (burlesque girls, §0-go dancers) 
erotic novels, semierotic movies shown in regular 
theaters, and magazines displaying seminude 
males or females (e.g., Playboy), A rating of 3 
indicated exposure to hard-core pornography: 
photographs, drawings, or films (stag films, 
“smokers”) depicting explicit sex acts. The 
S’s severity-frequency score was obtained by 
adding up his ratings. Therefore, if Sh 
a stag film, scen go-go dancers, 
Playboy, his severity: 


an 
at is some 
of arousal 


ad seen 
and read 
-frequency score 


would 
have been 7, 
TABLE 3 
EXPERIENCE WiTH PORNOGRAPHY 
DURING PREADOLESCENCE 
Offense* 
Type T tea 
yp f | Criminal 
Sex offender | code 
| h 
| violator 
— 
None 43 | 31 
Occasional magazines 6 | 10 
Hard-core pornography 11 22 
= ea SS 
ee 
*p 2.05. 


TABLE 4 4 


EXPERIENCE WITH PORNOGRAPHY 
DURING ADOLESCENCE 


| Offense* 


n 


| a 
| | Criminal 
| 


Severity frequency 


Sex offender | code 
violator 
$ 
Infrequent or mild | 21 | 10 
Frequent or hard core 33 | 46 
*p <05. 


No significant differences were found be 
tween SOs and CCs on the severity-frequency 
index during the year just prior to imprison- | 
ment. 

As Table 3 shows, during preadolescence 
(6-10 years) there are significant differences 
between SOs and CCs with regard to type ol 
exposure; the major result here is that SOS 
report less exposure than CCs. 


These differences carry through to adoles- 
cence (see Table +): SOs experience milder 
forms of pornography and at a less frequent 
rate than do CCs, . 

When S was asked specifically how often he 
was exposed to pornography during his child- 
hood, again SOs reported significantly less 
exposure than CCs QE = 11.92, p< 05). 

While there were no differences between 
SOs and CCs as to whether they feel they 
learned anything about sex from pornography) 
it should be noted that 49 Ss felt they dic 
learn something about sex, while 33 Ss sale 


© a i i a 
that they learned information that W® 
misleading, 

v . ‘ 

Most Ss in both groups did not feel the 
they would have 


been better off sexually © 
y had been exposed to more pornography 
during their youth, But additional response? 
to this question show that more CCs (V = 9? 
than SOs (V = 1) said that they had “plent 
available” ag youngsters, while more 
(V = 11) than CCs (N = 2) said they woul’ 
have Preferred “some sex education.” Althougs? 
One Ns here are small, these differences wel? 
significant when analyzed by Fisher’s Exact 
Test (p < 02). i 

There are signific 
and CCs in the 


they h 


me 308 
ant differences between SO 
kind of sexual behavior reporte 
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after viewing pornography (see Table 5). The 
CCs said they typically engaged in sexual 
relations after viewing pornography, while SOs 
more frequently reported masturbation. 

Most Ss claimed not to have viewed por- 
nography during the 24 hr. before committing 
the offense. When asked if viewing pornog- 
raphy encouraged them to commit their 
offense, only four SOs and no CCs answered 
affirmatively. 


Post Slide-V iewing Sexual Behavior 


_ The follow-up questionnaire yielded no 
significant differences between SOs and the 
nonsex offenders. Most Ss (90%) claimed not 
to have been “very excited” upon leaving the 
experiment, nor did they feel disgusted upon 
leaving the experiment (87%). Most claimed 
not to have sexual fantasies (82%) or to have 
engaged in sexual behavior afterward (90%). 


Factor Analyses of Arousal Ratings 


attempt to identify the 


In an exploratory 
response to visual 


structure of the erotic 
depictions of sexual behavior, the intercor- 
relations of arousal ratings were factor analyzed 
employing ratings of the 26 slides as variables. 
Although the analysis of variance revealed no 
differences in total arousal levels between the 
two methods of presentation (random and 
story-line sequence) it was conjectured that the 
factor structures might be different. Conse- 
quently, two separate factor analyses were 
carried out, one for the group receiving the 
slides in a random sequence and one for the 


TABLE 5 


REPORTED SEXUAL BEHAVIOR AFTER 
VIEWING PORNOGRAPHY 


Offense* 
Behavior Criminal 
Sex offender code 
violator 
Masturbation 16 2 
Sexual relations 6 m 
Both masturbation and 
sexual relations 4 | 13 
| a ~ 
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group receiving the slides in a story-line 
sequence. 

Because Vs were relatively small, 66 for 
story-line and 63 for random sequences, the 
resulting factor structures must be considered 
tentative. Similarly, the subsequent com- 
parison of group factor means is intended 
mainly to suggest directions for further 
research. 

For each analysis the squared multiple 
correlations were used as communality esti- 
mates. The squared multiple R is generally 
considered a conservative communality esti- 
mate, the “true” communality lying some- 
where between the squared multiple R and 1. 
Oblique rotations were used. Table 6 presents 
the complete rotated factor matrices for each 
analysis. 

The two presentation orders did result in 
different factor structures. While the story- 
line presentation yielded two larger factors and 
one smaller factor, the random presentation 
produced a more balanced, differentiated struc- 
ture (see Table 6). 

The factor interpretations for the random 
presentation are presented first (see Table 7). 
Table 7 presents, for each factor, the slides with 
the highest loadings; also presented are the 
rankings of the slides on “arousal ratings” 
(from Table 1). Only loadings above .40 are 
presented. 

The first factor from the random presen- 
tation yielded high loadings on slides involving 
the female genitalia. 

The second factor, foreplay, loaded heavily 
on the slides depicting petting and foreplay. 
In contrast to the female genitalia factor, the 
slides loading heavily on this factor were all 
ranked rather low (17-23) in arousal value. 

The third factor was identified as a fellatio 
common for the only three slides de- 
this kind of sexual behavior. These 
anked in the middle range of arousal 


factor, 
picting 
slides r 
values obtained. 

The final factor extracted was the intercourse 
factor, loading heavily on slides depicting 
yarious positions of intercourse. Slide 14 
(intercourse, ventral-ventral, M over F) had 
a loading of .39. 

With respect to the story-line presentation 
of stimuli, Table 8 presents, for each factor, the 
slides with the highest loadings. Table 8 also 
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TABLE 6 


FACTOR LOADINGS 


AND COMMUNALITIES 


| Story-line factors 
| Random factors | Story-line fac = 
i J ] 2 
Content gooey 7 | = l = 3 5 
1 2 3 | 4 
| | 7 60 
E 7 5 OL | 03 a4 ‘ 

5 | 3 27 10 | 25 ; 7 E 
1. Fand M clothed a Wo 3 | o | 3t | 0 | 00 | 24 z = 
2. Dressed petting 5 06 | 72 | o7 | 10 | 62 | a | i32 212 
3. Dressed petting i 20 | .00 | 69 .02 08 | .73 | .05 | At A ‘59 
mel 2 i 2 7 
a E | i | 36 | as | 39 | 56 | .02 a | 2s | 
A wean | sh | os ae | a | 106 67 | 
a ob | Ae | a7 | te | 35 73 | .20 | 57 o | 74 
Certa 17 16 43 2 | aa | 96 ag R | oe “60 

8. Petting | E a ae 4 o 
Jude F 16 44 j s | os | can | i 
9. Nude 4 f 24 %5 | 28 | 30 | 42 | 32 24 a 2 a 
n Tmt rbating 6 66 | 06 | 04 | 02 | 80 | .65 | .06 ‘04 85 
11. F mas urbs i 3 | 40 01 01 | 29 74 56 | 26 n a 
reene 19 36 10 | 53 | 60 24 . $ 
itals on breasts 13 | 10 F 36 60 7 8 
lk Fellatio ae ee ee a ee e | se 
15. Fellatio 10 | 05 | o5 72 ii | ge 73 03 05 9 
16, Fellatio 20 | 03 | 753 me | | oo | me o a 
16. Fellatio 11 20 | .03 oe 5 hs se ‘ag 13 72 
wh: (a 4 25 | 25 | .02 wl 16 58 09 13 a 
de petting 2 E 58 7 
18 Cana 3 3 a6 | in 8 
18. Cunnilingus T 50 06 03 | 14 i ou 10 10 98 
19. Cunnilingus 8 -63 3 A14 08 79 T4 5 i A "6 
20. Cunnilingus 9a | 64 | 03 | io | 06 8 | 75) 4 Os = 
21, Intercourse 1 | 05 os | oy | | SS ee) ee 
22. Intercourse ae | 08 | oo | cae kt coe 77 | 03 a ‘88 
23. Intercourse 2 | 38 | o3 | 01 | 28 | 68 a 07 AC Wi 
oust vate oe oe eee or | o2 | 2 

25. Intercourse 14 | 10 | 08 00 | ‘ag Bi psi T 
26. F and M under | 

blanket 21 [a a N g 65 | 09 | 50 | 06 |- i 
Note.—Slides are listed according to the Presentation order of aa 


presents the order of Presentation 
the arousal rankings 
first factor is general 
course slides which 


along with 
and loadings over 40. The 


to heavy belting and inter- 
Progress in content from 
nude petting, 


through oral-genital Contact, to 
intercourse. These slides were among the most 
erotic (13 out of the highest 1 


5 on arousal rat- 
ings) and appeared toward the end of the 
sequence (11-15, none earlier), 

The second factor js a foreplay factor which 
yielded large loadings on the slides showing 
petting and foreplay activities. The foreplay 
slides were ranked next lowest in arousal level 
and were intermediate in the story sequenee 
(all but one of the slides between third 
tenth in the sequence), 

The final factor extracted w 
anticipation factor, consisting of the first set 
of slides in the story sequence This storyline 
presentation tended to differentiate the antici- 


and 


as labeled an 


P = male 
thestory-line sequence, Abbreviations:F = female, M 


pation f 


‘tor. but 
actor from the foreplay factor, 
the rand 


om sequence did not. 


Differentiation of Offenders by Factor 
Our original 


i t 

n ifferen 

analyses, comparing differen; 
Sroups of offend 


„ores 
ers, were based on total sae 
Summed across all 26 slides. Would a m- 
differentiated results be obtained if we exa? 
ined the Specific 
the factor 
analyses, 
group were 
slides which 


arousal scores, represe! ach 
gs on the slides within east 
were calculated for soit 
Similarly, three arousal scores pe! 
cach Storyline factor were computed for ¢ 
offender, 
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TABLE 7 
AROUSAL RANKS AND HIGHEST Factor LOADINGS FOR SLIDES PRESENTED IN RANDOM ORDER 
Factor Arousal rank Content Loading 

Female genitalia 6 F masturbating -66 
9a Cunnilingus 64 

8 Cunnilingus -63 

7 M approaching cunnilingus 50 

16 Nude F reclined on back H 

3 Nude petting, M hand on F genitals 40 

Foreplay 23 F and M dressed, about to kiss -80 
22 F and M dressed, about to kiss 12 

20 F and M undressing each other 69 

21 F and M under blanket -66 

19 F partially clothed undressing M 57 

17 Petting, partially clothed, F hand on M genitals 43 

Fellatio 10 Fellatio, M over F 72 
15 Fellatio, M over F 71 

11 Fellatio, F over M 353 

Intercourse 1 Intercourse, ventral-ventral, F over M 61 
12 Intercourse, ventral-dorsal, each on side 8 

5 Intercourse, ventral-dorsal, F over M 44 

24 Nude M .42 


Note.—Abbreviations: M = male, F = female. 


While these results must be regarded as 
preliminary and in need of cross-validation, 
several findings may be of particular interest. 

The results indicate that SOs appear to 
have a flatter profile of means across the four 
factors than do the CCs. In particular, the 
SOs convicted of indecent liberties with females 
had significantly lower means on the fellatio 
factor slides than either the other SOs 
(£ = 1.98, df =30, p< .05) or the CCs 
(t = 2.25, df = 40, p < 05). 

For the story-line group, it was found that 
the scores on the heavy petting and intercourse 
factor were significantly higher for rapists 
(those convicted of rape and attempted rape) 
than for the other SOs (¢ = 2.10, df = 29, 
p < .05) and for the CCs (¢ = 2.14, df = 47, 
p < .05). 


DISCUSSION 


The overall immediate arousal effects of 
exposure to the pornographic stimuli were 
generally characterized by a lack of differences 
among the offense groups. Both of the major 
samples, SOs and CCs, exhibited relatively low 
overall arousal, as compared to that of college 
students who viewed highly similar erotic 
material (Brady & Levitt, 1965). On the other 
hand, the rankings of slides by both SOs and 


CCs closely parallel the rankings by the college 
students. A large majority of all Ss reported 
little change in behavior during the 24 hr. 
after viewing the material. 

Numerous differences were found between 
the two major samples in their previous ex- 
posure to pornography. In contrast to the CC, 
the SO had less frequent exposure to por- 
nography during preadolescence and adoles- 
cence and, when exposed, it was usually to 
milder forms of pornography. These findings 
are compatible with other recent evidence 
(Goldstein et al., 1970; Johnson et al., 1970) 
and suggest that childhood experiences that 
encourage sexual repression are associated with 
antisocial sexual behavior. In short, these data 
indicate a negative relationship between ex- 
posure to pornography and the tendency to 
commit a sex crime. 

The exploratory factor analyses revealed a 
number of interesting points about erotic 
responses to pornography. The identification 
of independent factors of erotic response indi- 
cates that there are marked differences in 
arousal patterns to depictions of different 
forms of sexual behavior and that these 
patterns cluster into a limited number of 
identifiable factors. Thus, for most purposes 
it is less meaningful to talk about overall 
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TABLE 8 


= ies Been wes 
PRESENTATION, AROUSAL RANKINGS, AND HIGHEST FACTOR LOADINGS 
a ners OF SToRx-LINE SLIDE PRESENTATION 


| a ng 
Presentation | Arousal Content Loading 
Factor order | rank -| 
Heavy petting 22 $ l Intercourse, ventral-dorsal, F over M | a 
and intercourse 20 9a Cunnilingus p 
19 I g| Cunnilingus . 74 
24 12 | Intercourse, ventral-dorsal each on side 33 
15 10 | Fellatio, M over F f 73 
21 1 Intercourse, ventral-ventral, F over M 2 
14 15 Fellatio, M over F É 2 
23 2 Intercourse, ventral-dorsal, F over M 2 
16 11 Fellatio, F over M 65 
11 6 F masturbating ‘60 
13 13 | M over F, genitals on breasts ý 
17 4 | Nude petting, M hand on F genitals, 38 
kissing breast 37 
18 7 M approaching cunnilingus 36 
12 3 Nude petting, M hand on F genitals Pii 
25 14 Intercourse, M over F, ventral-ventral k 
Foreplay 6 | 19 F partially clothed, undressing M T 
5 | 18 F partially clothed, undressing M 62 
10 24 | Nude M i 6l 
4 20 F and M undressing each other ‘ 
8 17 Petting, partially clothed, 58 
F hand on M genitals ol 
7 9b MonFin underclothes, hands 57 
on genitals oe) 
3 22 F and M dressed, about to kiss | ‘st 
| 9 16 Nude F reclined on back 30 
26 21 F and M with blanket over them D 
Anticipation 1 25 Fully clothed couple | a 
2 23 F and M dressed, about to kiss | ae 
7 oD M and F in underclothes, hands 52 
_ On genitals 6 
3 22 F and M dressed, about to kiss Ta 
Note.—Abbreviations: M = male, F = female, 


arousal to pornographic stimuli th 
talk about more specific patterns, T 
cation of these facto 
as well as the limits of these patterns, For 
example, arousal to female genit 
separable pattern but is 
involving nude females at r 


an it is to 
he identifi- 


alia seems a 


y depictions defined 
by the other factors of intercourse, fellatio, 
or foreplay. 

The materials used in the Present study are 


e 
structul 
work on the struc rer 
ild employ a wei 
erms of media as 


0 
ee ag sure 
ditional response meds 

(eg., Physiological), 


finding of a somewhat different pieter 
Structure with a random sequence, as pre 
pared with a Story-line sequence, has a num o- 
of implications, The random sequences a 
duced a much more differentiated wir 

Matrix, separate factors having been extracte 
for each of the major sexual themes presente e 
in the slides, With the more realistic story 
Sequence, the response seems to follow a me 
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integrated course, and the factors extracted are 
shaped according to sequence as well as theme. 
Story-line sequence probably involves a greater 
amount of vicarious experiencing with the 
entire act, with S viewing them as a realistic 
integrated flow of events. Yet different indi- 
viduals appear differentially aroused by factors 
at different stages of the sequence. The more 
fragmented random presentation probably 
lessens S’s identification with the entire act 
atid evokes more independent arousal re- 
sponses. Which sequence is preferable in future 
research depends on the purpose of the study. 
Our results indicate that sequence is a con- 
dition related to factor structure. For most 
research purposes, the random sequence may 
be preferred, since it yields factors more closely 
related to particular stimuli. 

_ Of particular interest is the suggestion that 
individuals convicted of specific sexual offenses 
may react uniquely to one OF more of the 
several arousal factors identified. For example, 
the greater arousal level of rapists to slides 
depicting heavy petting and intercourse and 
the low arousal level of the group convicted of 
indecent liberties with females to the fellatio 
slides are tentative findings which deserve 
further study. Upon further refinement, verbal 
report of sexual arousal to particular erotic 
stimuli might serve as a useful aid in the 
diagnosis and treatment of individuals with 
sexual maladjustments. Similarly, studies need 
to be done on the potential value of such 
profiles in predicting subsequent 


response 
These results provide some 


sexual behavior. 
initial suggestions. 

The factor analyses indicate that a total of 
perhaps 5-10 slides could adequately represent 
the major sexual themes, eliminating the 
redundant slides (i.e., slides heavily loaded on 


511 


the same factor). In this regard, the factor 
analyses presented can be useful in the de- 
velopment of subtests of erotic material and 
can aid the selection of the most diagnostic 
slides for each factor in future research. 
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SHOULD WE MEASURE EARLY OR LATE? 
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Task difficulty and the time at which we | 
interrelated as follows: if a task (or job) is he ¢ > independent 
riable, in terms of optimal discriminability on a E N ead will ieee 
de yin R31 k is “difficult,” the effect of an indepen- 

most clearly early in a task; if a task ; n - 
a will show up Zater in the task, The present research tests this gemu 
as with groups as Ss and corroborates it. In addition, a method of treating 


hypothesis statistically is offered, ong with a conjectural means of classifying 
tasks into easy or hard categories. 


take our criterion measures seem to be 
s “easy,” the effect of an independent 


al 


Weitz (1961) has suggested three major 
dimensions along which criteria may vary: 
(a) the time at which the criterion is measured, 
either early in the performance of a task (job) 
or late; (b) the level of difficulty of the cri- 
terion, easy or hard; and (c) the type of cri- 
terion, such as ratings, errors, etc, 

This conceptualization takı 
for two reasons. First, given the relatively 
rudimentary state of criterion measurement, 
it is evident that different Psychological truths 


may emerge when any or all of the three cri- 
terion dimensions change. 


dictor may be related to 
not to a more objectiy 
actual performance data, 
tion of the differences 
arise when different pro 
used, and their develo 


€s on significance 


For example, a pre- 
a rating criterion but 
€ measure based on 
Second, an examina- 
between results that 
ficiency measures are 
Pmental trends, may 
the independent vari- 


In attacking some of the problems that de- 
rive from this concept of criterion me 
ment, Weitz (1961, 1964, 1966) has shown both 
in his own research and by analysis of others? 
data that two of the criterion dimensions 
interrelated as follows: If a t 
maximal differentiation on ad 
able will appear relatively early in the task; 


if a task is “hard,” maximal differences will 
appear lale in the task, 


asure- 


are 
ask is “easy,” 
ependent vari- 


1 Requests for reprints should be sent to 
Weitz, Department of Psychology, New 
versity, 21 Washington Place, Ne 
10003. 
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York Uni- 
w York, New York 


iss of 

This relationship should hold, a 
the independent variable employed in os = 
vestigation. That is, it should be lear i vati 
are talking about the manner in which mee 
ables are interrelated, regardless of ee 
Furthermore, the hypothesis makes no $ ee 
ment about discriminability at other te 
most “appropriate” times for cepa “one 

For example, consider the dificult a rgu- 
dition for a moment (but note that the ue 
ment presented below can be made, if PD, An 
modified of course, for easy tasks as ck oat 
independent variable may have an effect ja of 
in a difficult task in terms of siginn 
difference on a criterion measure; and, ka a 
all trials are lumped together, there may vocals 
significant mean difference as well. ee a 
What is in question here is relative mye a 
nability, or, the best point in time to ta lifi- 
criterion measure, We maintain that for d d 
cult tasks, the best time to take a criterion me 
sure is late in the task, 

There are three 
the dependent v 
difference betwe 
Some independe 
Pothesis holds ( 


conceivable alternatives ' 
ariable does indeed reiste a 
en two groups that vary by- 
nt variable, and if our 
again, for difficult tasks). 


Case 1. 


The most sizable 
Sroups occurs late, 
the same direction 
that an Overall m 
early and late 
difference, 


discrimination between os 
but there is a difference a 
early, This would soci 
easure lumping all sco" 4 
would give an indication of 
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Task DIFFICULTY AND 
Case 2. 


The most sizable discrimination between the 
groups occurs late, but there is no difference 
between the groups early. This would suggest 
that late measurement is most likely to yield 
a significant difference. Tf all trial scores were 
lumped together, the difference might not 
attain significance. 


Case 3. 


The most sizable difference between the 
groups occurs late, but there is a trend in the 
early trials that is opposite to the significant 
difference obtained late. This would be more 
likely than Case 2 to cause a nonsignificant 
result when all trials are lumped. Also, if only 
early measures were taken, the researcher 
might form an erroneous impression of the 
operation of his independent variable. 

In view of the fact that more powerful 
statistical tests (i.e., tests more likely to show 
significant results when the null hypothesis is 
indeed false) would result if measurements were 
taken late, and in view of the fact that it may 
not always be possible or desirable to gather 
criterion data at many points in time, it is 
clear that evidence in favor of the hypothesis 
outlined above concerning the best time to 
measure for difficult tasks has both practical 
and theoretical relevance. Both experimental 
and industrial psychologists could more efi- 
ciently plan their experiments and validity 
studies. In addition, theories may be more 
rigorously tested by usual statistical tech- 
niques if their power is increased by consider- 
ing the point in time when a dependent vari- 
able is most likely to register a difference be- 
tween groups that do differ on some indepen- 
dent variable. 

Tasks may be easy or hard for a number of 
reasons, including complexity and ambiguity. 
Also, a given task may become easy Or difficult 
due to the aptitude or suitability of the indi- 
vidual for the task. Thus, if an individual has 
high aptitude for a particular job, as measured 
by a valid test, then we might think of the task 
(job) for this individual as a relatively easy 
one. On the contrary, if an individual has low 
aptitude for the particular job, then the task 
might be considered more difficult for him. If 
the earlier formulation regarding task difficulty 
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and dependent variable relationships is cor- 
rect, this should mean that the effect of an 
independent variable will show up carly for 
people with high aptitude and later for people 
with low aptitude. One example of this is that 
a job-relevant personality measure might be 
valid late for low-aptitude individuals and 
early for high-aptitude individuals. 

Time of measurement, early or late, must be 
defined relative to the task that is undertaken. 
For experimental tasks used in a laboratory, 
early may mean the first few trials; for job 
performance, it may mean the first few months. 

Several investigators have attempted to 
verify the hypothesis in its extension to per- 
sonnel selection. Hakel (1968) demonstrated 
some support for the concept, and in the work 
by Dodd, Wollowick, and McNamara (1970) 
the hypothesis seems rather well supported. In 
both instances the validity of a personality 
measure appeared higher if aptitude was used 
as a moderator, and the personality measure 
related to performance at the appropriate 
points in time. 

While this extension is interesting in and of 
itself, we are not testing it here. The task diffi- 
culty in the present experiment is defined inde- 
pendently of the suitability of the groups 
(groups are the Ss in this study) for the task 
demands made upon them. 

Tt should be noted that the time dimension 
of the criterion has been handled, but in a 
somewhat different context, by others (e.g-, 
Fleishman, 1957; Fleishman & Fruchter, 1960; 
Fleishman & Hempel, 1954, 1955; Ghiselli & 
Haire, 1960). Their major orientation has been 
one of showing that relationships between pre- 
dictors and performance measures change over 
time as a result of changes in the skill, or 
motivational, components of the job (task). 

The present research proposes to test and 
extend Weitz’ hypothesis. The criterion mea- 
sures to be reported here are group performance 
measures as opposed to individual performance 
measures. This is, of course, & major departure 
from previous tests. In addition, the data were 
derived from a social psychological experiment 
with which Weitz had no connection (Levine, 
1970). The principal investigator had no inten- 
tion of testing the concept at the outset and, 
therefore, no expectations about outcomes. The 


SSS = 
SS 
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analysis of the data and testing of the hypothe- 
sis was suggested after the study was completed. 
The hypothesis to be tested is that the prin- 
ciple stated above, relating effectiveness of an 
independent variable and time and task dith- 
culty, holds for groups as well as individuals. 
Discrimination between two sets of groups that 
differ on an independent variable will be greater 
relatively early (as opposed to late) when the 
groups are confronted with an easy task; but 
when groups are faced with a hard task, the 
differences will be greater late in the task. 


METHOD 


There are two independent variables to be considered, 
both of which are aspects of a group's control structure: 
the amount of control or influence possessed by group 
members over their group’s decisions, and the distribu- 
tion of control or influence in the group. 

Four sets of three-man groups were set up as follows: 
Each member of the group was given role-playing in- 
structions separately. These could be either “high- 
control” or “low-control” instructions. 

The high-control instructions indicated that S should 
concentrate on maintaining a high rate of interaction 
and on giving suggestions. He should also try to avoid 
asking for opinions, information, and Suggestions. 

The low-control instructions indicated that § should 
concentrate on remaining relatively quiet, and when S 
did interact he would do so by asking for opinions, infor- 
mation, and suggestions. He Was to give o 
mation, and suggestions only if 
other group members, (Manipulation checks showed 
Ssigned role very wel 


16 groups—high amount of contro]— 
balanced distrit 


bution.—All three Ss received high 
control roles, 


Set 2, 16 stoups—high amount of control—un- 
balanced distribution. Two of three Ss received 


high-control roles, and the third received low-con- 
trol instructions, 


Set 3. 16 stoups—low amount of Control—bal- 
anced distribution. All three Ss received low-control 
roles, 


Set 4. 16 stoups—low amount of control—un- 
balanced distribution, One of three Ss received the 


high-control role, the other two had low-control 
roles, 


In all sets, eight groups had easy tasks and eight had 
hard tasks. In comparing high versus low amount of 
control, Set 1 and Set 2 were compared to Set 3 and 
Set 4. In comparing balanced versus unbalanced dis- 
tribution of control, Set 1 and Set 3 were compared to 
Set 2 and Set 4. These comparisons are based on an 
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> sidered 
analysis of variance approach and may be considere 
orthogonal. 


Task 


The task is a modified version of one used by, Hollan- 
der (1960). Three criteria were instrumental in i A 
tion: (a) Does it hold S's interest? (b) Does it ze a 
definite and specifiable outcomes? (c) Is it readily 
manipulable in terms of case and difficulty? oes ioe 

The task is centered around a 7X7 matrix (see 
Fig. 1) in which columns are designated by color names, 
rows by letters. 


The elements are payoffs ranging from —12 to Tas. 
Groups must decide upon a particular row for each o 
15 trials. After the row has been determined, Æ declares 
the column for that trial—the column choices corre- 
sponding to a predetermined system. The object of the 
game is a maximization of payoffs to the group or, in 
other words, to figure out the system. 

Instructions, taped, were as follows: 


Before you is a matr 


ix of plus and minus num- 
bers. The columns 


have color names and the rows 
have letter names. Your task is to choose one row 
for each of 15 trials to maximize your group's pay- 
off, and to do this as quickly as possible. 

How is the Payoff determined? It is the sum of 


plus and minus values you accumulate over 15 
trials. 


How are these v: 
example. Trial 1, Y 
discussion and 


alues determined? Here is an 
our group chooses a row after = 
announces their choice to the experi- 
menter. Then the experimenter announces the col- 
umn color for that trial, The column color has heen 
selected according to a predetermined system. For 
instance, the system may be consecutive columns. 
Thus, if “req” ere the first column (Trial 1), then 


the color for Trial 2 would be “blue,” for Trial 3, 
“yellow,” and so on. 


Your best row choice; 


s under these conditions 
would be “E" for Trial 1, which yields a +15 with 
“RED”. n, D for Trial 9 which yields a +10 
with “blue”, . , + “G” for Trial 3 which yields 4 
+10 with “yello 


t Wi... and so on. The sum of 
your gains and losses is the payoff, 
Obviously, 


your row choices are there- 
discussion, 


Also, as part of your first trial you must discuss 


the following issues; 
1. Which member 
row choice to the e 


2. Which member of the group will pose questions 
about the task, if 


you find that you don’t under- 
stand what you have to do. 
3. Whether Supplies are necessary, for example, 
Taper and pencil, and who will be responsible for 
these, 


4. How the grou 
PY majority, unan 


of the group will announce the 
Xperimenter, 


P's decisions are to be reached— 
mity, or some other method. 


Task DIFFICULTY AND 


| 


CRITERION MEASURES 


on 
m 
on 


Row Green | Red Blue Yellow | Brown | Orange Black | 
A We a V tp +5 ay | a +15 | 
o B p0 "g ~2 =? mE -3 =i | 
Er: | a a| Se 43 | =u = a) 
D i +5 —7 +10 | -2 | —5 +1 —2 
E | = aiy -1 +1 +13 —10 +2 
F | —a | #6 a. jA +1 
=a 1 ay | S sö | +10 44 —2 | -8 


Fic. 1, The game matrix. 


A list of these issues will be given to you by the 
experimenter, and they are to be discussed during 


your first trial. Good luck! 


form of the matrix and other 


The rationale for the 
Hollander (1960, pp- 


procedural aspects is given by 
366-367). 

The two output variables are time-to- 
all trials, and the group's payoff. 


completion of 


Problem Difficulty 


The means of making the problem easy or difficult 
obviously consists of alterations in the system of column 
choices for the trials. Hollander? suggests as & “simple” 
system, one with a “shift of two columns to the right 
on each trial, viewing black and green as adjacent ina 
circular arrangment.’’ A more complex system might 
approach random selection of the column colors. After 
some preliminary piloting it was decided that each 
group would play two games with two separate systems. 
The easy systems were (e1) three consecutive columns, 
skip one, and (e:z) two consecutive columns, skip two. 
‘The difficult (hard) systems were (hy) 1,3, 5, 75.75 5,3, 
columns forward from the color of the preceding trial, 
and (hə) 1, 2, 3, 4, 5 columns backwards from the color 
of the preceding trial. 

For each of the four problems, the initial color was 
randomly selected, and the rest followed from the sys- 
tem. Thus, the same two easy color sequences, and the 
same two difficult color sequences were used through- 
out. Maximum total payoff for the easy set of 30 trials 
is +360 and for the difficult set, +370; so any differ- 
ences between the easy and difficult condition can 
hardly be due to differences in maximum payoff ob- 
tainable. Also, the order of presentation of the two 
as randomly determined for each group, 
traint that the two possible orders for 
d conditions be used an equal number 
ived either two easy or two hard 


problems w 
with the cons 
both easy and har 
of times. A group rece! 
problems. 


oe 
2, P. Hollander, personal communication, June 4, 


1969. 


Problem ease or difficulty should afiect both output 


variables such that easy problem: 


s yield higher payoffs 


and take less time to complete than difficult ones. 
Furthermore, the “meat” of the group's discussion 


should be centered around the 


system and how to 


maximize payoffs within the system. 


Subjects 


The 64 groups were composed of three male or three 


female undergraduates enrolled 


in a basic psychology 


course. (Results are not reported separately for male 


and female groups since they d 
group performance measure of cu 
payoffs.) Participatio! 


id not differ on the overall 


rrent interest— group 


n in experiments was & require- 


ment of the course. Subjects were randomly assigned 


to one of the four sets of groups ar 


ad to the easy or hard 


i Each experimental treatment in the 


problem conditions. 


design had equal numbers of male and female groups. 


Procedure 


Three subjects were taken into the lab and seated 


around a small tal 
experiment and then given their 


ble. They were told a little about the 


role instructions in- 


dividually in separate rooms with coaching when 


appropriate. 


After a practice session, and more feedback in the 


separate rooms, t 


task instructions were played. T 


15 trials of Game 1. At this point 


separate room i 
role playing- 
again completed. 


RESULTS 


hey returned to the lab and the taped 


hey then completed 
they were taken into 


ns once more and given feedback on their 
Game 2 was played, and 15 trials were 


The task difficulty manipulation worked. 
Easy problems yielded higher payoffs and 
shorter completion times than did difficult 


problems (see Table 1). In tl 


he Results section 


| 
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TABLE 1 


CHECK ON THE Easy VERSUS DIFFICULT 
Task MANIPULATION 


OVERALL EFFECTS OF 


TABLE 2 


THE CONTROL 
VARIABLES ON PAYOFF SCORES 


Easy | Difficult) p 
Measure tasks | tasks 
' Time-to-completion* 31.43 44.46 9.05* 
Payoff score* 291.44 | 178.72 | 40.87** 


a Summed over both games. (A constant of 100 has been 
dded to payoff scores.) 
*# p < .001. 
** p <.0001. 


ve are concerned only with the payoffs achieved 
yy the groups for each trial since time scores 
-eflected the length of the total game only, not 
the length of each trial. 

In considering the data presented below, the 
reader might wish to know the effect on payoff 
scores over all trials of the control structure 
variables, amount of control, and distribution 
of control. Table 2 contains the data, which 
reveal that high amount of control groups 
scored significantly higher than low amount of 
control groups, and balanced distribution 


SUM OF GROUP PAYOFFS 


Oo High Control 
== Low Controt 


E S S Ss tes 
GAME | 


8 9 p 
GAME T 


BLOCKS OF THREE TRAILS 


Fic. 2. Relationship between payoffs and trials 
for easy tasks (high-low control), 


Amount of control 
Score — a —— 


Low | F 


| git 


X Payoff score 260.19 209.97 


Distribution of control 


Balanced | Unbalanced 


214.56 


X Payoff scores | 255.59 5.41* 


è Summed over both games. (A constant of 100 has been 
added to payoff scores.) 

*p <.03. 

> <01. 
groups scored significantly higher than un- 
balanced distribution groups. 

Graphical evidence bearing on the hypothe- 
sis of interest appears in Figures 2-5. 

Figure 2, which depicts the performance (in 
blocks of three trials) of high- and low-control 
groups on easy tasks supports the hypothes!s- 


SUM OF GROUP PAYOFFS 


o—o Balanced 
e———e Unbalanced 


+ 

ae O E E E! 

‘ 2 sa aile so gN 
GAME | GAME I 


BLOCKS OF THREE TRAILS 


Fic. 3. Relationship between payoffs and trials 
for easy tasks (balanced-unbalanced). 
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The early trials of the first game offer, the 
best discrimination between thejhigh- and low- 
control groups. The evidence is not consistent, 
however, between first and second games since 
in the second game some=difference appears 
relatively late. (It may be pointed out here 
again, as Dodd, Wollowick, & McNamara 
[1970] note, that the hypothesis does not 
predict an absence of discrimination late in an 
casy task, only that early discrimination is 
greater than late.) 

When we compare the balanced distribution 
versus the unbalanced distribution sets of 
groups on the easy task (see Figure 3 again 
plotted in blocks of three trials), we find no 
consistent pattern in the first game. 

However, in the second game, the later trials 
rentiation, in 


reveal a greater amount of diffe 
rior. This is 


which balanced groups are supe 
contrary to the hypothesis. 

Turning to the hard task 
4 shows that the hypothesis 
more difficult tasks is supported. 


condition, Figure 
as it related to 


SUM OF GROUP PAYOFFS 


GAME I 


GAME | 


BLOCKS OF THREE TRAILS 


Fic, 4, Relationship between payoffs and trials 


for hard tasks (high-low control). 


450 


400} 


50 


SUM OF GROUP PAYOFFS 


-50F 


GAME | GAME T 


BLOCKS OF THREE TRAILS 


Fic. 5. Relationship between payoffs and trials 
for hard tasks (balanced-unbalanced). 


Theater trials in both the first game and the 
second game show the greatest differences 
between’ the two sets of groups with the high- 
control groups being clearly superior late in 
each of the sessions. 

Figure 5 also demonstrates that the block 
trial data for the later trials for Games 1 and 
2 (hard task—balanced versus unbalanced dis- 
tribution) are more discriminating than the 
early trials. The balanced’groups show consis- 
tently better performance late in each game. 


Statistical Treatment of the Data 


In an attempt to develop a statistical tech- 
nique to handle the results of these kinds of 
problems, it was decided to compute a i ratio 
for each trial (1-15, for both games). 

These ¢ ratios can then be compared in two 
ways: 

1. The mean / ratios for Trials 1-8 for Games 
1 and 2 (16 scores) can be tested against the 
mean { ratio for Trials 9-15 for Games 1 and 2 
(14 scores). It should be noted that this divi- 
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sion, first eight versus second seven, is not 
necessarily the best one possible, especially for 
the hard condition. It was used merely to split 
the games approximately into first half versus 
second half. The significance tests might there- 
fore be considered somewhat conservative. 

2. Another way of comparing the / ratios also 
stems from the hypothesis. Since the two games 
in each condition differ only in the systems, 
both games (30 trials altogether) might be 
considered one task. The first half of the first 
game would then be early, and the rest of the 
trials would be late. Therefore, the first eight 
t ratios of Game 1 may be contrasted with 
those of the sum of the last seven trials of 
Game 1 and all 15 of Game 2. 


In view of the high degree of similarity be- 
tween Games 1 and 2, the comparison between 
the first 8 and last 22 Z ratios seems most 
apposite as a test of the theory, although both 
kinds of comparisons are presented and dis- 
cussed to assess the possibility that the two 
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produce different results. This use of { ratios 


may be seen as a rather novel way of handling 


the question of relative discriminability, that is, 
significant mean differences between sets of 
t ratios indicates that one set shows greater 
differentiation between the experimental groups 
of interest. (As required by the Z test, both 
variance estimates in each comparison were 
tested. All pairs were homogeneous.) 

The results of these comparisons offer some 
support for the hypothesis, 

Table 3 contains the results of these com- 
parisons for the easy task condition. 

The data reveal that for the primary com- 
parison (first 8 versus last 22 / ratios), the high- 
control groups are significantly more success- 


ful, compared to low-control 


; 1 groups in the 
early going. 


Groups with a balanced control 
structure, however, are significantly more 
successful than those with unbalanced struc- 
tures later in the tasks, In fact, if we confined 
our criterion measurement to the first eight 


TABLE 3 


COMPARISONS OF THE ME 


ANS OF EARLY SETS OF £ R 
L RATIOS FOR E 


ATIOS AND THE ME, 


ANS OF LATE SETS OF 
ASY TASKS 


Comparison Early Late p 
I. High versus low amount of control comparison 
Primary comparison 7 
8 trials versus last 22 trials 
SD fae aa 
Secondary comparison oer l 
F it half versus second half of both games 
tig = .45 
an 16 tis = .02 
tis = .94 tis = .96 1.24" 
I. Balanced versus unbalanced distribution = D 
Primary comparison 7 7 
ma 8 trials versus last 22 trials ts = —.22 
a le = .63 7 
SD t= .76 Gem o aaa 
Secondary comparison 
First half versus second half of both games 
x | 
lis = 17 
Sp | 16 t -66 5 
| te= .82 ta = 05 = 12" 
Note.— Abbr 
a The minus si 
25 


Bp < 15 
eee b <.05 
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TABLE 4 


COMPARISONS OF THE ME OF 


RLY SETS OF {f RATIOS AND THE MEANS OF LATE SETS OF 


? Ratios FOR Harp Tasks 


Comparison 


Early Late t 


I. High versus low amount of control 


Primary comparison 
First 8 trials versus last 22 trials 
x 
SD 
Secondary comparison 
First half versus second half of both games 
xX 
SD 


t=.359 | 9 STB 
tp= 934 | 252 

| 
tis = .668 tia = 1.793 pelts 
tie = .965 trace 876) Mite? 


II. Balanced versus unbalanced distribution 


Primary comparison 
First 8 trials versus last 22 trials 


Xx ts = .126 tee = .948 ae 
SD ts = .828 te = 1.047 21001 
Secondary comparison 
First half versus second half of both games 
xX ts = 487 tu = 1.005 sya 
SD tig = 1.207 tig = .780 > 


Note.—Abbreviation ! = trial. 
* p <,20 (two-tailed test). 
xx p < .06 (two-tailed test). 
weet p <.01 (two-tailed test). 


trials, we would find, contrary to the overall 
results, that the unbalanced structure groups 
were superior to the balanced structure groups. 
The data of the first-half versus second-half 
comparison (the secondary comparison) yields 
a similar picture to the primary results with 
one supportive and one contradictory result 
for the amount of control and distribution of 
control variables, respectively, although the 
results here do not approach conventional 
significance levels. 
_ Thus, the hypothesis is su 
instance and contraverted in 
be possible, although on an ex post J 
only, to resolve this negative finding in favor 
of the hypothesis based on the suitability of 
the various group structures for the tasks they 
faced. The matter is explored further in the 
discussion. 
Table 4 contains the data for the results of 
the ¢ ratio comparisons for the hard task condi- 
tion. Here the primary comparisons (first 8 
versus last 22) are both supportive of the 
hypothesis. Thus, high amount of control and 


ipported in one 
another. It may 
facto basis 


balanced distribution of control groups are 
relatively more successful late in the task than 
their counterparts. The secondary comparisons 
show similar results, though in one of the two 
cases (balanced versus unbalanced) the ¢ ratio 
difference does not approach the conventional 
alpha level. 
DISCUSSION 


Both the graphic and statistical data offer 
support for the hypothesis. It seems, therefore, 
that within the limits of our experiment early 
measurement is better suited than late mea- 
surement to unearth differences between 
groups, when tasks are easy; when tasks are 
hard we should measure performance later in 
the task. This was so for all comparisons but 
one, the balanced versus unbalanced distribu- 
tion condition for the easy task. What of this 
discrepancy? 

In answer to this problem, recall that at the 
outset, we stated that an examination of the 
variations in performance at various stages in 
the task might shed some light on how the inde- 
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pendent variables operate. This is borne out to 
a degree by the contrary findings. 

The discrepant results of the balanced versus 
unbalanced groups on easy tasks stem pri- 
marily from the performance of one set of 
groups, the low-control, balanced distribution 
condition. Since everyone in these groups had a 
low-control role, there was little movement 
toward the goals of quick completion of trials 
and maximum payoffs. In essence any task 
would be difficult for this type of group, and 
their scores served to depress the scores of the 
balanced set of groups as a whole in the early 
going. In fact, the significant results of Table 3, 
which contravert Weitz’s hypothesis, indicate 
that this comparison behaved in a manner to 
be expected of hard tasks, that is, maximal 
differentiation later in the task, This is an 
admittedly ex post facto explanation, but it 
should serve to illustrate that a task may indeed 
be made more or less difficult as a function of 
individual or group suitability for the task. In 
terms of Weitz’s hypothesis, this explanation 
of negative results would most appropriately 
be considered under the heading of the postu- 
late or extension to the hypothesis, although, 
again, we were not explicitly testing it in the 
present study. 

In addition, our und 
pendent variables is furthered by 
tion of the graphic evidence (se 


the 
9 ig i 
some bearing on this by Seine pie tae 
of classifying tasks based on the qualit: rr 
and quantitative shifts in ability and ei 
tional components at various Stages of a 
performance. Easy tasks might undergo f va 
shifts than difficult tasks, Fleigh 50 fewer 


! shman in ar- 
ticular has shown that late stage proficitne 
n 
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is harder to predict than early proficiency and 
has sought to identify variables especially pre- 
dictive of late proficiency. The present study 
perhaps contributes in showing that task difi- 


culty may be an important moderator in such 
predictions. 


At any rate, the conceptualization of cri- 
terion measurement described here seems @ 
heuristic one and might lead to a better under- 
standing of what the independent variable is 
actually doing, It may also serve as a starting 
point for a concerted attack on that most 
important problem—the criterion, 
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: RELATIVE INFLUENCE OF JOB AND SUPERVISION 
ON SHARED WORKER ATTITUDES! 
DOUGLAS B. CAMPBELL? 
Survey Research Center, University of Mi ichigan 
The question addressed is whether similarities of job attitudes among workers 


are more closely associated with the nature of the job itself or with the nature of 
ived. Comparisons of attitude similarity between pairs of 


the supervision rece! 


workers were made for four classes: workers having the same job as one another, 


the same supervisor, b 
of occupations in the 
class means to assess the 
similarity between workers. 
significantly associated (p < .05) with 
not significantly 
linking Maslow’s hierarch 


Many studies have explored the relative 
relevance and importance to worker attitudes 
of various factors in the work environment. 
Herzberg, Mausner, Peterson, & Capwell 
(1957) and Vroom (1964), among others, 
have reviewed and summarized much of this 
literature. Herzberg et al. present data showing 
that the factor mentioned most often in 150 
studies of worker attitudes and satisfactions 
was “intrinsic aspects of the job,” followed by 
“supervision,” and then “working conditions,” 
thus establishing “supervision” as a factor 
independent of other “job” factors as well as 
suggesting the relative importance of the 
factors. Further data in Herzberg et al. show 
the same rank order of these three factors when 
their average importance is compiled from 16 
studies involving over 11,000 workers. These 
tables may be interpreted as giving a slight 
edge to the job itself, while other sources 
(e.g, Ash, 1954; Ott, 1965; Putnam, 1930) 
suggest the greater influence of supervision. 
Still other studies report one view but are 
open to other interpretations than those made 
by the original investigator. For example, 
Morse (1953) found a relationship between 
attitude toward supervision and general morale. 
ered as part of a 
the United States 


1 Data for this report were gath 
study supported by a grant from 
Department of Labor (L-68-84). 

The author is indebted to Robert P. 
Stanley E. Seashore for hel 
Preparation of this manuscript. on 8 

? Requests for reprints should be sent to Doug! a : 
Campbell, Survey Research Center, Institute for 
Social Research, University of Michigan, Ann Arbor, 


Michigan 48106. 


Quinn and 


Ipful comments made in 


associated. It was sugge: 
y of needs with job and supervision. 


oth, or neither. The Ss were 311 workers from 
southern Michigan area. Comparisons were made between 
relative association of job and supervision with attitude 
Overall results showed shared supervision to be 
attitude similarity, while shared job was 


a wide variety 


sted that findings could be understood by 


To test whether it was a halo effect or super- 
visors actually were affecting overall morale, 
she compared work groups having the highest 
average morale index scores with those having 
the lowest scores, for all measures of attitude 
toward supervision. Only two of the nine 
comparisons were significant (p < .05, two- 
tailed / tests), and she concluded it was merely 
a halo effect. Inspection of her tables, how- 
ever, shows that all differences were in the 
predicted direction. Furthermore, her hypo- 
thesis dictated a one-tailed test, in which 
Case 3 of the nine tests were significant at the 
05 level and six of the nine were significant 
at the .10 level. The supervisor may have had 
a more important effect on morale than Morse 
allowed. In general, then, no conclusive 
findings have emerged indicating the relative 
influence of various work factors, especially 
the factor of job itself and that of supervision, 
on a worker’s attitudes. 

Demographic variables complicate matters 
further. Herzberg et al. (1957) report that the 
relative influence of the job itself as opposed 
to supervision on worker attitudes is con- 
founded by such variables as education, 
occupation level, income, and sex. 

In an attempt to clarify past findings, this 
study examined the relative importance to 
worker attitudes of the job itself versus super- 
on. The study employed a somewhat 
different methodology from those used in the 
past. Many past studies have been concerned 
primarily with relationships between various 
work factors and “high satisfaction” or “good 


attitudes.” This study assessed the es 
which the job itself and the Sord 
worker receives are associated Fh an jal 
holding similar attitudes and a 
one another, be those attitudes oa Lal 
tions good or bad. This study er eit 
from most previous ones in the w idely A un 
occupations included in the population studied. 


METHOD 
Subjects 


Data were obtained from interviews with 311 workers 
in the southern Michigan area. These Ss were selected 
aa 7 a very wide range of occupations and not as a 
N sample. As a result, Ss were ting a 
overrepresentative of males, white-color w orkers, older 
workers, the more highly educated, and those 
status occupations. : 

Part of the population interv € | 
preselected pairs of workers, that is, workers having 
the same job with the same employer, or having the 
same supervisor, or both, If S had been designated 
prior to his interview as a pair member, the interviewer 
verified this at the close of the interview by sking § 
if his assigned pair mate did indeed have the same job 
with the same employer and have the same supervisor, 
Additional pairs were generated by asking each un- 
paired S if he could name someone fulfilling the pair 
qualifications listed above. In this Way, four classes of 
S pairs were generated: (a) a class in which the pair 
members had the same job and the same supervisor as 


one another (SJ/SS); (b) a class in which the pair 
members had the same job as one another but different 
supervisors (SJ/DS); (c) a class in which pair members 
had different jobs but the Same supervisor (DJ/SS); 
and (d) a class comprised 


of random pairings of the 
rest of the Ss, giving those pairs different 


in higher 


iewed was composed of 


jobs and 
different supervisors (DJ/DS). Since the original 
number of Ss was 311, one S, chosen at random, was 
deleted from the analysis, 


Procedure 


Deriving the indexes. The 
professional interviewers em 
Research Center of the Univ 


the data obtained from these interviews Were the 13 
indexes of worker attitudes and satisfaction used in the 
subsequent analysis, Six of the indexes came from a set 
of items designed to assess job satisfaction, The $ was 
asked how true each of these items was of his job 
(e.g., “the work is interesting,” “J have enough time 
to get the job done”). Four alternative answers could 
be chosen: “Very true,” “somewhat true,” “not too 
true,” or “not at all true.” The items were then com- 
bined on the basis of a factor analysis that yielded the 
six satisfaction indexes The § y also asked how true 
each of several phrases was of his immediate supervisor 
(eg, “is friendly and approachable,” “checks fre- 
quently on the work of those Supervises”), These 


Ss were interviewed by 
nployed by the Survey 
ersity of Michigan. Among 


he 
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items also were factor analyzed and yielded four super 
vision indexes. ‘ alysis. 
ieee indexes were included in the a 
One was an index of job challenge, derived fror Tei or 
of questions concerning how much S's Do e a high 
allowed him to do a lot of planning ahead, ha À Lines 
level of skill, be creative, do a variety of aren aa 
and have to keep learning new things. ANON 2 
was composed of “content-free -Job pa friend?, 
questions (e.g., would § recommend his job to n things 
would S take the same job if givena chance tot o ley 
over again?). The final index was the job tenson ahd 
constructed by Kahn, Wolfe, Quinn, Snock, 
Rosenthal (1964). ntrol for 
Controlling for item reference. In order to Gan caved 
the extent to which the items in each index rts an 
job referent than supervisor referent, or a Kaea "the 
attempt was made to categorize cach in aa ‘Three 
basis of the predominant refe ence of its items. or ihe 
colleagues who were not familiar with the aims scale 
study were asked to rate each item on a Foie ail 
according to the degree to which they felt 4 i A to 
respondent” would answer that item with — 3 
his job itself, his supervisor, or some Se at 
the two. One end of the scale was labeled “job S d 
only,” the other end, “supervisor referent only, DRR 
the midpoint was labeled “equal reference. 1.00 to 
index was then assigned a score ranging from n ‘items 
7.00, based upon the mean of the ratings of a erent 
in that index by the three judges. Interrater agreer 
computed according to Guilford (1954), was ee 
The indexes were then divided into two s tS score 
upon the judges? ratings. Those indexes having P arsti 
of less than 4,00 were placed in a set called Lee sees 
half, while those with a score of greater than 4.01 ng Te- 
placed in the supervision-referent half ae gs 
ceiving predominantly supervision-referent ert 
(scores Sreater than 4.00) were the four indexes the 
the Supervision factor analysis and one index font 
job satisfaction factor analysis which measured ue 
faction with Supervision. The other indexes—the -free 
remaining joh satisfaction index the sage 
job Satisfaction index, and the job tension Ind nes 
received Predominantly Job-referent ratings ot a 
ess than 4.00), The job challenge index receive 


ati . sses 0 
rating of exactly 4.00 and Was not used in analy 
the two sets 


Computing the 
each inde: 


-omn 
ach 5 0! 
The scores for each 


re O! 
5 a score 
taking the mean s x for 


pair scores, 
l X Were derived by 3 
all items in the index. After scores on cach inde 


1 n ‘ferent? 
each $ had been obtained, the absolute differ 
between the scores of ca 


ed 
f ach pair of Ss was comput ne 
for each index. This absolute difference represente o 
extent of attitudinal dissimilarity between member 
a given pair, The absolute differences between pa 
Ss were the basic data that were used in the analye 
The mean of the absolute difference scores ot 
for the pairs Comprising a given class was then ©? cy 
puted, Providing the average absolute discrep e 
between workers characterized as having either (4) job 
Same job and the Same supervisor, (b) the same ‘me 
but different Supervisors, (c) different jobs but the $ 


: ae ni super 
Supervisor, or (d) different jobs and different sul 
Visors (see Table 1). 


SS LE Pees a eee an eee Te 


as 
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TABLE 1 
nes MEAN ABSOLUTE DIFFERENCE (Map) SCORES FOR Eacu INDEX BY CLASS 
M ap scores 
Index | | | 
| Class 1 Class 2 Class 3 Class + 
| (SJ/SS)* (SJ/DS)” (DJ /Ss)e (DJ,/Ds)# 
Job-referent indexes | 
Job satisfaction 1 | AS -60 45 53 
Job satisfaction 2 | 38 .56 253 252 
Job satisfaction 3 | 55 61 80 -60 
Job satisfaction 4 | 53 17 53 83 
Job satisfaction 5 49 75 25 -60 
Content-free job satisfaction | 69 71 .94 .91 
y Job tension .58 84 53 66 
Equal-reference Index 
_ Job challenge 32 56 53 T4 
Supervision-referent indexes | 
Supervision 1 43 1.05 50 | 66 
Supervision 2 68 -28 58 85 
Supervision 3 -66 1.25 -03 37 
Supervision 4 | 55 32 66 it 
Job satisfaction 6 .66 52 AD 74 
M ap— Total array | + 68 37 69 
M yp—Job-referent half 53 69 58 .66 
M \y—Supervision-referent half .60 -68 57 ral 


Abbreviations: SJ = same job, 


s supervisor influence. To assess 
ance of the job as opposed to 
supervision, the following between-class comparisons 
between Maps were made for the total array of indexes 
and for the job-referent and supervisor-referent sets as 
rated by the judges: 

Comparisons a: ing job influence were: (a) 
Mav(DJ/SS) versus Map(SJ/SS) and (b) Man(DJ 
DS) versus M ap(SJ/DS). 

Comparisons assessing supervisor influence were: (c) 
K lara versus Map(SJ/SS) and (d) Man(DJ/ 

S) versus Af p(DJ/SS). 

As can be seen in the job influence comparison 
each instance the supervisor factor remained constant 
between classes (either SS or DS in cach class) while 
the job factor changed. The opposite was true in the 
supervisor influence comparisons; the job factor 
remained constant in each comparison while the 
supervisor factor varied. In comparisons 4 and b, then, 
to the extent that differences between Maps were 
significant, an association was indicated between 
workers’ jobs and their attitudes. Likewise, in com- 
parisons c and d significant differences between means 
indicated the association of workers’ supervisors with 
those workers’ attitudes These comparisons were made 
for each of the 13 indexes separately (for purposes of 
sign tests), for the indexes pooled ove all, and for the 
two sets of indexes, pooled into job- and supervisor- 
referent halves as described above- 


Assessing job versu 
the actual relative import 


s, in 


SS = same supervisor, 


DJ = different job, DS = different supervisor. 


RESULTS 


For each of the 13 indexes, / tests were run 
r all four types of comparisons such that a 
positive / value always indicated results in the 
predicted direction. A sign test (Hays, 1963) 
was performed on the ¢ values for the com- 
parisons designed to assess supervisor influence 
and was significant at the .02 level. A second 
sign test was performed for the job influence 
comparisons. The proportion of these com- 
parisons that turned out in the predicted 
direction, however, Was not significantly 
greater than one would expect by chance; in 
fact, it was slightly less than .50. 

For each of the four classes represented by 
the columns in Table 1, the Map was com- 
puted for the entire array of indexes (n=13), 
the job-referent half (n=7) and the super- 
vision-referent half (n=5). These means are 
shown at the bottom of Table 1. 

The i-test comparisons for job and super- 
vision influence were then made among the 
four classes. The results are shown in Table 2. 


fo 
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TABLE 2 
Crass Comparisons ON POOLED INpDEXEs (f VALUES) _ 
|s vision i ence comparisons 
Job influence comparisons | pic sini sass e 
7 j= i 
| a Class 2 Class 
Class 3 Class 4 “lass 2 aa 
(DJ/SS) | (DJ/DS) | (Sj/Ds) ( A 
Pooled indexes a | ea | U vers 
Class 1 Class 2 Class 1 D 38) 
(SJ/SS) (SJ/Ds) (SJ/SS) 
7 2.01* 
PY | 09 2a | o 
Total array 5 | 2 Ly 86, 
Job-referent half | A | 08 = J 
Supervision-referent half | ? | ‘ 
N Abbreviations: SJ 7 same job, SS = same supervisor, DJ = different job, DS = different supervisor. 
ep < 05 -tailed test). 
sp S201 Gnetailed test): 


The first row of the table shows that, overall, 
having supervision in common was much more 
highly associated with shared attitudes between 
workers than was having the same job as one 
another. Both comparisons assessing the 
influence of supervision were significant 
yond the .05 level, while neither 
assessing job influence was significant. Results 
in the last two rows show that, even with the 
predominant reference of the index items con- 
trolled by dividing the indexes into job- and 
supervision-referent halves, no comparison 
assessing job influence was significant. Re- 
garding supervisor influence, one of the two 
comparisons for both the job-referent half and 
the supervision-referent half was significant 
at least beyond the .05 level. 


be- 
comparison 


Discussion 

How can one account f 
association of common su 
tude similarity, 
lack of association 


or this consistent 
pervisor with atti- 
coupled with a consistent 
of common job and attitude 
similarity? It is possible that the distinction 
made between factors that are “job influenced” 
as opposed to those that are “supervisor 
influenced” js mainly artificial, Reconsider 
the extent to which a Supervisor may indeed 


various job factors, 

Myers (1964) points out the importance of the 
isor i i h motivation and 

» Many of Herz- 
dissatisfiers (eg. re 
cognition, responsibility, advancement, 


personal relations with the superviso 


inter- 
r) can be 


“ated 

Viewed as heavily, if not completely, arar 
by the supervisor. And, too, these shown 
mediated by the supervisor have been H anc 
to be more important for both —* not 
dissatisfaction than other job dimension: E, 
so likely to be mediated by the SURE en 
(Dunnette, Campbell, & Hakel, 1967). T sever! 
more, these findings held up across Š are 
diverse occupational categories, such p der’s 
found in the current study. ed e aa 
(1963) factors of job satisfaction also E not 
many items that seem under partial, 1 
entire, control of the supervisor. ei 

But a more fruitful explanation cai y a 
one related to the well known hierarc nail er 
needs proposed by Maslow (1954). eet are 
the following argument: Lower order P iighes 
met mainly through the job itself. EE o 
order needs (self-actualization and, to „ough 
extent, esteem) also are met mainly aro 
the actual job. But the intermediate Td 

belongingness and such esteem m one 
as the desires for Prestige, recognition, pa 
tion, and aAPpreciation) can be met to 2 work 
extent only by people, and, in the -yisol 
environment, especially the super ec a 
Furthermore, the supervisor probably 2 a 
both the amount of the need and the degras 
which the need is satisfied, which combin 
produce 


o be 


, love, 


Mors? 
satisfaction, according to i: pov’ 
(1953) among others. If one grants the # 


3 - indings ° 
argument and then examines the fin 


a 
= ; a that, * 
this study, the conclusion may be tha idie 
. . . . 7 s siU 
this point in tme, workers of the kinds sti dle’ 
. 10 
are generally at the stage of meeting M 


' 
? 
| 


Fin. 
i 
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to upper-level needs such that job character- 
istics are not so influential as they once were. 

The job environment of workers at all levels 
has been improved, especially at the higher 
levels that are prevalent in this sample. Haire, 
Ghiselli, and Porter (1966) found greater 
satisfaction and fulfillment of needs in higher 
level than in lower level managers in the United 
States, Blai (1964) found Maslow’s higher 
level needs selected as important by workers 
with increasing frequency from low in trades- 
manual and service occupations to high in 
professional occupations. Lower level needs 
were selected as important more frequently 
by Ss in low socioeconomic status groups. 
Following this reasoning, one might expect 
that in some other countries less economically 
advanced than the United States, need satis- 
faction would be lagging comparatively, and 
this should be expected to result in a different 
order of salience than is found in a more 
advanced economy. This seems to be con- 
firmed by Lahiri (1965), who found salary and 
security to be judged as the two most im- 
portant job factors by both government and 
nongovernment clerical white-collar workers 
in India. “A good and sympathetic supervisor” 
was ranked consistently low in importance. 
Lahiri & Choudhuri (1966) reported similar 
results for technical and nontechnical em- 
ployees of an industrial organization in Cal- 
cutta. In addition, an international study by 
Haire et al. (1966) found Indian managers 
reporting the highest degree of fulfillment of 
security needs but the lowest fulfillment among 
managers in all 14 countries of the sample for 
esteem and autonomy needs and the second 
lowest for fulfillment of self-actualization needs. 

Maslow (1954) proposed that “needs cease 
to play an active determining or organizing 
role as soon as they are gratified [p. 105.” 
If the above argument has any validity, it is 
little wonder that the job itself, if it mainly 
meets already gratified needs, may show little 
association with worker attitudes. It is like- 
wise easy to see why the supervisor, who plays 
a key role in meeting the needs speculated as 
most salient at this time, is strongly 
attitudes and 
work 


being 
associated with those same 


perceptions. One might suggest that 


environments will continue to improve, meet- 
ing higher and higher level needs, and if the 
above argument indeed holds, then as time 
passes we shall witness the reemergence of the 
job characteristics as the critical factors in 
worker attitudes through their relationship to 
the highest order needs. 
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beyi m a simple additi sk was sxert a significar 
revious performance a simple ition task was found to exert a sigr | $ ur 
ive mpact on the goal levels set by 80 Ss. When performance effects were 
positive 1 if 


accounted for, knowled 


lge of results (KR) also influenced goals significantly. 
g ( 


When the effects of four forms of KR were compared, correct KR eres gom 

si ificantly above that generated by no KR, while incomplete inc ne i 
—j A mificantly, and erroneous low KR decreased performance below the 
acta ae, with no KR. Implications of the findings for administrative practice 
eve! 


and goal-setting theory were discussed. 
a goa ? 


The question of the motivational determi- 
nants of work performance is now one of the 

imary concerns in the literature of applied 
non Aa (e.g., Porter & Lawler, 1968; 
Schwab pa Cummings, 1970; Vroom, 1964). 
Several theoretical orientations have been 
taken on this issue. The two-factor theory 
(Cummings & El Salmi, 1968; Herzberg, 1966; 
Herzberg, Mausner, & Snyderman, 1959), for 
example, emphasizes the role of job satisfaction 
in determining performance levels. In expect- 
ancy theory (Porter & Lawler, 1968; Vroom, 
1964) a crucial role is played by perceptions 
of the relationship between performance and 
the receipt of valued outcomes. Equity theory 
(Adams, 1965; Pritchard, 1969) emphasizes 
social comparisons and the impact of feelings 
of dissonance on performance. 

Recently Locke and his associates (Locke, 
1968a, 1969) have emphasized the importance 
of conscious goal levels as determinants of 
employee performance (Locke & Bryan, 
1969a, 1969b; Locke, Cartledge, & Koppel, 
1968). In summarizing his work on this issue 
Locke (1968b, p. 157) has argued that: (a) hard 
goals produce a higher level of performance 
(output) than easy goals, (6) specific hard 
goals produce a higher level of output than a 
goal of “do your best,” and (c) behavioral 
intentions regulate choice behavior, 


1 The authors would like to acknowledge the assist- 
ance of Leonard Berkowitz, Lee Dyer, and William 
Devitt in conducting this study, 
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Wisconsin 53700. 
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ersity 
Madison, 


40s Angeles, 
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Locke (1968b) has also argued that insic 
and intentions mediate the effects of eo 
rewards such as monetary incentives, mami 
edge of results, and time limits on per ani 
ance. In general it is argued “that a do 
incentives such as instructions or footie ats 
not affect behavior independent of a 
on goal setting or intentions [Locke, Cartlede 
& Knerr, 1970, p: 152].” diate 

If goals determine performance and me “ei 
the effects of extrinsic factors, it is prc 
less possible for various types of external 1 ate 
tives to influence the level of goals ee 
though Locke et al. (1970) have ay ane 
that their “model does not delineate the fac pt- 
which influence the establishment and acce} in 
ance of (end) goals and general values ae > 1o 
the first place [p. 154],” it is interesting 
note that at least one study (Locke & BIR) 
1968) suggests that knowledge of results that 
may influence the level of goals set. In med 
study two groups (KR and no KR) perlo! vals 
on a computation task for 16 5-minute on 
At the end of the eighth, before the ninth, ed 
at the end of the sixteenth trials, Ss were © list: 
to specify goals on a 10-item goal chee fie 
The authors report that the goals sper’ ily 
on the third questionnaire were signilic” ted 
related to KR, higher goals being associ 
with the group that received KR. gest 

Taken at face value these results SUED are 
that correct KR results in higher goals. l vet 
are several limitations of the study, howe! i 
which reduce the confidence one can plae fot 
the findings and thus suggest the pee goal 
additional research on this issue. First, rou? 
items were specified in potentially ambigui y 
terms (e.g, “working along at a reason 


als 
goal 


20 


orp 
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fast pace but not the fastest possible”). Second, 
scores of Ss who responded to 3 of the 10 items 
were dichotomized and correlated with KR. 
This procedure resulted in the elimination of 
an unknown number of Ss who checked one 
of the other seven goal items. Third, the rela- 
tion between KR and goals was described for 
the three goal statements, but statistical results 
are only presented for the last one following 
Trial 16. 

Finally, inadequate account was taken of 
the potential relationship between previous 
performance, KR, and goals. Specifically, pre- 
vious performance may be instrumental in 
the establishment of goals, the presence Or 
absence of KR adding nothing, once diler- 
ences due to past performance are accounted 
for. Unfortunately, KR and previous perform- 
ance may have been confounded in Locke and 
Bryan’s (1968) design. Higher goals were not 
stated until after the last trial. By that time, 
however, the KR Ss had performed at a signi- 
cantly higher level. Thus, it is impossible to 
tell whether the higher goals were a function of 
the KR manipulation or the higher perform- 
ance since the timing of the goal estimates 
(after the last trial) makes this an ex post 
facto, not an experimental, study. 

One might expect poth variables to influence 
the level of goals set. Both, for example, 
presumably reflect on individual’s abilities and 
motivational states (e-8- achievement motiva- 
tion), Persons do not normally aim for goals 
that their previous performance indicates 
would be unrealistic or nonchallenging. Thus, 
previous performance would be expected to 
exert a direct influence on goal level. Knowl- 
edge concerning the favorableness OT accept- 
ability of past performance, however, is nor- 
mally mediated by relevant others in one’s 
environment. Thus, the effects of inte 
KR are also potentially 


ntional 


manipulations of 
important. 

The present study was designed to investi- 
gate the impact of past performance and sev- 
eral KR conditions 0? the level of goals set. 
Two hypotheses were tested: (a) the higher 
the level of previous performance, the higher 
the level of goals set for future performances 
and (b) when the effects of previous perform- 
ance are accounted for, the greater the amount 


and accuracy of KR, the higher the level of 
goals set. 


METHOD 

Subjects 

The Ss were g0 undergraduate student volunteers 
(22 male: 58 females) enrolled in an introductory 
psychology course at the University of Wisconsin. The 
experiment was conducted in a regular classroom during 
four consecutive sessions. Each of the 80 Ss had signed 
for one of the four designated sessions thus creating 
four groups of 20 each. 


Task and Procedure 


The study employed a simple addition task. Each S 

was given partially completed artificial copies of the 
monthly time sheets used by the hourly student 
assistants employed at the University. The daily totals 
had been entered in a column on the right-hand side 
of each sheet. The task involved totaling the column 
of daily hours to determine the monthly total and 
entering this total at the bottom of the column. The 
total number of hours worked during the month ranged 
from 139 to 159 with a mean of 148. 
Each S was given 20 different time sheets repeated 
4.5 times to make a total of 90. A pilot study indicated 
that Ss would not be able to complete all 90 during the 
time allotted to the experiment. The time sheets were 
ordered so that each S in a group would have a unique 
combination that was matched by one equivalent 
combination in every other group. 

To control competition, KR was provided only in 
numbers and not in relation to a group standard or 
average. Thus, each S had only his past performance 
to use as & standard for appraising KR in order to set 
goals. Talking was also prohibited during the experi- 
1 period and was monitored during the rest 
period while Æ was absent. These controls may not 
have been successful in entirely eliminating competi- 
tion, howeveT, since within each group SS could observe 
one another. The Ss could have used these observations 
to establish a rough group standard for evaluating 
KR and for setting goals. However, there jis no reason 
erential effects of this type of possible 


to expect diff is 
contamination across the four conditions of the study. 
he following procedure 


In each group, E employed t 
to obtain Period-1 performance data, Each S was given 
a manila folder containing @ blank time sheet, a blank 
orange-colored sheet, and the 90 time sheets, in that 
order. The Ss were instructed to open the folder and 
look at the blank time sheet. The addition task was 
then explained. The Ss were told to complete each time 
sheet in the order in which they appeared in the folder. 
At a given signal, Ss removed the blank time sheet and 
orange-colored paper and started working on the time 
sheets. After 20 min. Ss were asked to stop working on 
the task and to insert the orange-colored sheet after 
last time sheet worked on whether or not it was 


menta 


the 
completed. 

The Æ then collected the folders and told Ss that 
they would have a rest period of 5 min. During this 


528 


period, Ss were required to remain seated at their 
desks and talking was not permitted. An O monitored 
the groups as Æ took the folders into another room to 
tabulate the results. During the 5-min. period, E 
counted the number of completed time sheets in each 
folder and checked each time sheet against a correct 
listing of the total number of hours for each sheet. 
From this tabulating procedure a master sheet was 
prepared listing each S according to the number on 
his folder and giving the number of time sheets he com- 
pleted and the number completed correctly. After the 
rest period, Æ returned to the experimental room and 
distributed the appropriate folders to each of the 20 Ss. 

At this point conditions in each of the four groups 
differed. Each S in the KRe (correct) group was in- 
formed of the total number of time sheets he completed 
and the number completed correctly. The KR, (incom- 
plete) group was informed only of the number of time 
sheets completed incorrectly. (The S was not told the 
total number of time sheets that were completed.) 
The KRy (no) group was given no information about 
individual performance. The KRgr, (erroeous low) 
group was informed that 50% of the number actually 
completed were correct. x 

After the KR conditions we 
lished, all groups were again given the same instructions, 
Ona printed form Ss were asked to indicate “How 
many time sheets do you expect to complete” in the 
second work period of 20 min. The latter data con- 
stituted the measure of goals employed in the present 
study. After working on the task for an additional 20 
min., all time sheets were collected. The Ss were then 


thanked for their participation in the 
hank a a study a 
dismissed. e a 


re differentially estab- 


Resets 


The hypotheses Stated earlier now can be 
elaborated as follows: (a) Goals are a Positive 
function of i and (b) 

S of previous performance a 
accounted for, highest goals will be set by the 
Ro Sroup, next highest goals will be set by 
t a stoup, second lowest goals will be 
set by the KRy group, and the lowest goals will 
be set by the KRgr group. 


The design employed one f. 


and one continuou 
Period-1 performance (B). Give 
employed and the 
data ca E: 
‘ ane be an using either 
analysis of Varlance/covariance (AV/ACV) 
multiple regression (MR). The latter tót d a 
was chosen because est} of ieee 
lained i eed 
p al correlations 
output of the 
al for our use. A 
amiliar with tradition 


iate, 
design 


am available 
reader who is more f l 
al 
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AV/ACV is referred to Cohen (1968) for & 
discussion of the similarities between MR and 
AV/ACV procedures and output. k 
Analysis was first conducted to determine 
if KR alone was associated with variation in 
goals. For analytical purposes KR was cate- 
gorized into three zero-one “dummy” variables 
with the KRy control group coded zero 1m all 
three categories. Significance of the effect 0 
KR on goals is provided by an F test of the 
multiple coefficient of determination (R°?) er, 
goal level is the dependent variable and al 
three dummy variables are included as inde- 
pendent variables, R? indicates the proportion 
of goal variance attributable to KR and i 
exactly analogous in this situation to eta 
Square in analysis of variance. Moreover 
given the procedure employed, the regression 
coefficients (b;) associated with each KR; Coy 
dition indicate the mean goal-level different? 
between that KR group and the KRy grouP 
(Cohen, 1968). As such, each b; reflects ae 
average impact of KR; on goal levels relativ? 
to receiving KRy, . 
Using the Procedure described above; i 
was found that KR had a significant impa 
F = 5.18; P < .01) on the level of goals ve 
(R = 17), Table 1 shows mean goal leve” 
for all four KR groups and mean goal differ 
ences between each KR treatment and pa 
contol group (KRx). The partial correlati! 
coefficients show the relevance of these g0% 
ie rpk À correlational terms. It Toant 
diffe uy the KRo group set signi up 
et (higher) goals than the KRw gr? of 
Gare i was hypothesized that previous Pi i 
ance would influence the goal levels $ 


TABLE 1 


KRe, KR, 


Unanvyustep 
Ss AD. 


. as KEN 
AND KRpy VERSUS K 


Seek ETERMINANT OF GOALS ed 
= SS = 
Condition | Unadjusted Difference Partial | t 
M with KRy 
aea 0) | | 
lae de 
KRy 14.85 Th ee 3° 
8: = — 8 
KRe 20.90 | 6.05 30 30 
KR; 15.15 30 02 95 
KRer 15.30 AS 03 A 
a2 p 
= à edie of realis, Page 
"Of results, KRi = incomplete KNOW 


= erroneous low knowledge of results. 


| 


| 


DETERMINANTS OF GOAL SETTING 


additional analysis was conducted to identify 
this effect, if any, and to determine whether 
performance differences would mediate the 
KR effects observed in Table 1 where perform- 
ance was not accounted for. In an MR context, 
Performance is treated as a continuous indepen- 
dent variable introduced into the regression 
equation prior to KR; the latter continues to 
be categorized into three dummy variables. 
Using this procedure it is possible to identify: 
(a) the impact of performance on goal levels, 
(b) an overall incremental impact of KR on 
goals when performance is partialled out 
(exactly analogous to ACV treating perform- 
ance as the covariate), and (c) mean differences 
between each KR and the KRx groups after 
adjustment for any between-group perform- 
ance differences. 

Regression analysis showed that 26% of the 
goal variance was accounted for by the differ- 
ences in previous performance (F = 28.90; 
p < .01). When KR was entered into the equa- 
tion following performance, the amount of 
goal variance explained increased to 4 
(F = 19.80; p< 01). The latter equation 
Was employed to calculate adjusted mean goal 
levels (i.e, taking account of previous per- 
formance differences) and to test for differences 
between each KR treatment and KRy. The 
results of this analysis are shown in Table 2. 
In addition, partial correlations are reported 
showing the effect of each KR condition rela- 
tive to KRy in correlational terms when per- 
formance is held constant. The results generally 
confirm the findings obtained when previous 
performance was not controlled, The KRo 
group set significantly higher goal levels than 


TABLE 2 


Apjustep KRe, KR, AND KRer VERSUS KRw 
AS A DETERMINANT OF GOALS 


| 
| Difference | partial 


Condition Adjust with KRN] ¢ t 
(bi) | 7 
KRy 15.20 — = | = 
KRe 20.74 5.54 40 3.74 
KRr 16.37 1.17 | 09 ‘78 
KRex 13.69 | seS —11 |—1.00 


:KRn = no knowledge of results, KRe 
i KERN, KRr = incomplete knowledge 
low knowledge of results. 


_ Note,—abbreviations 
of correct knowledge of 
results, KReL = erroneous 


b <01. 
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the?KRy group. When previous performance 
is accounted for, the KRyr group set lower 
goals than the KR» group as hypothesized. 
However, neither KRer nor KR; groups dif- 
fered significantly from the KRy group. 

Thus, the results of the present study show 
that: (a) KR influences goal levels. (b) Cor- 
rect KR increases goal levels significantly 
above those generated by KRx, while KRy 
and KRgr do not. (c) Previous performance 
on the task affects goal levels, the greater the 
performance, the higher the subsequent goal. 
(d) When the effects of previous performance 
are accounted for, AR still influences goal 
levels. And (e) when the effects of previous per- 
formance are accounted for, correct KR still 
increases goal levels significantly above KRx 
while KR; results in insignificantly higher, and 
KRez results in insignificantly lower goal 
levels than KRy. 


DISCUSSION 


It appears clear that KR influences the level 
of goals set for subsequent task performance. 
The relationship is, however, more complex 
than this conclusion implies. Previous perform- 
ance also exerts a significant impact on level 
of goals. In the present study, performance 
and KR accounted for almost half of the goal 
variance. This finding is particularly important 
in view of the possible confounding effects of 
KR and performance on goals in the Locke and 
Bryan (1968) study. In that study, Ss made 
goal statements three times; first, they were 
asked what their goals had been on the first 
eight trials after the eighth trial; second, they 
were asked what their goals were for the next 
eight trials; and third, they were asked (after 
the sixteenth trial) what their goals had been 
for the second eight trials, All Ss were given 
knowledge of the correctness of their scores 
after each of the 16 trials. The KR Ss kept 


track of their scores while the KRy Ss were 
ignificant differences in 


not allowed to do so. S 
performance between the KR and KRy groups 
were found only for the second set of eight 
trials. It was also found that significant differ- 
ences in goal statements were obtained only 
on the third goal statement questionnaire. 
Thus, at that point, the KR Ss had performed 
at a higher level making it impossible to dis- 


tinguish between KR and performance as 
determinants of goals. Since no differences in 
goals were found between the KR and KRy 
Ss at the end of Trial 8 (where no significant 
performance differences had emerged), the 
suspicion might have been that past perform- 
ance was more instrumental in determining 
goals than was KR. On the other hand, the 
present study more clearly demonstrates the 
independent and combined effects of KR and 
performance on goals. ; 

It is interesting to note that approximately 
half of the variance was not explained by KR 
and performance. Additional possible deter- 
minants of goals might be the Motivational 
states (need for achievement and competence) 
and aspiration levels of performers. As noted 
earlier, these characteristics may be reflected 
in past performance and goals, but only parti- 
ally so. In addition, goal levels may be influ- 
enced by the performer's perceptions of the 
credibility of the task upon which he is 
to establish goals and the meaningfulness or 
value of the task to his own self-esteem and 
central interests. The influence of these po- 
tential determinants on goal levels were not 
investigated in this study. 

In addition, all forms of KR 
goals to the same degree. In 
vious performance is 
actually resulted in som 
KRy. KR;, 
goal levels on 


asked 


do not influence 
fact, when pre- 
accounted for, KRe, 
ewhat lower goals than 
even though accurate, incre 
ly slightly above KRy. 

If replicated in field Studies, the potential 
implications of these findings for administra- 
tive practice are somewhat disquieting. Man- 
agers are often urged to let employees know 
how they stand, that is, provide KR. If the 
Standards against which performance is to be 
evaluated are not clearly defined and/or if the 
measurement processes used are not sufficiently 
Specific to generate accurate KR, little m 

gained by Providing performance feedback, 
At best, incomplete feedback may have little 
impact on goals over Providing no KR, A 
policy of Providing conservative performance 
feedback (erroneous low) on the assumption 
that it is better to Jet the subordinate be 


ased 


ay be 


L. L. Cummincs, D. P. SCHWAB, AND M. ROSEN 


) 
: . . ly 

surprised on the “upward side” may si 

be less functional than providing no feedba 
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ENT AND NONCONTINGENT REWARD ON 
BETWEEN SATISFACTION 


PERFORMANCE 


EFFECTS OF CONTING 
THE RELATIONSHIP 
AND TASK 
DAVID J. CHERRINGTON 
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It was proposed that there is no inheren 
productivity, and that relationships betwee 
upon performance-reward contingencies. 
monetary reward was then delivered to 21 
performers. The Ss next completed self-repo 
and performed the same task for another h 
satisfaction and second-hour productivity 
correlations, however, were found between 
riately reinforced Ss (rewarde 

while significant negative corr 
Ss (rewarded low performers and 


on the relation- 


Many current speculations 
ction and task 


ship between worker satisfa 
performance, as reviewed by Schwab and 
Cummings (1970), still imply that performance 
and satisfaction are causally related in one 
direction or the other. Some theorists have now 
added moderating variables to their behavioral 
formulas in hopes of facilitating the prediction 
of one variable from the other. 

In contrast, the present authors postulate 
not only that (a) there is no inherent relation- 
ship between satisfaction and performance, but 
also that (b) one can produce about any 
empirical relationship between task perform- 
ance and self-reports of satisfaction that one 
wishes. The first proposition is consistent with 
the conclusions of empirical reviews such as 
Brayfield and Crockett (1955) and with 
theoretical positions such as those of Porter 
and Lawler (1968). The second proposition 
was derived from operational proposals by 
Skinner (1969) and Bandura (1969) and from 
speculations by a variety of reinforcement 
theorists (e.g. Berlyne, 1967; Bindra, 1968; 
Rescorla & Solomon, 1967; and Weiskrantz, 
1968). 

Skinner (1969), fer inst 
feelings are, at best, accomp: 


ance, insists that 
animents rather 


ts should be sent to H. Joseph 
Graduate School of Business, 
on, Indiana 47401. 


1 Requests for reprin 
Reitz, Business 650, c 
Indiana University, Bloomingt 
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Nine 


d high performers a 
elations were foun 
| nonrewarded high performers). 


t relationship between satisfaction and 
n the two variables are highly dependent 
ty Ss performed a task for 1 hr. A 
f 42 high performers and 21 of 42 low 
asures of satisfaction and attitudes 
our. Correlations between self-reports of 
over all Ss was .00. Significant positive 
satisfaction and productivity of approp- 
nd nonrewarded low performers) 
d for inappropriately reinforced 


o! 
prt me: 


than causes of behavior and that both are the 
products of common environmental variables. 
Bandura (1969, p. 598) pointedly suggests 
that we might better treat self-reports of 
satisfaction simply as another class of behavior 
rather than as indexes of an underlying state 
endowed with special causal powers. From this 
point of view, there exists no inherent relation- 
ship between performance and self-reports 
of satisfaction, and the empirical problem 
becomes that of examining the conditions 
under which the different response systems 
are correlated or independent. 

We suggest that the type of reward system 
under which workers perform might strongly 
influence the satisfaction-performance relation- 
ship. One should be able to directly influence 
satisfaction by proferring or withdrawing 
rewards. To significantly influence perform- 
ance, however, one must use performance- 
contingent rewards. Theoretically, then, by 
manipulating the contingencies of a reward 
system, one should be able to create conditions 
under which satisfaction and performance 
can be, empirically, either independent, or 
positively related, or negatively related. 

To test these propositions let us define three 
types of performance-reward systems: 


1. Random rewards. Rewards are distributed 
on bases independent of performance. The 
percentage of high performers receiving re- 
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wards does not differ from the percentage of 
ow performers receiving rewards. Under pam 
system we hypothesize that —*, 
vetween satisfaction and performance wou 
10t significantly differ from zero. 

2. Positively contingent rewards. Rewards 
are based directly on performance. All high 
performers are rewarded; all low performers 
are not rewarded. Under this system we 
hypothesize a positive correlation between 
satisfaction and performance. 

3. Negatively contingent rewards. Rewards 
are based on factors inversely related to 
performance. Low performers are rewarded ; 
high performers are not rewarded. Under this 
system we hypothesize a negative correlation 
between satisfaction and performance, 


METHOD 
Subjects 


The Ss were 90 undergraduate students, both m 
and female, enrolled in a junior-level business cou 
Students were enlisted as volunteers 
Flexibility Tests for which they would 
$1 per hour. 


ale 
rse. 
to score Closure 
be paid at least 


Procedure and Task 


The Ss reported to the laboratory in groups that 
varied from seven to nine individuals and were met by 
E who introduced himself as a graduate student who 
would be their supervisor, 

When all Ss had arrived and hi 
selves with the Closure Flexibili 
was told that his task would be to score the tests at 
his work station. The Ss were al: 
had been completed by e 
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All of the Closure Flexibility Test booklets had in 
fact been marked according to 24 patterns of seo 
Therefore, the difficulty of the task was controlle 
and the correct responses were known in advance. 

The Ss were told that they would be paida staan 
of $1 per hour but that the best performers ae 
of quality and quantity would receive an adatan 
$1 bonus. The Ss were told that each had as ha 
chance of receiving the bonus. The task of scoring: a 
test booklets was explained to Ss, aided by diagran 
showing a partially marked answer sheet. in 

After Ss began the task, Æ returned every 10 T 
to bring additional booklets and collect the boo! ie 
and answer sheets that had been completed. At fee 
end of 1 hr., E stopped Ss and asked them to inie 
on four 7-point scales an estimate of their quanti A 
quality, and overall performance and probability o 
receiving a reward.? The experimental manipulations n5 
financial reward were then performed, which consis! 
of the following statements and payments. 


T have collected the t as you have scored ae 
and selected a sample from cach of you to chec 
your work. I’ve used a rather complicated index 
which combines quantity and quality of perform- 
ance into one index, Based upon this index four © 
you will receive $2 for the last hour’s work. The 


other four will get $1. The four winners are «++ and 
the four losers are ---, 


In fact, the mor 
tributed to half of the high-performing and ha! 
the low-performing Ss. tal 

The performance score for each S was the ity 
number of rows of figures in the Closure hae 
Test booklets that S scored correctly. ‘This was de the 
mined by subtracting the number of errors from ce 
total number of rows scored, Thus performan 
Scores were a measure of both quality and quantity. y 

The Ss were then asked to complete a self-reP is 
measure of satisfaction developed by Scott and vie 
colleagues (Scott, 1967; Scott & Rowland, 1970): * io 
format of the self-report measure was a seman 
differential questionnaire as shown below. 


» dis- 
netary bonus was randomly if 6 


Me At This Task 
Neither 


Extremely 

Appreciated — 
ored — = = 

Efficient i , E 


Quite 


Bipolar adjective pairs were set 
me at this task, my pay, my fellow 
task. The response to ez 


> DY averaging 
‘ales Previously found to 
Eight factor scores were’ 


) computed for each § 
General affective tone score w. ` 


as obtained by averaging 


7 one nor 
Slightly the other Slightly 


Quite Extremely aa 
we — Unappreciates 
= : - Interested 
“3 Inefficient 


inst 
S's responses to the following bipolar scales set agains 
the concept, me at this task : appreciated-unapprec!# ure 
rewarded-penalized, Satisfied-dissatisfied, and ones as 
aged-discouraged. The remaining self-report meee 


a 
x g 8 
and the semantic scales defining each factor We" 
as 


e 
2 f SE” n 
There were no significant differences betwee? sive 
reward and nonreward group in either the subjec’ e- 


Probability of being rewarded (5.42 vs, 5.22, resP 
tively) or in estimated performance (5.37 vs. 5.32): 


ee 


£ 
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S general arousal (me at this task): interested- 
T and alert-listless; personal 

e me at this task): efficient-inefficient, 
productive-unproductive, reliable-unreliable, and effec- 
= veuneticetive ; general satisfaction with pay (my pay): 
ae agit bi reasonable-unreasonable, superior- 
pay fe an rewarding-penalizing ; equitableness of 

y pay in comparison with what others in my 
group received): fair-unfair, high-low, and reasonable- 
unreasonable; adequacy of pay (my pay in comparison 
with what others get for similar work on the campus) : 
superior-inferior, high-low, and reasonable-unreason- 
able; interpersonal attractiveness (my fellow workers) : 
sociable-unsociable, helpful-obstruct pleasant-un- 
pleasant, unselfish-selfish, and cooperative-uncoopera- 
tive; task attractiveress (the task attractive-repul- 
Rive, exciting-dull, good-bad, interesting-boring, super- 
lor-inferior, and wholesome-unwholesome. 

After completing the semantic differential scales, 
Ss took a 5-min. break, after which they were given 
More test booklets and asked to continue scoring them 
for another hour, ‘The procedure at the end of the 
second hour was identical to that at the end of the first 
hour. First, Ss were asked to estimate their perceived 
of reward on the four 


performance and probability 
scales. Then they were paid and asked to fill out the 
semantic differential questionnaire. The monetary 
bonus was distributed to the same Ss who received it 
at the end of the first hour. 

After the questionnaires were 
second time, Ss were debriefed. They were told that 
all rewards were given at random, and since it was 
only by chance someone received more than the others, 
the winners were asked to share their rewards with the 
losers, although / did not insist that they do so. 


filled out for the 
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RESULTS 


We have hypothesized that there is no 
inherent relationship between satisfaction and 
performance, but that there is a direct relation- 
ship between the rewards and self-reports of 
satisfaction. Whatever covariation one does 
observe between satisfaction and performance, 
then, may depend upon other environmental 
he kind of perform- 


conditions, particularly t 
that has been ar- 


ance-reward contingency 
ranged or has evolved. 

The results summarized in Table 1 demon- 
strate the differential effects of rewards on 
performance and satisfaction. Performance 
scores of rewarded Ss did not differ from those 
of nonrewarded Ss. 

Mean satisfaction scores of rewarded Ss, 
however, were significantly greater than mean 
scores of nonrewarded Ss on each of the eight 
satisfaction indexes. Thus self-report measures 
of satisfaction can be highly dependent on 
the occurrence or nonoccurrence of monetary 
reward, but if that reward is delivered in- 
dependent of performance, it fails to have 
incremental effects on subsequent performance. 

To test the effects of variations in perform- 
ance-reinforcement contingency, Ss were classi- 

3 Raw scores are repo 
scores, however, were Use! 


rted in the tables. Normalized 
d in all statistical analyses. 


TABLE 1 


COMPARISON OF MEAN PERFORMANCE 


REWARDED VERSUS + 


AND SATISFACTION SCORES OF 
ONREWARDED SUBJECTS 


X Scores 
Item Dm 
Rewarded Ss Nonrewarded Ss 
Performance 
First hour 285.6 n 5 Le 
Second hour 403.8 7 
Satisfaction indexes a | 383 age 
General affective tone 5.16 Henn A 
General arousal 4.98 i Ty 
Personal competence 5.82 A joe 
General satisfaction with pay 5.61 oA | om 
Fc hak ner is 3.40 eet 
Adequacy of pay re et a 
Attractiveness of fellow workers £39 ts | on 
Attractiveness of task © ` | d 


Note.—Abbreviation Dm = Difference oF means. 


*p <.01. 
> < 005. 
weep < 001. 
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TABLE 2 


EAN R SCE AND SATISFACTION SCORES OF APPROPRIATE 
COMPARISON OF MEAN PERFORMANCE AND SATI 


VERSUS INAPPROPRIATELY REINFORCED SUBJECTS 


X Scores | 
] | Dm 
Res Appropriately | Inappropriately 
reinforced Ss | reinforced Ss 
| = 
ey a | - 
a a i 291.6 280.9 10.7 
cond Hour 428.4 385.9 | 42.5 

Satisfaction | 3 
General affective tone 4.26 4.11 | at 
General arousal 4.12 4.02 a 
Personal competence 5.30 5.07 paa 
General satisfaction with pay 4.56 | 4.11 A5 
Equity of pay | 4.40 4.18 En 
Adequacy of pay 4.18 3.64 DF 
Attractiveness of fellow workers 4.85 | 4.78 07 
Attractiveness of task | 3.31 3.27 04 

Note.—Abbreviation Dn = Difference of means, n ~ 

*p <.05. 

ep <1, 


fied on the basis of first-hour performance. 
The 42 Ss whose performance scores were 
above the median of their group are referred 
to as high performers and the 42 Ss below the 
median as low performers. Six Ss who per- 
formed at the median were excluded from 
further analysis. Rewards at the end of the 
first hour were distributed randomly 
high and among low performers so th 
the Ss in each performance classific: 
rewarded. The appropri 
reinforcement was then 
formance-rewa 


among 
at half 
ation were 
ateness of each S’s 
defined by the per- 
rd contingency that happened 


TABLE 3 


CORRELATIONS BETWEEN SATISFACTION AT END oF 


First Hour AND SUBSEQUENT PERFORMANCE 
DURING SEconp Hour 


Appro- | Inappro- 
ately | Driately 


Satisfaction indexes inn | 


General affective tone 
General arousal | 
onal competence 

isfaction with pay 


of fellow workers 
f task 


AAA I It 


to occur for that S. The 21 rewarded high 
performers and the 21 nonrewarded low 
performers were classified as appropriately 
reinforced. The 21 rewarded low performa 
and the 21 nonrewarded high performers We 
classified as inappropriately reinforced. oat 
The results shown in Table 2 reveal, frst 


9 cee 
that the average second-hour (postreinfor 


> i rei oa 
ment) performance of appropriately reinfor o! 
Ss was significantly higher than that he 
mMappropriately reinforı 


ced Ss, whereas g 
inforcement) performance y 
the two classes did not differ significant” 
Comparisons between the two classes on 5° a 
reports of satisfaction, however, reveale i 
Significant difference on only one of the “The 
satisfaction indexes, adequacy of pay- ie 
tests summarized jn Tables 1 and 2 ee 
establish that, as hypothesized, it iS yal 
contingency of a reward on performance ae 
enables it to bring about increments in Pie, 
formance. That contingency character n+ 
however, has little or no effect on satisfac" is” 
Two different relationships between ee 
faction and Performance were investiga’ gf 
(a) the relationship between self-report? nd 
Satisfaction and Subsequent performances ‘yd 
(b) the relationship between performance j 

subsequent self-reports of satisfaction. saa 


pi 
Table 3 presents the results of correla? 


first hour (prere 
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analyses performed between the eight satisfac- 
tion indexes recorded at the end of the first 
hour and subsequent performance during the 
second hour. The data for all Ss, among whom 
as many low performers as high performers 
were rewarded, revealed only one significant 
correlation between satisfaction and sub- 
sequent performance. However, when the data 
Were separated according to Ss’ performance- 
reward contingencies, distinct differences in 
the satisfaction-performance relationships were 
observed. For the 42 appropriately reinforced 
Ss, significant positive correlations were found 
between six of the eight satisfaction indexes 
and subsequent performance. But for the 42 
inappropriately reinforced Ss, negative correla- 
tions were found between seven of the eight 
satisfaction indexes and subsequent perform- 
ance, of which three correlations were statis- 
tically significant, and one closely approached 
significance at the .05 level. 

Table 4 presents the results of correlational 
analyses performed between performance dur- 
Ing the second hour and the eight satisfaction 
indexes subsequently recorded at the end of 
the second hour. The similarities between 
Table 3 and 4 are striking. The data for all Ss 
revealed no significant correlations between 
performance and subsequent satisfaction. How- 
ever, when distinctions were made between 
appropriately and inappropriately reinforced 
Ss, where the performance C fications 
were based on the second hour’s productivity 
scores, widely disparate performance-satisfac- 


TABLE 4 


CORRELATIONS BETWEEN PERFORMANCE DURING 
SEcoND Hour AND SATISFACTION AT END 
or SEcoND HOUR 


Appro- 
priately 


All Ss* 


Satisfaction indexes 


es eral arous: 
‘onal compe 


Įuity of p: 
Adequacy of 


ay 
of fellow workers 
of task 


mn 
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tion relationships were observed. For the 
appropriately reinforced Ss, significant posi- 
tive correlations were obtained between per- 
formance and each of the eight satisfaction 
indexes. For the inappropriately reinforced 
Ss, negative correlations were obtained between 
performance and seven of the eight satisfaction 
indexes, with four of the correlations being 
highly significant. 


DISCUSSION 


The results of this study support the 
hypothesis that the nature and magnitude of 
the relationship between satisfaction and 
performance depend heavily upon the per- 
formance-reward contingencies that have been 
arranged. The findings are consistent with the 
predictions of the Porter and Lawler (1968) 
model, although the theoretical bases for 
our hypothesis differ from that model. For 
Lawler’s model implies 


instance, Porter and 


that, under contingent-reward conditions, 
performance causes satisfaction because per- 


formance leads to rewards which, in turn, 
cause satisfaction. Our theory implies no cause- 
effect relationship between performance and 
satisfaction; instead, it stresses the perform- 
ance-reinforcing as well as the satisfaction- 
increasing potential of contingent reinforcers. 
The random reward system described here 
has as its organizational counterpart companies 
s such as pay increases and 
promotion are alleged to be performance- 
contingent but, in fact, are contingent upon 
factors independent of performance, such as 
age, seniority, Or upward influence of one’s 
an organization we would 
and performance to be 


in which reward 


supervisor. In such 
expect satisfaction 
unrelated. 

The negatively contingent reward system is 
analogous to an organization in which rewards 
are based on factors that happen to be inversely 
related to performance. Such systems are not, 
as rare as one might think. 
encountered a 


unfortunately, 
One of the authors recently 
company in which pay increases were based 
on supervisor ratings, until it was discovered 
that the supervisors’ ratings were negatively 
related to performance. In such an organization 
we would expect to find satisfaction and 
productivity negatively related. High perform- 
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rs are dissatisfied because they feel cheated; 
ow performers are more than satisfied with 
their relative increases in pay. 

So the importance of taking into account 
the contingencies between performance and 
rewards is again emphasized for those who 
seek to affect employee attitudes and/or 
behavior through various reward systems. 
If rewards are not positively contingent, then 
the administration of rewards will not only 
fail to encourage performance increments, it 
may also increase dissatisfaction and, ul- 
timately, absenteeism and resignations among 
the highest producing employees. 
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WORK VALUES DISTIN- 


GUISHING SATISFACTION LEVELS AMONG ENGINEERS! 


DONALD A. WOOD: 


Graduate School of Business, Indiana University 


This study attempted to d 


sample of 1,500 professional engineers could be distinguis! 
lower levels of satisfaction on the basis of personal, backgroun 
variables. Linear discriminant function anal 
best discriminated with 67° and 65% cl: 
tion groups, respectively. Discrimination for those least s 
with 6359—4790 hit rates. Group discriminators were best desi 
values, and participation in technical activities. An 


training and knowledge, job v 
adapted method for applying 


Individuals within different occupations, 
skill classifications, and organizational levels 
often indicate marked differences in job 
satisfaction (e.g, Centers & Bugental, 1966; 
Doll & Gunderson, 1969; Porter, 1962). In 
addition, demographic variables like age or 
education have been significantly correlated 
with satisfaction measures, implying subgroups 
determined by such variables also vary on 
job satisfaction (Fournet, Distefano, & Pryer, 
1966). Little research, however, has addressed 
itself to determining the characteristics that 
best distinguish workers grouped according 
to how satisfied they are with their work. 

A fundamental question emerges as to what 
background or structural variables can be 
identified that contribute most to the dis- 
criminability between groups with high and 
low job satisfaction, using the job satisfaction 
measure itself as a classifier, The more tradi- 
tional interest regarding the identification of 
attitudinal correlates is replaced by the desire 
to seek out attitudinal discriminators instead. 

Ronan (1970) has extensively reviewed 
research using a continuous satisfaction mea- 
sure for correlational analysis within employee 
groups. These efforts mainly involve simple 
and multiple criteria-satisfaction correlations. 


_' This study was prepared under a grant from the 
Office of Manpower Policy, Evaluation and Research, 
Grant No, 91-16-68-36, United States Department © 
Labor under the authority of Title I of the Manpower 
Development and Training Act of 1962. The author 1s 
indebted to W. K. LeBold for permitting access to the 
data used in this study 
* Requests for reprints should be sè 
Wood, Graduate School of Business, 
versity, Bloomington, Indiana 47 


nt to Donald A. 
Indiana Uni- 


etermine if those with higher job satisfaction in a national 


hed from those with 
d, and structural 
disclosed those most satisfied were 


fication accuracy in initial and replica- 
ed was second best 
cribed by professional 


function weights to individual cases was described. 


Such results are important, but since regres- 
sions across different employee groups vary 
considerably, conclusions concerning attitud- 
inal correlates within one group cannot be 
generalized to other groups. If the question 
concerns which variables can best distinguish 
between groups differing by the degree of 
overall job satisfaction, then discriminant 
function analysis, instead of regression analysis, 
becomes appropriate. The more customary 
concern of satisfaction regression within 
groups is replaced by the concern for selecting 
variables that best preserve the identity of 
groups differing in satisfaction. The discrimi- 
nant function approach would appear capable 
of providing a clearer picture of the optimum 
combination of variables that differentiate 


such groups. 

The purpose of this study is to determine 
what background, structural, and demographic 
items most distinguish various satisfaction 
levels among engineers and to identify an 
optimal weighting scheme by which such 


levels can be maximally discriminated. 


METHOD 


Sample 

The sample consisted of 1,500 graduate engineers 
randomly selected from a larger national survey of 
engineers in industry and government (LeBold, 
Perrucci, & Howland, 1966). Seventy-one percent had 
bachelor’s degrees, 17% had master’s degrees, and 12% 
had doctorate degrees ~ all in various engineering fields. 
Eighty-five percent of the sample were employed in 
industry, with chemical, electrical, and aircraft com- 
panies highly represented. The remaining included 
engineers in Federal government (5¢¢); state and local 
government (856); and nonmanufacturing, including 
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research and testing laboratories (2%). ancy 
of the sample worked in organizations exceeding j i 
employees, but their work unit size remained sma b 
usually less than 10. The total sample had a median ag 
of 39, 


Procedure 


Preceding the actual analyses, groups ame ae 
differing in job satisfaction had to be identified. This 
was done by summing across 60 job epee ‘eons 
cerning areas such as autonomy, recognition, cl ae 
security, status, and interpersonal relations, al of which 
had been responded to on a 3-point scale—very char- 
acteristic (3), some characteristic (2), and not char- 
acteristic (1). For example, an engineer would have to 
indicate how characteristic is using your abilities in 
challenging work” with respect to his current job, He 
responded in like manner to the remaining items After 
adding across all 60 job items, a general attainment 
index resulted (range of 60-180). Th 
the Porter (1962) index obtained by ask 
is there now?” and reflects the sati: 
process described by Evans (1969, D 
for such a score is that as work attain 
characteristics increase, job satisfaction will also in- 
crease, This relationship has been previously demon- 
strated and supported (Wood & LeBold, 1970). 

After the characteristic scores were calculated, the 
score distribution was divided into fifths, resulting in 
five different groups ranging from those who gave their 
job a low characteristic Score (dissatisfied) to those who 
saw their position worthy of a high Score (satisfied), 
Score intervals were determined from the total sample 
using the BM DO2S contingency table analysis program 

(Dixon, 1968, pp. 341-357) under the umption that 
the distribution was normal in form. The resulting Vs 
within the interval groupings used indicated the dis- 
tribution was very nearly normal, After the total sample 
record tape was ordered by the characteristic score, it 
was decided to construct two Separate data tapes, the 


ment or perceived 


the total sam] z 
Tape 1, then the next S5 was ; 
etc. Both tapes were 
acteristic score magnitude so th 
would be sequentia], 
The groups for Sample 
low (L1) for those whose 
or below; Jow middle (LM1) 117-127. 
Scores from 127-136; hi 
ranging from 136-145; hi 
or above. Comparable S 
intervals are design 
respectively, 
Program BMDOsM, linear discriminant analysis for 
several groups (Dixon, 1968, pp. 196-207), Was used 
for the analyses. Twenty items were selected for the 


: 
, Tape 2, 
e char- 
at group membership 


ample 2 g 
ated L2, LM2 


*It may be well to distinguish between the 
discriminant function to be employed in this stud 
the linear regression function more commonly 
predictive studies. A linear regression function 


linear 
y and 
used in 

makes 
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discriminant analysis, primarily on the basis of a 
correlation with the characteristic score. sew, 
this selection Process did not include items with a bi 
relationships to Job satisfaction, as a a 
Previous research, they were included to the ex 

ower correlated items, PE ee 
a set selected as potential ap, 
of job satisfaction included: (a) the po go na a 
currently employed in; (6) number of ie nal} 
sponsible for, both professional and pn elated 
(c) hours per week spent at home on worl an don 
activities; (d) amount of work done that could > tent 
by someone with less technical training; (2) es (f) 
of employer's continuing education oppormaine s a 
extent of formal tra ining ina variety of specific pe thé 
Subjects and detailed engineering activities, $ z from 
engineer's state of knowledge in each area a ty eared 
extensive to none; (x) organizational size; Deca 
of supervi y responsibility; (7) degree of eet 
responsibility; (j) a description of job function ra tive; 
from entirely technical to entirely administrat a 
(k) relevancy of engineering background to P redt, 
job from “must have” to “unnec ary”; (0) a g from 
overall, single item job satisfaction scale sa as or 0 
very satisfied to very dissatisfied; (m) ATE 
memberships in national professional and nea for 
societies; (1) number of technical articles AEs like 
publication ; (0) number of technical activihes sas 
meetings and outside courses engaged in ce ae 
year; (p) gross annual income: (4) peak gross = ol 
income expected during carce (r) Se enri 
doubts about engineering after having made . score 
Choice; (s) age; and (1) a summated inpar pence Fa 
some importance, 2; no impor! E # 
across the same 60 job statements use was 
ainment index 


(very important, 3: 
1) estimated 


by 


jin 
prising Sample 1 ere as 
ant analysis employing all 20 m i the 
ollowing the 20 variable analysts us 
iminating items in Sample 1 were "7 


4 + dentili- 
analysis in Sample 2. These ie dis- 


‘ t Lon 
Matrix (Cooley & 1 ative 
ng weights give the r is- 


é . h 
wer for each variable in eac 


ant function, 
Resurrs 


mM . . tec 
The scaled discriminant functions ans sis 
for each group in the 20-item, Sample 1 ar The 

are given in the fi le t- 
arst part of Tab A 


f 
: ete 
use of continuous dependent variable values for d at 


Paes : 
mining function weights to estimate other depender 
values. The discriminant function weights come “it 
due to classifications made on the dependent Hidas i 
Not to continuous Values. ‘The “prediction” is 8" 
membership, Not criterion value as in regression. 
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TABLE 1 
Five SCALED DISCRIMINANT Functions For Five Groups Usinc TWENTY [TEMS 
(SAMPLE 1) AND Ercut [ress (SAMPLE 2) 
Scaled discriminant functions 
Ttem Sample 1 | Sample 2 
| LI LM! | Mi | HM1 | H1 L2 | LM2 | M2 |HM2| H2 
|. 
1. Department size 12 A fl A | | | | | 
2. No. employees responsible for -14 16 17 19 | 
3. Hours of homework —.06 | —.06 | —.06 —.05 | | 
4. Do work others should do 40 Al H| 38| 3| .34) 32] 31] 30 
5. Employer's policy on education 27 | .30 82 | 382] | | 
6. Training and state of knowledge 1.13 1.13 1.14 | 1.14 | 1.37 | 143 | 1.38 | 1.40 | 1.38 
7. Size of organization | 09} 10) 10 10 | | 
8. Supervisory responsibility | .18 AS 19 18 | 
9, Technical responsibility | 18 -20 -20 -20 | 
10. Technical-administrative function Jt 34 a5 ae 58 60 61 62 63 
11. Relevance of background 40| .4l 40 39) .57 + 4 53} S2 
12. Overall job satisfaction 25 -20 16 12 | | 
13. No, professional societies —.30 | —.31 | —.31 —.34 | 
14. Articles written .06 07 06 10 | | | 
15. Technical activities 49 „51 50 1 91 95 93 | 94 96 
16. Annual income 25 25 22 25 | | 
17. Expected income —.10 | —.11 | —.08 —.Al | 
18. Doubts about engineering 40! 40 Al 42 29 | 3 33 33 33 
19, Age 44| 45] 46 ay] 43) 44] 44] | 45 
20. Importance score 75 80 83 92 | 1.08 | 1.12 | 1.45 | 1.17 | 1.24 
| 


ed function 
middle, HM 


Items italicized have eight highest sc 


Note. 3 
low middle, M = 


Abbreviations: L = Low, LM = 


weightings resulting from scaling are relative 
indexes of discriminatory efficiency and should 
not be confused with the original weights 
estimated in the discriminant functions. In 
this and successive analyses, Functions 1-5 
are related to Groups L1(2) through H1(2), 
respectively, The item concerning training 
and state of knowledge (Item 6) appears to 
be the most effective single discriminator 
between the five satisfaction groups of engl- 
neers after all relative weightings in the scaled 
functions are compared. The importance score 
received the second highest weighting and 
the item pertaining to technical activities was 
third. It is noteworthy that weightings, 1n 
general, do not change drastically from one 
function to another. 

Other notable discriminants included age, 
doubts about engineering as i career, doing 
work that could be done by others, technical- 
administrative function, and relevance of 


engineering background. 


s across all five functions in Semple 1. Group Ns = 150 


v s 
h middle, H = high. 


The classification matrix resulting from the 
first analysis is given in Table 2. This matrix 
actually summarizes the results of estimating 
five probability densities, one from each 
discriminant function, for all Ss in the sample 
after which the engineers are assigned to the 
in which the probability density is 
For example, the standardized raw 
for Sı, Group 1, is multiplied, in 
discriminant function. The 
result is five discriminant scores from which 
probability densities are determined (for 
computational procedure, see Dixon, 1968, 
p. 202). If the highest density occurs with 
Function 1, then S, is as igned to Group 1. 
If Function 3 gives the greatest density, this 
S is then assigned to Group 3, etc. More 
simply, each individual’s response vector is 
compared to the mean response vector for 
each group. A person is assigned to the group 
where this difference is smallest. Lf all five 
group functions discriminated perfectly, the 


group 
greatest. 
score vector 
turn, by each 
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TABLE 2 


FIVE JPS IN LE 1 U 
CLASSIFICATION MATRIX FOR Five GROUPS IN SAMP: 


SING TWENTY DISCRIMINANT [TEMS 


Discriminant function % Group 
5 hits 
P 2 | 3 4 5 Total 
1 
5 63 
5 7 150 63 
95 21 | 15 12 7 5 35 
T 47 37 33 15 18 150 A 
i 22 26 53 27 22 150 37 
MI 17 ul 32 55 35 150 a 
uy 6 10 12 | 21 101 150 P 
eats 187 105 145 | 130 183 750 
netion hi 5 5 7 .42 .55 
% Function hits 1 35 | .37 


‘ i-square value for table of 393,68 with df = 
aie wane eae obtained from cack discriminant 
HM = high middie, H = hich 


matrix diagonals would all be 150, 
other entries would be zero, 

The italicized diagonal entries in T 
indicate the number of hits or correct cla: 
tions obtained from each discriminant function. 
The group hit percentage is that proportion 
of the total stoup accurately placed, The 
greatest hit rate is 67% for the Most satisfied 
group, H1. This is followed by a 63% rate for 
the least satisfied group, L1. Group LM1 
members were discriminated least effectively 
with only one-fourth placed in this group by 
its respective function, Just over one-third of 
those in Groups M1 and HM1 were considered 
hits. From these findings, it would appear that 
very satisfied and very dissatisfied engineers 
can be categorized rather accurately without 
knowing their actual attitude levels, 

Another way 
efficiency is to ¢ 
highest densities 


and all 


able 2 
ssifica- 


discriminant 


ercentage of 
determined by a function 


g associated 

This was done by dividing 
the column totals (see Table into the 
corresponding di 


age of function 


Function 5 associated with 
most effective in discrimir 


2 for Group LM1 is leas 


Group H1 jg 
and Function 
t effective), It seems 


nating, 


16 is significant at p = 001, Group 
function. Abbreviation 


es repre” 
150, Italicized values tePile. 
low middle, M = 


ms: L = low, L 


its 
reasonable that the number of grou? a 
would be reflected by the discriminant fu 
tions’ efficiency index, plica- 
In order to apply a rather stringent rep in 
tion of the discriminant analysis c 
Sample 1, the eight highest-weighted Cara 
mined by the scaled functions) R item 
Sample 1 were used in a Sample 2, valed 
discriminant problem. Table 1 gives ae With 
functions resulting from this analysis. hers, 
the exception of Item 1, doing work of ot F 
and Item 6, doubts about engineenngi Si 
the relative discriminant weights are the pare, 
or higher here than in the Sample 1 coer 
It is most noteworthy, however, that Saget 
and state of knowledge, job values, shest 
technical activities again received the hig 
weightings, in that order, p65 
The hits percentage for Group H2 0 the 
(see Table 3) was only slightly lower than ed. 
H1 rate of 67% when all 20 items were US L2 
he next highest rate of 47% for Group 150 
reflects a decline in discriminability ihat Ser- 
occurred in Groups M2 and HM2. Cons! u 
ably better discrimination occurred in GTO 


h : ing 
LM2 using eight items than in LM1 us 
20 items, 


Surprisingly similar, 
number çf non-H2 designations made ye! 
Function 5 and the comparatively ka 
non-L2 assignments made by Function 


sip a ve the 
This finding would Seem to indicate that 


— 


aie 
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TABLE 3 
CLASSIFICATION MATRIX FoR Five Groups IN SampLe 2 Usixe EIGHT Most-WEIGHTED 
DISCRIMINANT ITEMS FROM SAMPLE 1 
Discriminant function ör 
Group — Jo Group 
| | hits 
1 2 | 3 | 4 5 Total 
L2 71 34 13 12 20 150 AT 
LM2 44 48 16 17 25 150 32 
M2 27 32 22 31 38 150 AS 
HM2 21 28 23 29 49 150 A9 
H2 9 15 13 16 97 150 -65 
oe Total 172 | 157 87 | 105 229 750 
“© Function hits atl | 3l | 25 | 28 | .42 | 


sent Note —Ghi-sauare value for table of 185.78 with df = 
HM the number of hits obtained from eac 
M = high middle, H = high. 


hit rate for engineering groups widely divergent 
in overall job satisfaction does not necessarily 
reflect the classifying efficiency of the corre- 
sponding group functions. From this dis- 
criminant analysis using the reduced data set, 
classification in the high and low-moderate 
groups does not suffer appreciably, while the 
hit rate for those least satisfied is reduced but 
still remains substantial. Data reduction is 
most severe on those in the middle and high- 
middle groups with their hits percentage 
going below that expected by chance (.20). 


DISCUSSION AND CONCLUSIONS 


concluded from this study that 
five engineering groups differing in general 
job satisfaction as measured by a perceived 
work attainment index can be significantly 
distinguished by work value and background 
variables, particularly if satisfaction oF dis- 
satisfaction is extreme. Linear discriminant 
function analysis has demonstrated that item 
weights can be derived such that extreme 
satisfaction group identities can be reasonably 
Maintained or approximated. Lowered classi- 
fication success found in the three intermediate 
Satisfaction groups could well have been due 
to the abbreviated satisfaction ranges used in 
their formation. Had they been merged into one 
large moderate group, discrimination might 
well have increased. But this would have been 
accomplished by sacrificing the distinctiveness 
of the classification scheme involved. Perhaps, 
more practically, the extremes of job satisfac- 


It may be 


with d 16 is significant at p 
h discriminant function. Abbreviation: 


001. Group Ns = 150. Italicized values repre- 
= low, LM = low middle, M = middle, 


tion are most crucial if discrimination is 
desired. Behaviorally, it may make a greater 
difference if one is extremely happy or unhappy 
with his work as opposed to being mildly 
satisfied. 

Since it has been reliably established that 
engineering attitude-grouP differences do exist 
with respect to professional training, values, 
and activities, the question of using the 
resulting discriminant weights arises. Stinson 
(1958) suggests @ procedure that is adapted 
for use here. For illustration purposes, the 
three best discriminators are used in determin- 
ing function scores (V) for the L, M, and H 
Sample 2 groups- Scaled item weights from 
the S-item analysis functions and respective 
group means (training, values, and activities, 
in that order) are used in the following 


equations ** 
1.367 (189.250) + 1.082 (134.510) 


Vi: = 
à (911 (16.830) = 419.57 
ue = 1.382(190.220) -+ 1.147 (141.490) 

E 934(17.530) = 441.54 
Yue = 1.382(192.740) + 1.239(152.310) 

“4.959 (18.250) = 472.57 


The midpoints between these critical V 


scores can be used in the assignment of 


4 For illustration purposes, 
criminators from the 8-item analysis are used. More 
definitive classification for questionable cases could be 
achieved by successively adding discriminators to the 
equation, much like adding predictors to a regression 
equation. A complete table of item means and standard 
deviations can ‘be obtained from the author. (See ad- 


dress in Footnote 2.) 


only the three best dis- 


individual cases. For example, if a person (Si) 
had raw scores on the training, importance, 
and activities items of 175, 138, and 17, 
respectively, his V score of 418.36 would place 
him far below the L2-M2 midpoint score of 
430.55 and very close to the V score for L2 
(Vs, = 1.391(175) + 1.152(138) + .938[17]). 
Thus, he would be assigned to the least 
satisfied group, L2. The discriminant weights 
used with raw scores are the averaged scaled 
values across the five functions. The slightly 
increased precision gained from applying 
every function in turn to each raw score 
vector and then calculating probability den- 
sities for each does not seem justified. This 
adaptation of Stinson’s (1958) method appears 
to be more practical and feasible with respect 
to assigning new Ss to satisfaction groups. 

It should be emphasized that data other 
than the V score should be taken into account, 
especially in borderline cases. But the dis- 
criminant V score does help to integrate the 
predictions that come from the three most 
valid variables in these samples into a single 
number. 

This classification scheme offers a workable 
method for placing Ss in job satisfaction 
categories when their job satisfaction is un- 
known. More research is needed, however, 
to determine if state of training, work values, 
previous participation in professional activities, 
and other pertinent data gathered at the time 
of hire could designate initial groupings that 
Faire toile ay riminant analysis could 

entify those with propensities to be 
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satisfied or dissatisñed with their job prior 
to hire and placement. 
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ANXIETY AS A FUNCTION OF INTRINSIC-EXTRINSIC JOB 


ORIENTATION. THE 


PRESENCE OR ABSENCE OF 


OBSERVERS, AND TASK DIFFICULTY * 


SHOUKRY 


D. SALEH? 


Department of M anagement Sciences, University of Waterloo 


Ninety-six intrinsically job-oriented 


1and 96 extrinsically job-oriented maleuniversity 


students were divided into four comparable subgroups. Two subgroups worked on 


simple anagrams, one alone 


and one observed by two individuals of higher status. 


The other two worked on difficult anagrams under the same two conditions. Felt 


anxiety and perception of task difficulty 


ratings were elicited along with reactions 


toward the experimental conditions. The intrinsically oriented expressed more 
uneasiness than the extrinsically oriented while being observed on the difficult 


anagrams. They also tended to perceive 
than when alone, 


the task as more difficult when observed 


while the opposite was true for the extrinsically oriented. The 


importance of job orientation in the reaction to the presence or absence of observers 


was discussed. 


The objective of this study is to clarify 
experimentally how intrinsically oriented and 
extrinsically oriented individuals would react 
to the presence of higher status observers. 
Intrinsically oriented individuals are those who 
would emphasize the actual performance of the 
job. They, for instance, would be concerned 
with the challenge that their job offers and 
with their accomplishment on it. Extrinsically 
oriented individuals, however, are those who 
would emphasize the environment in which 
the job would be performed. They would be 
more concerned with such factors as security, 
working conditions, and supervision than with 
challenge and accomplishment. 

The reviews by Cummings and El-Salmi 
(1968), House and Wigdor (1967), and Whit- 
sett and Winslow (1967) show that the litera- 
ture has concentrated on supporting or Te- 
futing the proposition that the intrinsic and 
the extrinsic categories are two dimensions © 
job attitude. Very few studies, however, have 
dealt with their dynamics. Weissenberg and 
Gruenfeld (1968) for example, found a posi- 
tive relationship between satisfaction with the 
intrinsic factors and job involvement. In 
another correlational study, Saleh and Grygier 
(1969) found that the intrinsically oriented Ss 
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? Requests for reprints should be sent | kry 
D. Saleh, Department of Management Sciences, Uni- 
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had a stronger tendency to be independent and 
felt that they needed less guidance or reassur- 
ance than the extrinsically oriented ones. 
It has been also pointed out (e.g., Herzberg, 
1966; Saleh, 1964) that underlying the intrinsic 
orientation is the need for self-actualization, 
while physiological, security, and acceptance 
needs are the basis of the extrinsic orientation. 

The main interest in the audience effect 
literature has been concern with the effects 
of the presence of others on performance. 
Zajonc (1965), in an attempt to explain the 
opposing findings in this area, proposed that 
the presence of others has drive-arousing 
effects on those who are performing, and this 
arousal increases the probability of occurrence 
of the responses that are dominant in a per- 
son’s repertory. Thus, if the task was a simple 
one in which the person has almost learned 
the solution, the correct response would be 
dominant and the presence of others would 
facilitate performance. On the other hand, if 
the task was complex and the correct response 
ot dominant in the hierarchy of possible 


was n 4 
the presence of others would impede 


responses, 
performance. 

In his proposition, Zajonc indicated the 
mediating arousal effects only for the presence 
of others. The situation as @ whole, however, 
should be considered a source of arousal, and 
in this case the nature of the task and its 
difficulty should not be ignored. That task 
difficulty is related to arousal has been demon- 


strated by Hess and Polt (1964) and by 
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Schnore (1959). Therefore, assuming = 
there is a curvilinear ee “Hebb, 
of arousal and performa b 
fee), it is possible that the eroga! ee 
by a difficult or complex task adde pane 
audience presence may result i a ae 
arousal level that exceeds the level or op the 
O od —— $ alae task 
i ask, however, the arousa a 
ie ae ‘does not add | a the 
arousal induced by the Pies Sree “te 
Thus, the total level of arousal may not exceec 
that necessary for good performance. hn 

Any complete explanation of , audience 
effects, however, should not ignore individual 
differences. People differ in their susceptibility 
to audience presence. Ganzer (1968), for 
example, with serial learning as the criterion, 
found that audience presence was more detri- 
mental for high- than for low-anxious indi- 
viduals. In another study Kohfeld and Weitzel 
(1969) concluded that susceptibility to peer 
group status was greater for Ss who scored 
relatively low on five personality scales of 
the California Psychological Inventory (Ai, 
Achievement via Independence; Ac, Achieve- 
ment via Conformance; Re, Responsibility ; 
To, Tolerance; and Gi, Good Impression) 
than for individuals scoring relatively high 
on the scales. They argued that people who 
work equally hard for both low- and high- 
status observers tend to Possess certain posi- 
tive personality attributes (Ai, Ac, Re, To, Gi) 
to a greater degree than individuals who are 


more sensitive to the social status of their 
audience, 


While most of the audience 
concentrated on their effects on 
the present study focused on 
feelings of Ss toward the preser 
More specifically, it investig: 
of the intrinsically oriented 
ally oriented towa 
status observers wi 
task difficulty. 

An interaction was ex 
three variables 
hypotheses: 


effect studies 
performance, 
studying the 
nce of others, 
ated felt anxiety 
and the extrinsic. 
rd the presence of higher 
th respect to two levels of 


pected among the 
as indicated by the following 


Hypothesis 1: More anxiety would 
expressed by the intrinsically oriented than 
the extrinsically oriented while being ob- 
served performing on the difficuli task, 


be 
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Ta in expressed 
Hypothesis 2: No difference in espressi 
anxiety would be expected ee 
: ; i > N sical!) 
intrinsically oriented and the oe a 
oriented while being observed performing 
the simple task. 


These hypotheses are based on two ee. 
tions: (a) The intrinsically oriented empha 
achievement and task involvement more ana 
the extrinsically oriented, and enoa e 
former would be more concerned wn D 
nature of the task than the latter. Ane k 
the presence of higher status Os is of re? 
not only to the extrinsically oriented, nee 
stress the importance of supervision, but a 3 
to the intrinsically oriented, who meut: pa 
ceive Os as evaluators of their abilities. i 
view of these two assumptions, the intrinsica ik 
oriented under the observed difficult: veal 
condition are expected to experience ani 
from both the task and Os, while arousa re 
the extrinsically oriented would be pamar 
induced by Os. Under the observed smki 
task condition, however, the main Sour 
arousal for both groups is expected to be 
presence of superior Os. 


Metiop 


Measures & 

. Saleh & 

The Job Attitude Scale QAS, Saleh, 1964; Sale 120 
Grygier, 1969; Saleh 


& Lalljee, 1969) consists of 5 
atements, each being paire! the 
forced-choice format. Six a ent, 
intrinsic factors (achieve 
responsibility, recognition, advancement, „hatu e ex 
work, and growth in skill). The other 10 involv icy 
trinsic factors (working conditions; company Polary 
salary; security; status; technical supervision; “ions 
needs for family’s sake; and interpersonal inherar tne 
with supervisor, subordinates, and equals). Job on i 
tion was determined by scoring only the 60 see al 
which an intrinsic factor was paired with an aan: 
one. By giving one point whenever the intri seus rna 
is checked, the possible score range is 0-60. The inte R 
consistency (spl 


e ki ie 
it half) of this scale was .94 and 
retest reliability was 88. 


items involving 16 st 
the other 15 in a 
Statements involve 


Poslexperimental Questionnaire 
see Wa 
The main item of concern in this questionnaire © 
the scale used by Schachter (1959) in studying tpe ow 
tionship between anxiety and affiliation. “Indicate | a 
you feel about being a subject in the expe! join! 
Cp. 311.” The Ss indicated their feelings on 4 Ae K 
scale ranging from “I feel extremely uneasy” 0) en of 
feel completely calm” (6). The second objective ite; 


3 5 

on 

. rf . ma ` ərcep 
concern in this questionnaire involved Ss’ percel 


ne, re 


EN 


ANXIETY, JOB ORIENTATION, OBSERVERS, AND Task DIFFICULTY 


of the task. They were asked to indicate if the task w 
simple by checking one of three choices, yes, no, or °. 
Scores given were 3, 1, and 2, respectively. In two open- 
ended questions the effects of working alone or ob- 
served were elicited. 


Task 


Most experiments on anxiety have used the task of 
rate learning of nonsense syllables. Although such a 
task has its experimental advantages, it was felt that the 
kind of learning involved is not of great importance 
in human life, and what there is of it seldom takes 
place under the kind of conditions imposed by the 
researcher. A better approximation to “real-life” 
Situations is to deal with more complex intellectual 
activities such as problem solving. Therefore, the task 
chosen for this study was the solving of anagrams, 
a task that has been used successfully in anxiety experi- 
ments (i.e., Harleston, 1962; Russell & Sarason, 1965). 

This study's anagrams were tested empirically for 
their difficulty by Brown (1963) on an undergraduate 
university sample, and they have since been used in 
several studies (Feather, 1965, 1966). The difficulty 
was defined as the percentage of the individuals who 
solved the anagram in 30 sec. Forty anagrams with a 
percentage of correct answers ranging from 5156 to 
80% comprised the simple task while 30 anagrams 
varying in difficulty from 2196 to 50% formed the 
difficult task. 

The JAS was administered to 428 first-year uni- 
versity male students whose participation in this 
experiment was a requirement of the introductory 
course in psychology. Scores ranged from 6 to 55. The 
top quarter of the sample scored 36 or more and were 
considered the intrinsic group (usable N = 92).3 The 
bottom quarter scored 22 or Jess and they constituted 
the extrinsic group (usable V = 96; see Footnote 3). 
Both groups were invited to participate in the experi- 
ment that took place 4 wk. after the administration of 
the JAS. Each of them was further divided into four 
comparable subgroups (24 Ss in each) on the basis of 
the JAS scores. Two intrinsic and two extrinsic sub- 
groups worked on the simple task while the other two 
worked on the dificult task. In one of the subgroups 
each S worked alone while in the other, S was observed 
by two individuals who were presented as graduate 
Students (and, hence, having higher status than 5). 
Two observers were used instead of one in order to 
magnify the effect of being observed. The experiment, 
then, used a 2 X 2 X 2 factorial design, two types of 
job orientation, intrinsic and extrinsic; two levels of 
task difficulty; and two working conditions, alone and 
observed. 

The Ss were asked to solve as many anagrams as they 
could in the specified time (15 min). Because it was 
intended that Ss should be able to solve all anagrams 
in the specified time, the number solved within 15 min. 
was tested before the experiment on another comparable 
group. In the “alone” condition, Æ told S that he was 


3 Few Ss did not show up for the experiment and some 


did not have all required data. 
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going to leave him to work alone and that he would 
return after 15 min. The postexperiment questionnaire 
that included the dependent variables was given 
immediately after the experimental session. 


RESULTS 


The analysis of variance for the anxiety 
scale is presented in Table 1 which shows that 
the interaction effect among the three inde- 
pendent variables, orientation, task, and 
presence or absence of observers was significant 
(p < .05). Figure 1 and Table 2 show the 
nature of this interaction. They indicate that 
the intrinsically oriented Ss expressed more 
anxiety than the extrinsically oriented under 
the observed difficult task condition and thus 
support the first hypothesis. The results also 
show no difference between the two groups 
under the observed simple task condition and 
therefore support the second hypothesis. By 
examining Table 2 and Figure 1 we also find 
no differences between the intrinsically 
oriented and the extrinsically oriented either 
under the alone simple task condition or under 
the alone difficult task condition. 

It should be noted that Table 1 indicates 
two significant main effects, the task and the 
audience variables. In the case of the simple 
task, the mean anxiety score was 4.99 (SD 
= .79), and for the difficult task the mean was 
4.58 (SD = 1.12). (The higher the mean the 
lower the anxiety). The mean for the alone 
condition was 4.94 (SD = .95) while it was 
4.63 (SD = 1.00) in the observed condition. 


TABLE 1 


ANALYSIS OF VARIANCE OF ANXIETY SCORES AS 
A FUNCTION OF JOB ORIENTATION, AUDIENCE 
PRESENCE, AND TASK DIFFICULTY 


Source MS df F 
Total .98 189 
Between 3.24 7 k 
Job orientation (A) 1.84 1 2.05 
Task difficulty (B) 7.79 1 8.68" 
Audience presence (C) 4.40 1 4.90 
AXB 1.53 1 1.70 
AXC 2.70 1 3.01 
BXC .05 1 05 
AXBXC 4.39 1 4.89* 
Within -90 182 
*p <.05. 
ap <.01. 
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TABLE 2 


RATED ANXIETY OF THE INTRINSICALLY ORIENTED 
AND THE EXTRINSICALLY ORIENTED UNDER THE 
AUDIENCE PRESENCE AND Task DIFFICULTY 


SHoukry D. SALEH 


TABLE 3 


ANALYSIS OF VARIANCE OF PERCEIVED TASK SIMPLICIT 
AS A FUNCTION OF JOB ORIENTATION, AUDIENCE 
PRESENCE, AND TASK DIFFICULTY 


CONDITIONS 7 
Source MS df | 
Intrinsically Extrinsically | 
oriented | oriented Total 34 190 | 
| > T Between .99 7 z3 
i- pees x: ase Job orientation (A) 70 1 Ee ig 
4 Simple | Difficult cea —— Task difficulty (B) 479 i 1516 
task | i ği i Audience presence (C) .07 1 te 
ea 9 | ] AXB 30 1 : 2 
We) SD | M | SD| M | SD| M \sp WC | 1.07 1 3: 
sl il Kael R ia BXC | ow 1 | 00 
Alone 5.08 | .86 |4.83| .75| 5.17 -69 | 4.67 | 1.11 AXBXC -01 1 04 
Observed | 4.88 .93 | 3.96 | 1.44 4.83 | 1.31 |4.88| .83 Within 32 | 183 
| —— — = 
Note.—The higher the score, the less the anxiety. ont $ 10. 


These results support the assumption intro- 
duced earlier that both task difficulty 
presence of Os would be sources of anx 
therefore, arousal. 

Although no specific hypothesis Was intro- 
duced with regard to Ss’ perceptions of task 
simplicity, the results (see Table 3) tended 
to be consistent with those of anxiety. The 
main effect for the task variable y 


Was significant 
at the .001 level. The mean for the simple 


anagrams was 1.44 (SD = -68) and the mean 
for the difficult anagrams was 1.12 (SD = 36). 
(The higher the score the simpler the task.) 


Table 3 also shows that the interaction effect 
between the orient 


and the 
iety and, 


y 4 ation variable and the 
audience variable approached significance 
(p < 10). The nature 


of this interaction is 
shown in Figure 2, which indicates that the 
main difference between the intrinsically 
oriented and the extrinsically oriented in their 


5.4} SIMPLE TASK 


perception of task difficulty was in the alon” 
condition. In this case the first group pace 
the task as being simpler (M = 1.44, SL 
= .71) than the second group did (M = 1.14 
SD = Al) (= 2.25, p < .05). There was nO 
difference between the two groups under the 
observed condition (M of intrinsically oriente 
= 1.25, SD = .56; M of extrinsically orient 
= 1.28, SD = .57). The results also showee 
a tendency on the Part of the intrinsically 
oriented to Perceive the task as being more 
difficult when observed than when they wer? 
alone, while a slightly reversed tendency 4P” 
peared for the extrinsically oriented. . 
_The analysis of the two open-ended ques 
tions showed reactions consistent with ae 
previous results. For instance, the disturbing 
effect of Es? Presence was iar pronounce’ 
for both the intrinsically: oriented and t” 
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5.0 
> > 
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4.2 
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ERVED ALONE OBSERVED 
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Anxiety as a functi j i i 
y unction of job ontentation, Os presence, and task difficulty. 


A 


ANXIETY, JOB ORIENTATION, OBSERVERS, AND Task DIFFICULTY 


1.45 


INTRINSICALLY 
ORIENTED 


1.35 


1.25 


2 
7 EXTRINSICALLY 
te gr ORIENTED 


OBSERVED 


ALONE 


Fic, 2. Perception of task simplicity as a function of 
job orientation and presence of Os. 


extrinsically oriented while working on the 
difficult task rather than the simple task. 
Furthermore, the analysis showed that Os’ 
Presence tended to arouse debilitating anxiety 
in more intrinsically oriented than in ex- 
trinsically oriented Ss. 

It is of interest to note that four Ss in each 
of the observed extrinsically oriented groups 
Mentioned that the presence of Es had a 
Positive effect. and improved performance. 
In the alone condition three extrinsically 
oriented Ss stated that they would have 
worked better if Æ had stayed. Under no 
condition did any intrinsically oriented S 
mention such an effect. 


DISCUSSION 


The differences between rated anxiety levels 
of the intrinsically oriented Ss and the ex- 
trinsically oriented Ss in the present study 
could be explained in view of the differences 
ìn their job attitudes. As mentioned previously, 
the intrinsically oriented emphasize the task 
itself and the challenge it offers to test skills 
and abilities more than the extrinsically 
oriented. In a working situation the former 
group also would have a stronger need for 
achievement than the latter group. In this 
view, it might be argued that the intrinsically 
oriented more than the extrinsically oriented 
should be expected to concentrate on task 
performance and ignore the presence of Os. 
We should not expect, however, that the 
intrinsically oriented, who would like to be 
recognized for their abilities and their achieve- 
ments, would not react to the presence of two 
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higher status Os who watched each of them 
closely while he performed. Based on this 
understanding, the experimental situation 
presented this group with two sources of 
arousal, the task and Os, while the extrinsically 
oriented were presented only with Os’ source. 
This should explain why the main difference 
in anxiety between the two groups was under 
the observed difficult. task condition (see 
Fig. 1). 

Based on the above discussion, it can be 
assumed that the intrinsically oriented per- 
ceived solving the anagrams not only as getting 
a credit toward the fulfillment of a course but 
also as a challenge. For them, not achieving 
and not performing up to a high standard 
would be regarded as_ highly undesirable, 
as threatening to the self-concept, and there- 
fore would develop “achievement anxiety.” 
It is expected that this threat and anxiety 
would be greater if the task was perceived 
more difficult, Therefore, since all Ss perceived 
a different difficulty level for the two tasks of 
the study regardless of being observed or not 
(see Table 3), and since the presence of the 
two Os increased the perception of task 
difficulty only for the intrinsically oriented 
(see Fig. 2), the difficult. anagrams under 
the observed condition would be the most 
difficult task for this group and consequently 
the most anxiety arousing. 

To the extrinsically oriented, on the other 
hand, who are assumed to be basically less 
interested in successful performance, task 
difficulty should not be of as much concern 
as it is to the intrinsically oriented. Moreover, 
it appears that the presence of observers had 
some differential effect on the two groups with 
more positive influence on the extrinsically 
oriented, for whom good supervision is im- 


portant. 

Although the present study has presented 
and consistent results, the limita- 
tions of using only a single item anxiety- 
reaction scale and a single item task-difficulty 
scale should be kept in mind. Moreover, it 
should be pointed out that the presence of two 
Os, as in this study, might have different 
reactions than the presence of one O. 

In conclusion, the different reactions of 
intrinsically oriented individuals and ex- 


trinsically oriented individuals to the high- 


meaningful 
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status Os suggest implications resulting from 
close supervision for the two groups, particu- 
larly when the task is comprised of difficult 
cognitive problems. Such reactions and im- 
plications should be tested in real working 
situations to find out if the theory of “job 
orientation” presented in this laboratory 
study could be extended to the field. 
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ERROR AND VARIABILITY IN THE VISUAL PROCESSING 
OF DENTAL RADIOGRAPHS: 


IRWIN L! GOLDST 


N? AND WILLIAM H. MOBLEY 


University of Maryland 


The primary objective of the present study was to 
agreement in a sample of senior dental students. The Ss were presented 


error and d 


with a series of 40 radiographic stimuli and were requir 


sponse, a recognition response, and a difficu 
aled a high false positive rate and moderate miss rate. Analyses of 


responses reve; 


the recognition responses showed that Ss were fairly 
vere correctly detected. No consistent relationship was found between 
difficulty ratings and either detection or recognition performance. 


malities th: 
observers 


As psychologists examine the problems sur- 
rounding the processing of complex displays 
(Goldstein & Allen, 1971; Howell, Johnston, 
& Goldstein, 1966; Kibler, 1965), it has become 
apparent that there are many variables and 
tasks that have not been subjected to experi- 
Mental analysis. While psychologists have dis- 
covered that they do not have a data base 
Soverning absolute levels of performance, pro- 
fessionals from other fields like dentistry and 
Medicine have begun to seriously question their 
ability to make judgments from their complex 
displays, for example, radiographic materials. 

In relation to radiographic displays, Fletcher 
(1954) stated that, 
observer error is a subject whose importance should be 
More widely recognized. Until clinicians freely recognize 
pe bluntness of the instruments they use, they will 

rden their students unnecessarily, deceive them- 
selves, waste a lot of time, and reach false conclusions 


from which their patients may suffer unnecessarily 
Cp. 212). 


A number of studies illustrate that observer 
error is indeed an important problem. Garland 
(1950) reported that 399 chest X-rays read 
twice by each of three readers resulted in 32% 
Underreading (misses) and 2% overreading 
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assess the magnitude of observer 


ed to give a detection re- 
lty rating. Analyses of the detection 


accurate in describing abnor- 


(false positives). Other researchers (Yeru- 
shalmy, Harkness, Cope, & Kennedy, 1950) 
investigated the detection of shadows in mass 
chest roentgenograms and found that as many 
as 30% of the roentgenograms showing evi- 
dence of disease were missed and that 2% were 
reported as false positives. 

Unfortunately, much of the medical and 
dental. literature can be criticized because of 
the lack of proper experimental control. There 
are too few studies that present reliability 
data, adequately specify the sample employed, 
and present valid criteria for judgments. Some 
of the investigations (Galagan & Vermillion, 
1956) used the majority opinion of the subjects 
in the study as a basis for deciding if pathology 
was present in the radiograph. 

Specifically, the present study is designed to 
determine whether there are visual errors asso- 
ciated with the observation of dental radio- 
graphs. If observation errors are found, further 
indicators of performance like the types of 


errors and the confidence of the observer will 


also be examined. 
METHOD 


Subjects 

senior dental students from 
the University of Maryland School of Dentistry served 
as Ss. All Ss had completed their course work in radi- 
ology and oral diagnosis, had clinical experience, and 
would earn their degrees within 6 mo. Each S received 


$6.00 for participating in the study. 


Forty volunteer, male, 


Stimuli 

The stimuli were 40 bitewing dental radiographs that 
had been converted to Kodachrome A slides. The film 
sample, selected by the Head of Oral Diagnosis at the 
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University of Maryland Dental School, Wi sera 

to be representative of films a practicing Heaths mig t 

be required to diagnose. The sample included 2 posi- 

tives, 15 negatives, and three slides with known proc- 
: 

essing defects. 


Procedure 


The films were presented via slide projector. The 
series of 40 stimuli were presented twice, each presen- 
tation independently randomized. For each film, 5$ 
were required to respond in one of three categories: 
(a) abnormality present, (b) processing defect present, 
(c) normal. This represented the detection’ response. 
In the case of the first two response categories, Ss were 
asked to identify the abnormality or processing defect. 
This represented the “recognition process response. 
Finally, Ss were required to rate each slide on a 7-point 
bipolar verbally anchored scale of interpretive difficulty. 
Sixty seconds were allowed for viewing and responding 
to each slide. Following the presentation, Ss were re- 
quired to complete a postexperimental questionn 
a self-rating of effort, a question regarding prefer 
to avoid misses or false positives, identific: 
tion in the experimental room, and name, 


aire, 
ence 
ation of posi- 


RESULTS AND Discussion 


Detection Error 


Performance on the detection task 
analyzed in terms of correct detection 
sponses, misses, and false positives, 
sulted when a positive film or 
processing defect was called 
present.” A detailed summary 
performance, subdivided by 
given in Table 1. For both pr 
bined, that is, 80 slides, 
were correctly classified. Th 


was 
re- 
A miss re- 
a film with a 
“abnormality 
of detection 
Presentation, js 
esentations com- 
73% of the films 


ere was a slight but 
nonsignificant improvement in detection 
performance on the second Presentation, 


TABLE 1 
SUMMARY oF DETECTION PERFORM 
PERCENTAGE oF Correcr DEtEctio 
AND FALSE Positives 


ANI 


NUMBER AND 
3 Misses, 


Performance | % | SD 
No. correct, both resen tations 5 
No. correct, Bra 1 os y | a 
No. correct, Presentation 2 29.10 | 73 2.20 
No. misses, Presentation 1 3.76 | 15 202 
No. misses, Presentation 2 3.60 | 14 202 
No. false positives, Presentation 1 7.30 41 2.30 
No. false positives, Presentation 2 | 7.30 | 41 | 2.28 
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The miss rate was computed by dividing tig 
number of misses by 25, the number of poa 
and processing defect films. The poranna 
missed for both presentations combined he 
15. The false positive rate was computed 18 
dividing the number of false positives by ; a 
the number of negative and processing de ah 
films. The percentage of false positives for bo 
presentations combined was 41. iá 

The analysis of detection error revealed 5 
moderate miss rate and an extremely me 
false positive rate. The fact that the false P 
tive rate was higher than the mis rate is err 
sistent with the frequent observation woes 
students tend to overread films. It is also go t 
sistent with the indication by 75% of Ss m 
they would prefer to avoid missing an wet 
Normality as opposed to avoiding a false Pit 
tive when confronted with a diflict 
interpretation. vas 

The fact that the false positive i 
higher than the miss rate is also Slee 
with the results of Galagan and egg er 
(1956). However, a number of aul =a 
(Garland, 1950; Webber, Benton, & i 
1967; Yerushalmy ct al., 1950) have repor 
higher miss than 


false positive rates. Direo 
comparisons among studies are dificult WE 
differences in observers (students versus dif 
Perienced radiologists and dentists) and ta 
ferences in stimuli (chest X-rays versus aed 
film, difficulty: level, and complexity of puma 
and differences in criteria, 


Recognition and Overall Performance 


iluation of recognition performan 
noting the correct roc- 
for positive and eae 
at were properly class! en- 
ponse. On the first prest 


. u 
Series, 10% of the pos! ig- 


Ms that were correctly l 
tected were incorrectly described. ON t 
Second presentation 9% were incorrect” 
described, , 


These va 
detected the 
rate in describing it. yha 

at if the observe 7 eani 
he detects 
Galanter (1 
error often n 


not report a detec’, 
(6 vogn! 

962) has noted that recog ti 
Nay not be too large since detec 
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itself may be dependent upon the observer's 
ability to recognize the stimulus. 

In order to assess overall performance, a 
total score was computed for cach observer. 
This score reflected the number of correct de- 
tection responses for negative films and the 
number of correct recognition responses for 
positive and processing defect films that were 
correctly detected. The results of this analysis 
are presented in Table 2. 

The results presented in Table 2 show a 
slight but nonsignificant improvement in over- 
all performance from Presentation 1 to Presen- 
tation 2. Considering both presentations com- 
bined, 33% of the films are incorrectly detected 
or described. 

The magnitude of this error figure gives 
strong support to Galagan and Vermillion’s 
(1956) proposition that diagnosis of dental 
radiographs is subject to considerable error, 
and further research effort must be directed 
toward understanding and reducing this error. 


Reader Agreement 


to evaluate reader con- 


It is important 
hen investi- 


sistency as well as reader error wh 
gating observer performance. Intrareader 
agreement refers to a measure of how often 
single readers agree with themselves on two 
readings of a set of films. Interreader agreement 
refers to a measure of how often different 
readers agree among themselves over all possi- 
ble combinations of readers. If any measure- 
ment is presented, pairs in agreement is the 
Index of reader agreement most frequently em- 
ployed in the radiology literature (Yerushalmy, 
1955), i 
Intrareader agreement for the 
sponse was analyzed in terms of th 
first- and second-presentation responses 


agreement. The mean number of pairs of de- 
agreement was 36 or 1%. 


detection re- 
e number of 
in 


tection responses in 


TABLE 2 


SUMMARY OF OVERALL PERFORMANC è 


> aa " 
Performance | X | o | SD 
Ire = 
No. correct, both presentations | 53.89 67 | 4.69 
No. correct, Presentation 1 26.73 67 | 2.58 
No. correct, Presentation 2 27.15 | 68 | 2.50 
SS SS 


al 
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TABLE 3 


SUMMARY oF INTRA- AND INTERREADER AGREEMENT: 
PROPORTION OF PAIRS IN AGREEMENT 


| 


Agreement erection boinc 
responses 
ance 
Intrareader agreement 91 90 
Interreader agreement | 
Presentation 1 70 69 
Presentation 2 -70 Aii 


Average intrareader agreement for overall per- 
formance was 36 pairs of responses in agree- 
ment or 90%. 

Interreader agreement was eva uated by 
noting the number of pairs of detection re- 
sponses in agreement among the 40 readers 
over all slides. For the detection response, 
average interreader agreement for both presen- 
tations combined was 70%. For overall per- 
formance, average interreader agreement was 
also 70%. A summary of reader agreement is 
given in Table 3. 

Since the film series was presented twice 
within 2 hr., the intrareader disagreement is 
probably an underestimate due to short-term 
recall. However, even within this relatively 
short time span, the intrareader disagreement 
was 9% for the detection responses and 10% 
for overall performance. These figures are 
comparable to the 21% intrareader disagree- 
ment reported in the chest radiology literature 
by Yerushalmy et al. (1951) and the 6% re- 
ported by Galagan and Vermillion (1956) for 
dental radiographs. The 30% interreader dis- 
agreement for detection responses and 30% for 
overall performance found in the present study 
are comparable to the 30% reported by 
Galagan and Vermillion (1956). 

The relatively high intrareader disagreement 
tes that Ss were less than completely 
own interpretations even 
st period. The relatively 
high interreader disagreement suggests that 
Ss had not employed a common frame of 
reference or stereotype in interpreting the 
films, exhibited differential response bias, 
and/or differed in abilities contributing to per- 
formance on this task. 

Another measure of reliability, the test- 
retest correlation, was obtained in order to 


indica 
consistent in their 
within the short rete 
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have information about the stability of er 
order of individuals over time. In the pres 5 
study, the retest was within 2 hr. aad, — 
biased due to short-term memory. Ho ie 
even™within this short period, the ie 
correlation of .56 for detection is quite lo i 

Further analyses showed that there ae 
either no errors or a very small Fico fi 
errors on over one-third of the films a that 
the low or no error films were generally posi- 
tive. To the extent that the film series was a 
representative sample, there appears to be: a 
relatively large group of positive films E 
offer the observer no difficulty in detecting the 
existence of an abnormality. On the other hand, 
the poor performance on the remaining slides 
is even more striking. The problem for further 
research is to isolate the factors that dis- 
criminate between these easy detections and 
difficult detections. The increase in the test- 
retest reliability to .72 for overall performance 
may in part be attributed to the increased 
variance in total score and to the fact that 
overall performance includes a þr 
of observer behavior. 

In summarizing the reader agreement and 
reliability results it may be concluded that, 
under the conditions of the present study, 
interpretation of dental radiographs is not a 
highly stable, consistent Process. Further 
research is needed to establish whether the 
relatively low test-retest reliability 
function of the particular films, Ss, and 
tions of the present study. In view of th 
larity of the reader agreement results 
study and earlier studies, it may 
the low reliability encountered wa: 
to this study but m 
problem. 


oader sample 


was a 
condi- 
e simi- 
in this 
well be that 
S not specific 
ay be a more pervasive 


Additional Analyses 


The relation betw 
performance (right- 
computing 


cen difficulty 


or the detection 
the p < 05 level 
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for only 13 of the 40 Ss with the average co 
efficient being .19, j F 

The reliability of the difficulty ae 
assessed using the procedure outlined NY al 
(1951) and Guilford (1954, 1965). ee ae 
provides an estimate of the reliabi 1 ica an 
composite ratings for an Ente group ‘er, For 
estimate of the reliability of a single n al 
Presentation 1, the estimated composi bility 
liability was .87 and the estimated mn 
for a single rater was .14. For the e E 
2 these values were 90 and 18, sone single 
Since the primary interest was in ei 
observer, the 14 and 18 tollapitty se in- 
were of particular interest. ‘These resu nship 
dicate that there was very little relatior ame 
between different observer's ratings of the si 
films. d the 

The relations between performance an ate 
self-ratings of performance and effort myers 
examined by computing product-momen score 
relations between overall performance jelas 
and each of the two self-ratings. The cor 36 
tion between total score and the gunn gon 
performance was -08, while the corre effort 
between total score and the self-rating of ° ly 
was —.09. Neither coefficient was significa 
different from Zero. ‘culty 

The fact that the relation between diffic ant 
rating and item performance was signin. 
for only 17 of 40 Ss in the case of overall oe 
formance and 13 of 40 Ss in the case of oe 
tion performance Suggests that there 15, 
best, a weak relationship between men re- 
variables. This finding, along with the jo: an 
liability of an observer’s difficulty ratings 2 i 
the nonsignificant correlation between, "t 
rating of performance and total score ee 

4 E: 

own performance. al 
t with Yerushalmy et ship 
a significant relation 
d performance. . 
f significance to thos 
ading of “difficult 
is unable to identify he 
Te difficult, the radiograP a 
dual reading may not be ene 
misreading, The above Te is- 
h the relatively high reader pat 
nd low reliability, suggest 


ole 
is er 
1S Not a panacea for observer 


a 
a 


two 


findings are consisten 
(1955) failure to find 

etween difficulty an 

These results are o 
recommend dual rei 
graphs. If a reader 
radiographs a 
Passes on for 

e is in fact 
combined wit 
agreement a 
dual reading 


é wh? 
radio" 
pich 
he 
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Limitations of Generalizations 


oo get at to note that the students 
eiicnt Lage mig to clinically examine the 
Patient’s hn, aoe have access to the 
rather than pe ee isg; senior dental students 
as Ss. While i nuts or radiologists were used 
trained obse i may be assumed that Ss were 
from ietan it is apparent that they differ 
set, Shs wie © dentists in experience, age, 
o nais ner variables that may be relevant 
details ele on this task. There are also 
limit f S the experimental procedures that 
atudents Pe eb In this study, the 
slide anion the X-rays as presented by a 
against gos rather than viewing the film 
dents DS e “ven though the stu- 
similar va ‘soca with the technique because 
Work in i hods were used in their classroom 
is not Te a radiograph interpretation, this 
dentists tran procedure used by practicing 
that was j he the time allowance of 60 sec. 
Ss did oa ged as completely adequate by all 
Which = ace a time limitation on the task, 
Pretation ta common to normal dental inter- 
generaliz: ae Finally, as with any study, 
ims ; ability is restricted by the particular 
“Sand methods used. 

ol oe Tesearel with different stimuli and 
issues a necessary in order to examine the 
Attention om in this experiment. Particular 
Positive ne be given to the high false 
different Bats that has been found in several 
1956 ahaa shneir (Galagan & V ermillion, 
Ocus on 7 research may find it fruitful to 
Clateg wi W perceived rewards and costs as50- 

ith the available response alternatives. 
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The effects of directional and nondirectional cues were investigated on a bape ad 

peer d to moving visual stimuli that entered critical zones in each of three 
S znne ~The Ss given directional cues that supplied spatial and temporal 
visus! T +: evfcmed significantly better than Ss in the nondirectional group, 
information pip temporal cues, and a control group that did not receive any 
tie neni cues. Furthermore, Ss in the control group performed as well as Ss 
Se ofen cues. Thus, temporal cues did not lead to enhanced per- 
eA e Gules the display requiring attention was desi nated by a spatial cuc. 
These data also indicated that increased speed stress led to linear decrements in 


performance. 


Today’s complex man-machine systems 
frequently include human tasks that require 
the operator to attend and respond to multiple- 
information sources. Weisz and McElroy 
(1964) and others (Allen, 1969; Jeantheau, 
1959; Mackworth & Mackworth, 1958; Mc- 
Cormick, 1964; Mowbray & Gebhard, 1961; 
Olson, 1963) have found that complex operator 
tasks may momentarily overload the informa- 
tion processing capacity of the individual 
resulting in the deterioration of performance. 
These data have led to the common-sense 
notion that equipment design must specify 
the acceptable rate of input information (Mc- 
Cormick, 1970). Unfortunately, there are man- 
machine situations (e.g., air traffic control) 
where speed stress is an essential aspect of the 
task. In these tasks, especially those involving 
a perceptual-motor component, 
timing has been postulated as an important 
factor in coping with increases in speed stress. 

Timing is defined by Conrad (1955, 1956) 
as the characteristic of skill that creates the 
most favorable temporal conditions for 
response. Good timing produces 
layout cf a series of responses n 
most appropriate moment and 
smooth performance while b 


response 


a 
a temporal 
nade at the 
leading to 
ad timing results 
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in irregular, jerky performance. Schmidt 
(1968) has postulated that high levels of 
skill attained by well-practiced Ss would not 
be possible without proper timing, Timing 
difficulties have been noted by monitors who 
must attend to several displays. In many 
of these situations, the rates of change on each 
display are different, and the monitor has tO 
make critical decisions based on the moment 
that the various pointers have reached their 
prescribed levels. McCormick (1970) speci- 
fically notes that monitors in time-sharing 
tasks should be directed toward these sub- 
tasks which require immediate attention. 
Cueing as a technique provides information 
that should permit Ss to predict and anticipate 
critical stimuli that require immediate atten- 
tion. While research (Mudd, 1965; Mudd & 
McCormick, 1960; Pollack & Teece, 1958; 
Smith, Farrell, & Gonzalez, 1965) investigating 
the effect of cueing has established its effec- 
tiveness, the relationship of cueing to pe 
formance with a timing task at different spee. 
stress levels is not nearly so clear. Thus, the 
purpose of the present research was to I 


vestigate the effects of spatial and tempor 
cues on performance in a perceptual-mol? 
task, 

METHOD 
Subjects 


om 
Seventeen male volunteer psychology students fr 
the University 2 


of Maryland served as Ss. 


Apparatus 
a re 
. na 
a visual display, vist 
an operator control panel. The 


Che apparatus consisted of 
Sponse panel, and 
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display was composed of three lighted dials, 4.5 in. 
long and 1.5 in. high, placed side by side approxi- 
mately 6 in. apart. Two black vertical target lines 
marked off each dial into three areas or zones. The 
pointers, one on each of the three dials, moved at a 
constant velocity to the right and then to the left. The 
pointer stopped when it reached the end of the dial 
if S failed to push the response button and thereby 
reverse the direction of the pointers. A light source 
mounted on the top of each of the three dials provided 
an anticipatory warning signal (visual cue) that was 
electronically controlled and became activated when 
the pointer approached a target line. 

Response buttons mounted on a panel in front of S 
were located in spatial correspondence with the three 
dials and placed directly in front of S the S placed one 
hand on the response panel and used his index, third, 
and fourth fingers to push the response buttons. 

The movements of the three pointers were per- 
manently recorded by three Esterline Angus chart 
recorders. ‘There was an exact correspondence between 
the pointer movement and the record on the chart 
because each pointer was attached directly to the chart 
recorder pen. Thus, the following events were recorded: 
the location and moment at which a pointer reversed 
direction, the moment at which a pointer stopped and 
then restarted, and the time at which a signal occurred, 

The Æ controlled the velocity of the pointers and the 
cueing conditions by means of a control panel. The S 
sat 7} ft. from the display and was screened from Æ 
and the operator panel. 


Task 


The S was instructed to at 
on the visual display and reverse the direction of the 
pointers on each dial by pushing the response button 
corresponding to the pointer whenever it crossed the 
target lines. The target lines divided each dial into a 
central zone and two border zones, one on each side. 
The S was instructed to keep the pointers continuously 
in motion during the task; failure to reverse the direc- 
tion of the pointer resulted in the pointer stopping at 
the end of the dial. Thus, S was required to determine 
the order of the arrival of signals (pointers crossing the 
target line) in the three different dials as well as timing 
his response to the individual signals. Visual cues 
provided § with additional information to determine 
the order of the arrival of signals. 


tend to the three dials 


Cueing Conditions 


There were three visual cueing conditions: direc- 
tional, nondirectional, and no cues. For the direction- 
oriented (spatial and temporal) condition, the light 
above each dial only became activated when the pointer 
on that dial approached the border zone. In the non- 
direction-oriented (temporal) condition all of the lights 
were activated simultaneously when any one of the 
three pointers approached a border zone. T he direction- 
oriented cues therefore contained more information 
than the nondirection-oriented cues, as the former cues 
pointed out the exact dial that needed attention while 
the latter informed S that one of the three dials needed 
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attention. The Ss in the control condition were not 
provided with any cueing information. 


Speed Stress Conditions 


The levels of speed stress were determined by the 
potential number of signals per trial where a signal 
was defined by a superimposition of a pointer on a 
target line. The total number of signals per trial con- 
sisted of the sum of the number of signals per trial per 
dial. The levels of speed stress were 2 39, 53, and 73 
signals per 45-sec. trial. For ease of expression, the 
speeds were labeled as: slow, 29 signals per trial; me- 
dium, 39 signals per trial; medium-fast, 53 signals per 
trial; and fast, 73 signals per trial. The velocity of a 
given pointer during a 45-sec. trial remained constant, 
but each poinier had a different velocity from the other 
pointers. 


Procedure 


Each experimental session lasted approximately 1 hr, 
and 20 min., and was composed of 12 practice trials 
(3 cue conditions by + speed conditions) and 24 experi- 
mental trials (2 trials for each of the three cue condi- 
tions by + speed conditions) each lasting 45 sec. with 
30- intertrial intervals. When the initial practice 
trials were completed and all questions were answered, 
Ss were given a 10-min. rest before proceeding through 
the experimental trials. The order of presentation of 
cueing condition was randomized for each S as well 
as the order of presentation of speeds within the cueing 
conditions. 


RESULTS 


The data were analyzed by a two-factor 
ANOVA with repeated measurements over 
both factors. Tukey’s multiple-comparison 
method was used to determine the significance 
of differences in mean performance in the 
three cucing conditions and in the four levels 
of speed stress. 

The Ss’ responses were scored according 
to the number of correct reversals (hits). Hits 
were determined by the displacement of S’s 
response from the target line and were re- 
corded when S responded within .2 cm. before 
the target line or within 1.1 cm. after the target 
line. A measure of percentage of signals correct 
was obtained by dividing the number of hits 
for the dial per trial by the potential number of 
hits possible per dial per trial. The percentage 
of correct responses in each dial was averaged 
with the percentage of correct responses in th 
other two dials to form a composite measure 
of performance per trial. 

An analysis of variance of the data indicated 
that both speed and cueing conditions were 
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TABLE 1 
PERCENTAGE OF Correct RESPONSES 
Cueing 
an Direc- | Xondirec- Goutal 
tional | tional | 
Slow a a f 
Medium i pe os 
Medium-fast = | n | a 
Fast | | 
significant (for cueing: F = 21.02, df = 2, 32, 


p < 001; for speed: F = a a 2 4, 5 
p < 001), but the interaction effect of spee! 
and cueing was not significant (F = 1.36, 
df=6, 96, p> 05). Table 1 shows the 
percentage of correct responses for all experi- 
mental conditions, 

The effects of speed stress on performance 
are consistent, with performance deteriorating 
with increased speed stress. Tukey’s multiple 
comparison indicated that each level of stress 
was significantly different from all other levels 
of stress. An examination of these data showed 
that performance in the directional cueing 
condition consistently exceeded performance 
in the nondirectional and control conditions 
for all levels of speed stress. Performance in 
the nondirectional condition was only slightly 
superior to the control condition in the medium 
and medium-fast speeds and sli 
or equal to the control condition in the slow 
and fast speeds. The results of a Tukey 
analysis indicated that the directional-cueing 
condition was significantly different from the 
nondirectional (p < .01) and control-cueing 
conditions (p < .01). The nondirectional con- 
dition was not significantly different from the 
control condition (p > 05). Thus, perform- 
ance for the directional-cueing conditions 
exceeded performance in the control 
directional cueing conditions, w 
two were essentially equal. 


ghtly inferior 


and non- 
hile the latter 


Discussion 
Speed Stress 


As might be expected, m 
(Alluisi, Muller, & Fitts, 19 
Gould & Schaffer, 1967; 


zany researchers 
Ss Conrad, 1951; 
Jeantheau, 1959; 


ZIN 
Peter W. DORFMAN AND IRWIN L. GOLDSTEIN 


z z RET ie 
Mackworth & Mackworth, sea ag Sue 
son, 1962; Olson, 1963; Wagner, Fit i ee 
1954) have found performance bere per 
with increased speed stress. (A o am m 
for the present study indicated than i a: 
stress accounted for 67% of the varia = 
The form of the relationship, however, 950 
nearly as well documented. Conrad (1 ion 
warning that utmost care should be used TD 
comparing the speed-performance relation oe 
in different tasks, found a logarithmic sl 
tionship between speed and perfon 
using errors of omission as the depen oe 
variable. Using errors of timing as the at 
of performance, however, Conrad found a 
performance was not affected by speed ei 
In the present study, when the four spi z 
conditions are examined according to i 
potential hits possible for each speed Pa 
condition, performance retained an essential a 
linear relationship with increased speed Ta 
resulting in decreased performance. ser 
the specific form of the relationship Kepmen 
speed stress and performance depends uP? 


; cee er- 
the particular task and the criterion of P 
formance. 


Cueing 


The analysis of variance had indicated e 
the cueing variable was significant at W 
P < 001 level, but only accounted for er 
of the variance as determined by the os 
The following two explanations could ameoill 
for the above finding: (a) All Ss performet 
Significantly better in the directional-cuche 
condition, but the magnitude of inoren E 
performance was small. Or (b) most Ss al 
formed equally well with or without ori 
but a few Ss performed exceptionally 2 
When percentage correct reversals are sum” 
across speed conditions, 
ter in the direction 
control condition, 
better in the 


et- 
all Ss performed , i 
al condition than in dë 
and 15 of 17 Ss respon 


5 i we in the 

f directional condition than r 50 
nondirectional Condition. Thus, the at 
for the small 


amount of variance ase se 
ueing variable is that SS koe 
onded better in the eg 
but the magnitude of differe 


for by the ¢ 
sistently resp 
Condition, 
Was small, 
In order 


ju! 
Consistency, 


5 ‘ndivic 
to further examine par 
Spearman-Brown composit 
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liabilities were computed using the individual 
reliabilities obtained by correlating S’s score 
on each of the cueing conditions with scores 
on the other cueing conditions as well as S’s 
scores in each speed condition with every other 
speed condition. The reliabilities were .93 
for the cueing condition and .98 for the speed 
condition. Thus, Ss’ relative level of perform- 
ance in one treatment condition predicted 
their level of performance relative to other Ss 
in the other treatment conditions. These 
results which indicated individual differences in 
performance remained stable across both speed 
and cueing conditions. 

In conclusion, Ss consistently performed 
better in the directional cueing condition where 
there was a spatial as well as a temporal cue. 
It appears that in this task where there was a 
time-sharing (three visual displays) and a speed 
stress dimension, a temporal cue, by itself, 
did not provide much additional information 
beyond what the control group obtained from 
simply observing the display. In some cases, 
the design engineer has not considered bringing 
Ss’ attention to the display by using a spatial 
cue or some association between a particular 
cue and a particular display. While it is dif- 
ficult to generalize from visual to auditory 
cues, it is interesting to note that there has 
been a growing concern over the use of an 
auditory cue if it just supplies temporal infor- 
mation. The difficulties related to associating 
such a cue with a particular display may, in 
part, furnish an explanation for the pilot’s 
skepticism of “a growing cacophony of 200 
noises [Hunter, 1969, p. 71)” that are being 
used as warning signals in aircraft. 7 

One interesting question about the utility 
of a temporal cue relates to the a priori 
probability of a specific display requiring 
attention. It is possible that temporal cues will 
suffice in those conditions where Ss have 
learned or have been informed that particular 
displays are more likely to necessitate atten- 
tion during certain temporal intervals. Further 


research is planned to investigate this issue. 
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EFFECTS OF ALCOHOL INGESTION ON TRACKING PERFORMANCE 
DURING ANGULAR ACCELERATION! 


Ricnarp D. GILSON 
FREDERICK E. GUEDRY, JR. ? 


Wittiam E. COLLINS 
Davip J. SCHROEF 


AND 


Civil Aeromedical Institute 
Federal Aviation Administration 
Oklahoma City 


United States Navy Aerospace 
Medical Research Laboratory 
Pensacola, Florida 


Following practice and baseline tests of tracking performance in both static (sta- 
tionary) and dynamic (whole body angular accelearation) conditions, 10 Ss re- 
ceived orange juice that contained 2.0 ml. of 100- proof vodka per kg. of S weight: 
others drank a control beverage. Tests, conducted 1-10 hr. after drinking, were 
in total darkness with the exception of the illuminated visual display. Static 
tracking errors for alcohol Ss were significantly higher than those of controls only 
at the 4-hr. session. However, alcohol Ss made significantly more dynamic tracking 
errors during the 1-, 2-, and 4-hr. sessions and, concomitantly, had significantly 
more nystagmic eye movements and higher rates of slow phase eye velocity than 
did controls. Although eye-hand coordination may show little or no impairment 
following alcohol ingestion in static situations, it may be seriously degraded during 


motion. 


Schroeder (1971a, 1971b) has shown that 
the ingestion of alcohol depresses both ocular 
nystagmus (a pattern of alternate slow sweeps 
and fast return movements of the eyes) and 
vertigo sensations to rotatory or caloric vesti- 
bular stimulation when Ss are in darkness. 
But in illumination, similarly provoked ny- 
stagmus is considerably stronger than it is 
normally (Heifer, 1969; Schroeder, 1971a, 
1971b). This stronger-than-normal nystagmus 
in the light is not related to changes in vesti- 
bular sensitivity but to the depressive action 
of alcohol on the visual fixation mechanism. 
In the absence of alcohol involvement, ny- 
stagmus in illumination is ordinarily abolished 
or substantially inhibited by visual fixation. 
For example, figure skaters typically use 
visual fixation following an on-ice spin to 


1 This study was conducted by the Federal Aviation 
Administration’s Civil Aeromedical Institute and was 
cosponsored by the Federal Aviation Administration, 
the Army (United States Army Medical Research and 
Development Command) and the Navy (United States 
Navy Aerospace Medical Research Laboratory, Pensa- 
cola, Florida). The expert assistance of Carl Moore in 
obtaining the blood samples and of Elizabeth Gilson, 
Carlyn Manley, Cynthia Cochran, and Blair Fennell in 
the collection and scoring of data are gratefully ac- 
knowledged. Gas chromatographic analyses of the blood 
Samples were performed under contract by Kurt 
Dubowski, University of Oklahoma Medical Center. i 

? Requests for reprints should be sent to W illiam E 
Collins, Psychology Laboratory AC-118, Ç ivil Acro- 
medica] Institute, Federal Aviation Administration, 
P. O, Box 25082, Oklahoma City, Oklahoma 73125. 


suppress both nystagmus and vertigo (Collins” 
1966). However, the alcohol-induced increase 
of ocular nystagmus occasioned by vestibular 
stimulation in the light suggests a decrease 
in the capacity to control blurring of the visual 
field and to inhibit dizziness. This suggestion 
poses some obvious questions regarding the 
ability of men to perform visual tasks ade- 
quately during vestibular stimulation after 
drinking alcohol. Most studies of the effects 
of alcohol on human performance involve 
static situations, that is, situations in which 
the men are not subjected to motion. It is 
conceivable that the addition of motion, which 
is involved in a variety of activities such as 
piloting an aircraft or driving an automobile, 
might produce deleterious effects on perform- 
ance not usually obtained in static situations. 
Therefore, the present study was designed to 
examine some of the effects of alcohol ingestion 
on visual tracking performance during angular 
accelerations. 
METHOD 

Subjects 

Twenty male college students, paid volunteers 
ranging in age from 21 to 30 yr., served as Ss. None 
had previous laboratory experience involving vestibular 
stimulation. Each S was assigned at random to one of 
two equal groups, a control and an alcohol group. 


-l pparatus 


Angular acceleration was supplied by a Stille-Werner 
RS-3 rotator fitted with a small cockpit in which § was 
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enclosed and seated upright with his head anteverted 
slightly so that his horizontal semicircular ee babe 
approximately in the plane of rotation. (The head was 
inclined so that an imaginary line between the ant 
canthus of the eye and the tragus of the ear were parallel 
to the floor of the room.) A fitted headrest helped to 
maintain the desired position. A triangular waveform 
input from a Wavetek Model 155 waveform generator 
was used as a command signal for the rotator, The 
velocity of the latter was proponinta the sa 
voltage, and a peak angular velocity of 120° per secon 
was attained in both the clockwise and counterclockwise 
directions. The waveform period was 48 sec. : 

A compensatory visual tracking task provided both 
a direct practical measure of performance and an in- 
direct measure of acuity. The tracking apparatus has 
been described in detail elsewhere (Gilson, Guedry, & 
Benson, 1970). Briefly, a 14-sec. sinusoidal “forcing 
function” input deflected the vertical needle of an 
aircraft localizer/glide-slope indicator while § at- 
tempted to maintain the needle in the null position by 
manipulation of a control stick. Deviations of the needle 
from this position were considered errors (“absolute” 
error), and a voltage proportional to these deviations 
was electronically integrated over consecutive 1-sec, 
intervals throughout a trial, 

The display was illuminated by a 3 v. de bulb 
mounted in a tube in front of S but below his line of 
sight. Light was projected through the tube to localize 
on the display and to minimize reflection in the other- 
wise darkened room. The luminance of the display 
was measured with the aid of a card sprayed with the 
same white paint as the display needles, This card was 
placed in the light, just in front of the display, and 
measurements were made with a MacBeth illumin 
eter from S’s viewing position. ‘The voltage 
bulb was adjusted until the luminance was 1 ft Lia 
level recommended for aircraft instruments (Morgan, 
Cook, Chapanis, & Lund, 1963, pp. 76-81). 
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Alcohol In gestion 


The Ss in the alcohol group consumed 
100-proof Smirnoff vodka and reconstit: 
concentrate of) orange juice. The mixtu 
contained 2.0 ml. of vodka per kg. 
(Vor a 165-Ib. S, this would amount to about 5 flui 
oz. of 100-proof liquor.) Control Ss mens uid 
orange juice with a few drops of rum extract a on only 
alter the odor and taste of the beverage, (They to 


a mixture of 
uted (frozen 
re (900 ml.) 
of body weight. 


Were 
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eres s 
led to believe that they were drinking alcohol.) All S! 
consumed their drinks within a 30-min. period. 


Procedure 


Prior to being tested, each S was given 5 a 4 
tracking practice with the cockpit stationary. KE 
experimental sequence that followed comprise! Pe 
testing sessions: a predrinking session (pre) and hal 
postdrinking sessions at 1, 2, 4, 8, and 10 hr. sh id 
completion of drinking. All se sions, both practice = 
experimental, were conducted with the room in T "7 
darkness with the exception of the visual disp! en 
Immediately before each testing session, venous bloo 
samples of from 3 to 5 cc. were drawn for analysis 0 
blood alcohol levels by gas chromatography. i 

Each session consisted of a 2-min. “static” Lager: 
trial with the cockpit stationary and a 4-min. on 
namic” tracking trial with the cockpit rotating hee 
five complete cycles (240 sec.). The order of these Sue 
alternated across Ss, and at least a 1-min. interva 
allowed between trials, | 
were obt 


ibrations 
“ye-movement lace 
J g oe a preroki 
ained prior to cach period of dynamic track 


Scoring 


The raw tracking errors for 1-sec. intervals wer? 
summed, and an average value was obtained for eac 
static and dynamic trial. 

Measures of nyst 


7 or of 
agmus included the number 
agmic beats 


and the amount of slow phase EE 
Sp'acement occurring in three 5-sec. intervals dure 
each dynamic tracking trial; one sampling interva 
was 32-37 sec. from the start, one was 131-136 Ai 
from the Start, and the other Was 10-15 sec. from the 
end of each trig The sampling intervals were chose? 
to include maximum output of nystagmus in a sing? 


haces " . . pu 
direction near the beginning, middle, and end al ee 
test period. Mean v. 


alues in degrees per secon 
eats per second were 


agent 
Sie calculated and used to repres? 
eye-movement activity, 


Resutrs 


The following average values of alcohol 


(ethanol) were obtained from the alcohol grouP 
for the pre, 1-, =- 4, 8-, and 10-hr. bloc 
samples, respectively: 0% 074%, 073 7 
Ei 001%, and 0%. (Note that the lowe? 
value specified in state motor vehicle statute? 
Or presumption that a driver is under the a 
fluence of liquor js 080% [Erwin, 1970 Y 
Control group Ss yielded no evidence 
ethanol in their samples, r 
Means and standard deviations for the s10“ 
Phase and frequency Measures of nystagm" 
and for the tracking error in both static ane 
dynamic Conditions appear in Table 1. Chane? 
m performance Cross sessions are shown 
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TABLE 1 


MEANS AND S 


SDARD DEVIATIONS BY SESSION FOR TRACKING ERROR (ARBITRARY Units), SLOW PHASE 


NYSTAGMUS (DEGREES PER SECOND), AND FREQUENCY OF NYSTAGMUS (BEATS PER SEC) 


| Si Session 
Measure Condition Group | oe 
| Pre | 1hr. 2hr. 4hr. 8hr. 10 hr. 
|. | 
| | 
Performance x 

Tracking crror Static Control | X 5.43 5.21 4.74 4.51 4.77 4.73 
SD 1.71 -86 B-ri .59 Al 53 

Static Alcohol | X 5.29 5.92 5.86 5.78 4.74 4.84 

| SD 1.46 1.15 1.01 1.28 1.74 62 

Dynamic Control x 6.73 5.44 5.06 4.88 4.91 4.50 

SD 1.86 65 65 52 Sl 5 

Dynamic Alcohol x 5.97 7.51 6.85 6.02 4.70 4.89 

SD 1.14 .82 1.13 8S 1,32 4 

Nystagmus = y E 

Slow phase Dynamic Control x 5.42 3. 3.29 3.06 2.35 3.10 
| SD 2.83 2: 1.81 2.24 1.37 1.54 

Frequency Dynamic Control x 1.40 1. 1.57 1.42 1.16 1.24 
SD -91 i -86 94 -15 43 

Slow phase Dynamic Alcohol x 3.85 12. 9.16 6.01 2.82 2.82 
SD 3.10 5 4.60 4.54 | 2.31 3.18 

Frequency Dynamic Alcohol x 1.53 3. 3.31 2.51 1.29 1.19 
SD 1.10 1, 1.09 | 1.16 83 .72 


centages of increase or decrease in tracking 
error based on the predrinking level. Plots 
of the nystagmus measures across sessions are 
in Figure 2. 


Control Group 

The control group evidenced only a slight 
decline in static tracking error (an expected 
improvement with practice) from the pre 
through the 10-hr. session (see Fig. 1). None 
of the changes was statistically significant. 
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However, dynamic tracking error evidenced a 
fairly steady decline from the pre through the 
10-hr. testing sessions; all 1- to 10-hr. error 
scores were significantly (p < .05-.01) below 
the pre scores. Frequency measures of ny- 
stagmus for control Ss showed no significant 
change across the six sessions, nor was there 
any significant difference in slow phase ocular 
velocity across the last five sessions. However, 
all but the last session of the latter were signifi- 
cantly below (p < .05-.01) the pretest level 
of slow phase activity (see Fig. 2). 
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Alcohol Group 


In contrast to control Ss, both the average 
static and dynamic tracking errors increased 
for the alcohol group at the 1-, 2-, and 4-hr, 
testing sessions (see Fig. 1), However, only 
the increase for the 1-hr. static session ($ 
< .05) and for the 1- and 2-ht. dynamic 
sessions (p < .05-.01) were significantly above 
the respective pre values, Measur 


es of ny- 
stagmus for alcohol Ss 


also presented a totally 
different picture from that of the control group 
(see Fig. 2), Both the degrees per second and 
the beats per second measures increased 
significantly (p < -05-.001) from the pretests 
through the 1-, 2-, and 4-hr. tests. At the 8- and 
10-hr. tests, nystagmus was below the pre 
levels, but not significantly so, 


Comparison of the Control and Alcohol Groups 


In comparing the two g 
conducted on “change” 
that is, the difference in s 
and the 1-hr. sessio 
sessions, etc, 

Static tracking 
control and alcohol Sroups were significant 
(p < .05) only for the 4 i 


roups, ¢ tests were 
(or difference) scores, 
cores between 


the pre 
ns, the pre 


and the 2-hr, 


differences between 


the two groups were 
at the 1-, 2-, and 4-hr, Sessions, Thys 
addition of motion, the 
formed with significan 


» With the 


Sroup per- 


tly more Errors than the 


ing 

control group during the first 4 hr. following 
alcohol ingestion, 

With respect to 

group had signific 

(p< .001) and 


nystagmus, the control 
antly less slow phase — 
frequency of NRE 
(p < .01-.001) than did the alcohol group e 
the 1-, 2-, and 4-hr, sessions and less slow Be 
nystagmus (p< .05) for the 8-hr. sessio ä 
Thus, the alcohol Ss were less able mm 
control Ss to Suppress their eye movements PY 


= k s z angular 
fixation on the visual display during angula 
acceleration, 


Discussion 


EIEE, 
Although alcohol effected an increase 


tracking errors during static performance er 
the increase was significantly above pee 
ing levels only during the 1-hr, sessio 
Moreover, differences between the alcohol oo 
Control groups in static tracking error ll 
significant only during the 4-hr, session wh 


sain anê: 

the effects of alcohol were beginning to W the 
However, during Vestibular stimulation, © 
eye-hand coordinati: 


on required by the trace 
task showed marked impairment by aa 
for the 1-, 2-, and 4-hr. sessions in comp? ng 
both with the alcohol group’s predrin ent 
performance and with the steady improRe” ign 
demonstrated by the control group. The y 
of this impairment appears to be et 
Correlated with the increased nystas ing 
activity to angular accelerations ee Í 
alcohol ingestion, Thus, degradation of ® 
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hand coordination during stimulation of the 
semicircular canals appears to be closely 
related to the alcohol-induced loss of the ability 
to maintain adequate visual fixation (and 
visual acuity) on an object and thereby in- 
hibits nystagmus. A similar degradation of 
Visual acuity and tracking performance with- 
out alcohol has been reported previously 
(Gilson, Guedry, & Benson, 1970; Guedry, 
1968). However, the magnitude of vestibular 
stimuli for commensurate losses was neces- 
sarily greater in those studies since, without 
alcohol, the visual fixation mechanism sup- 
pressing nystagmus was functioning normally. 

These data have several practical impli- 
cations. Activities that show little or no 
impairment in static situations following 
alcohol ingestion may be seriously degraded 
during motion, Further, the task required of 
the present Ss was a relatively simple one, 
that is, S could concentrate on the single 
stimulus display. However, in many work 
activities, such as piloting an aircraft, the 
attention of the pilot has to shift from one 
stimulus “display” to another. It has been 
shown that deleterious effects of alcohol on 
performance in static situations are greatest 
when time-sharing of attention across several 
tasks is required (Chiles & Jennings, 1970) 
or if the task requires “divided attention” 
(Moskowitz & DePry, 1968). The addition 
of motion to a complex time-sharing task 
where performance is already degraded by 
alcohol might be especially hazardous. As such, 
in those situations which require the identifi- 
cation of unsafe levels of alcohol (e.g. in 
automobile driving) the basis for determining 
significant amounts of impairment should take 
into account the total array of stimuli that will 
impinge upon the individual. 

As a final point, it should be noted that the 
average blood alcohol levels obtained in this 
study were considerably below most legal 
definitions of when an individual is presumed 
to be under the influence of intoxicating liquor 
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(Erwin, 1970). Only three Ss exceeded a blood 
alcohol level of .090% during any of the blood- 
sampling periods. 


* Motor vehicle laws in 23 states and the District of 
Columbia cite a blood alcohol level of .150°% or more 
as a presumptive legal index; 21 states use .100°%. 
Utah O80" as presumptive, and several other 
states are considering reducing their current levels to 
080. Five states have no defined levels, 
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University of Pennsylvania 


hi rpose of this study was to test the hypothesis that intensity and organization 

; p i . T . The 

3 shes activity is a function of S’s expectation of a task’s duration. The 

deciel BEUNUY of the following muscles was measured while S was holding a 
€ y 


19-Ib. weight under expectation for short and long 
-lb. weig 


dials, brachialis, and flexor carpi uln 
less electrical activity in all muscles 


It seems reasonable to suppose that fore- 
knowledge of the duration of a task affects 
one’s approach to it. For example, if S knows 
that he is to perform a task over a relatively 
long duration, he is likely to regulate his 
activity in such a way as to conserve some of 
his energy for later portions of the task. This 
can be manifested in a lower level of output 
per unit of time, or in a more economical or 
efficient expenditure of energy in the per- 
formance of the activity. In simple static work, 
for example, where S must keep a weight from 
dropping by flexing his forearm over a long 
period of time, expenditure of energy can be 
measured by means of electromyography, with 
electrodes attached to the skin above the 
relevant muscles. 

There is a linear relation 
mechanical force of the contracti 
and its electrical activity (L 
Thus, the electromyographic 
electrical changes that accom 
traction of the muscles can 


between the 
on of a muscle 
ippold, 1952), 
records of the 
pany the con- 
be used as an 
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tasks: biceps brachii, brachiora- 


aris. For the intervals measured, there was 
during the long task. 
muscle showed the largest relative decrease. The results were in 
of superior organization of activity under expectation of a t 
work over expectation of a short duration of work. 


The less appropriate 
nterpreted as evidence 
ask of long duration of 


indicator of the level of muscle activity. 1 
there is regulation of activity for the sake S 
energy conservation, it can be hypothesize 
that the level of the electrical activity of the 
muscles involved in static work will be lower 
for comparable periods of work if S knows oF 
expects that he has to work for a long duration 
than if he expects to work for a short duration 
Further, the decrease of electrical activity 
should be differential according to the degree 
of appropriateness of the various muse 
this type of task: namely, the decrease, wae 
a set for a long duration, should be greater in 
less appropriate muscles than in more appro 
Priate muscles (Wishner, 1955, 1962). a 
The task of static work selected for this 
experiment involved activity of the brachialis, 
the biceps brachii, and the brachioradialis, 
which are Primarily concerned with flexion O 
the forearm (Goss, 1966; Romanes, 1966). 
Within the Muscle system, the brachialis 15 
the “workhorse”? among the flexor muscles 
ing 

at the elbow, It is active especially durin’ 
maintenance of Specific flexed positions of the 
elbow, that is, isometric contraction in ® 
Positions of the forearm, This is not the case 
with the other two flexors of the foreat™ 
(Basmajian, 1967). With a large load, hor 
ever, all three muscles are active dur? K 
isometric flexion of the forearm (Buja® 
Pavlina, Vidaček, & Vodanović, 1967). á 
The other Muscles of the arm are also carl 
during isometric flexion of the forearm unde’ 
a large load, for example, the flexor carp 
ulnaris (Bujas et al., 1967), which, although *” 
1s the Primary flexor of the wrist, is on 
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ancillary in flexion of the forearm (Romanes, 
1966). 

On the basis of these considerations, the 
specific hypotheses of this study are: (a) When 
S expects a task of long duration, the electrical 
activity of all the muscles is less than the 
electrical activity of these muscles during a 
comparable period of the same task when S 
expects to work for a short period; (b) The 
muscle action potentials of the flexor carpi 
ulnaris is reduced by a greater amount, 
relatively, in the long task than the electrical 
activity of the three other muscles. 


METHOD 
Subjects 


The Ss were 10 males, students and instructors at 
the University of Pennsylvania, ranging in age from 
approximately 20 to 45 yr. 


Apparatus and Recordings 


The overall structure of the apparatus in S’s room 
is shown in Figure 1. The S was seated in a modified 
dental chair. The tasks were performed on an apparatus 
consisting of a frame, which allows flexing of the forearm 
but prevents other arm movements. The frame was 
connected to a 19-lb. load through a pulley. The S's 
task was to keep the weight above the floor and main- 
tain his forearm at approximately a 90° angle at the 
elbow. At the front of S’s seat was a light bulb that 


went on when his forearm was 


at right angles to the 


upper arm. When the forearm was in any other position, 
the light went out, and S had to correct his position. 

Recordings of integrated muscle action potentials 
during the tasks were made on a Grass Model 7 poly- 
graph from surface electrodes, silver discs, 10 mm. in 
diameter, on the following four muscles of S's right 
arm: biceps brachii, brachialis, brachioradialis, and 
flexor carpi ulnaris. The distance between each pair of 
electrodes was 1 in. The skin was appropriately pre- 
pared so that the resistance between pairs of electrodes 
was low (2,000-5,000 ohms) and similar. The raw 
signals were monitored on an osciloscope. The inte- 
grators were reset every 5 sec. 


Procedure 


There were four practice sessions on 4 days in the 
week preceding the experiment proper. The purpose of 
the practice sessions was to familiarize Ss with the 
general conditions of the experiment and with the 
specific nature of the tasks. On each day of practice, 
each S had two sessions of static work: one for 30 sec., 
and one for as long as he could manage. 

During the experiment proper, each S had a total of 
four static work sessions over 2 days. There were two 
sessions per day: one in the morning, the other in the 
afternoon. The order of the tasks was ABBA (A = 30- 
sec. task, B = work until subjective tolerance limit) 
for half the Ss, and BAAB for the other half. The 
interval between the two sessions was 4 hr. 

At the beginning of the fourth day of practice, S’s 
arm was prepared for recording of muscle action po- 
tentials during the task. The positions of electrodes 
were marked so they could be used during the experi- 
mental sessions the next day. At the beginning of the 
first session of the experiment proper, the electrodes 
were placed in the same position as during the last 


Fic. 1. The 


S, with electromyographic electrodes attached, 
holding the weight by means of a pulley. 
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practice session. A short period of calibration ensued, 
after which S began the first task. When the session 
was over, the electrodes were left on S’s arm, pro- 
tected by an elastic bandage until the next session the 
same day. After the second session, the electrodes were 
removed, but their positions were marked. 

The next day’s procedure was the same as the first, 
except that the order of conditions was reversed. 


Instructions 


In the short duration task, S was told that he would 
have to keep the weight aloft for 30 sec. The apparatus 
was explained, and he was instructed to prepare for 
the task. Five sec. later, the task was begun. The S 
was told to keep the light bulb on by keeping his elbow 
at the appropriate angle. If the bulb went out, he was 
instructed to correct his position slowly. He was to 
maintain this position until Æ signalled the end of the 
30-sec. period. 

In the long task, S was told to hold the weight aloft 
as long as he could. He was warned that the task was 
difficult and resulted in somewhat painful muscle 
fatigue. He was implored to try his best, to hold the 
weight as long as he could in the prescribed position. 


RESULTS 


The Ss had no difficulty in holding the 
weight in the prescribed position for the first 
30 sec. The bulb did go out once in each of 
6 sessions (of a total of 40 sessions) : four times 
in the short task, for a total of 5 sec., and twice 
in the long task, for a total of 2,9 Be The Ss 
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corrected their position smoothly, without 
significant effects on the recordings. 

Figure 2 shows the mean integrated elec- 
trical activity of the muscles during the first 
30-sec. period of static work. Except for the 
first 5-sec. period, the electrical activity of the 
muscles is similar from one 5-sec. period to the 
other. At the same time, the electrical activity 
of the muscles during the first 30 sec. of static 
work in the long task is lower in all muscles 
than the electrical activity of those muscles 
during the short task. : 

It can be seen that the electrical activity 
of the biceps brachii, and the brachioradialis 
registered by surface electrodes are higher than 
the electrical activity of the brachialis and the 
flexor carpi ulnaris. It is unreasonable, how- 
ever, to regard this difference in the records as 
reflecting a real difference in the muscles’ 
electrical activity. Various factors can atten- 
uate the signals from the muscles to various 
extents, so that surface electrodes may give 2 
very distorted Picture of the underlying 
seme (Davis, 1959). For example, such 
Geen npn of the muscle, the position vi 
abed Pin ation to the vector of cem a 
voltages ae etc., can influence y? 
thess e and the precise values ; 

es are not easily measurable. 


flexor carpi ulnaris 


a r 
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ity > 
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It is improbable that the main flexor of the 
forearm, the brachialis, is actually at as low a 
level of electrical activity in relation to the 
other two flexors as seems indicated in Figure 2. 
It seems much more likely that the recording 
of the electrical activity of the brachialis is 
attenuated to a greater degree by virtue of its 
depth and a number of other factors than the 
electrical activity of the other two muscles. 

In this experiment we are more interested 
in the changes in electrical activity of the 
various muscles from one task to the other, 
and in the comparison of those changes, than 
in absolute values. The question is whether we 
can compare changes in electrical activity of 
various muscles, or, more specifically, whether 
the differences are distorted if they are derived 
from distorted values. Obviously, if the action 
potentials of one muscle are attenuated more 
than those of the other, then equal increases 
of electrical activity at the muscle will appear 
differentially smaller when they are picked up 
by surface electrodes. The attenuation of 
signals seems to be relatively large: less than 
.5 mm. distance between the fibre and its 
electrode reduces the muscle action potential 
by a factor of 10 (Buchthal, Guld, & Rosen- 
falck, 1954). Therefore, the absolute dif- 
ferences in signals picked up by surface elec- 
trodes give a distorted picture of changes in 
the muscle itself. Thus, the absolute differences 
cannot be used for the evaluation of changes 
in electrical activity of different muscles 
(Grossman & Weiner, 1966). 

On the other hand, it is possible to assess 
the percentage of change of a muscle’s elec- 
trical activity with a higher degree of validity. 
If the true electrical activity at the actual 
muscle site is decreased from one task to the 
other, it is probable that the percentage of the 
change produced at the recording site on the 
skin will be more comparable across muscles 
than changes in absolute values. Assuming that 
the major attenuating factors are multiplica- 
tive, as is indicated by previous research 
(Buchthal et al., 1954), this will hold even if 
the muscles being compared are at different 
depths. i 

In the following analysis, the integrated 
electrical activity of a muscle in the short task 
was taken as a base level and the integrated 
electrical activity of the same muscle during 


TABLE 


VALUES OF THE INTEGRATED ELECTRICAL Activity 
OF MUSCLES DURING THE First 30 SEC. OF THE LONG 
Task, EXPRESSED AS A PERCENTAGE OF THE 
INTEGRATED ELECTRICAL Activity 
DURING THE SHORT TASK 


achio. | Flexor 
S | Biceps | Brachialis | Brachio- carpi > | aw 
| | alis | ulnaris 
1| 944| 102.2 | 
2| 100.5 944 | 
3| 102.0 | 
4| 1172 
5| 802 
6] 7 
7 3 
s| 86. 
9 
10 
= | 
uM 


the first 30 sec. of work in the long task was 
taken as a percentage of the value of the 
electrical activity of the same S in the short 
task. These are the values shown in Table 1. 
They are the relative values of the electrical 
activity during the first 30 sec. of static work 
in the long task. 

Table 2 shows the mean relative values of the 
electrical activity during the first 30 sec. 
of the static work in the long task, T values 
of the Wilcoxon matched-pairs signed-ranks 
test, and associated p values. 

The relative values of electrical activity of 
the main flexors of the forearm (the brachialis, 
the brachioradialis, and the biceps brachii) 
during the long task are similar, but only the 
decrease in the electrical activity of brachio- 
radialis is statistically significant. With a one- 


TABLE 2 


MEANS OF RELATIVE VALUES OF INTEGRATED 
ELECTRICAL AACTIVITY OF MUSCLES DURING 
THE First 30 SEC. oF THE Lone Task, T 
VALUES OF THE WILCOXON MATCHED- 
Pairs SIGNED-RANKS TEST, AND 
ASSOCIATED p VALUES 


ifade Relative 7 $ 

value 
Biceps brachii | 948 | 18 | >.05 
Brachialis | 93.4 10 <.10 >.05 
Brachioradialis 93.5 3 =.01 
Flexor carpi ulnaris 84.0 1 <.01 
M | O14 | =.02 
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tailed test, the relative value of se ae 
activity of the brachialis is also signilica 

i Us). es i 
ih a to the biceps brachii, Ta 
be noted that variability is greatly ‘liability 
It can be seen from the ange re ee 
that this variability does not oe oe 
main, underlying individual ar ta 
biceps activity. Test-retest reliability te he 
electrical activity of biceps brachii is given " 
re= .50 and .55(p > .05) for the long an 
ee tasks, respectively.* The reliabilities of 
the other muscles are higher: for brachialis, 
ra = .90 and .87; for brachioradialis, t= -94 
and .78; and for flexor carpi ulnaris, 7, = .87 
and .84 (p < 01 in all cases), for long and 
short tasks, respectively. The lability of the 
activity of the biceps is related to the vari- 
ability in the activity of the other flexors 
during the same task, since their activity must 
be accommodated to the task of producing the 
same force during flexion of the forearm. f 

The largest relative decrease of electrical 
activity in the long task is shown in the flexor 
carpi ulnaris (p < .01). Although the absolute 
decrease of recorded electrical activity of the 
flexor carpi ulnaris is small (see Fig. 2), it 
should be noted that its activity in the short 
task, which is taken as the base level, is quite 
low to begin with. Floor effects operate against 
a substantial absolute decrease in the long 
task, and the decrease recorded thus takes on 
added significance, as represented in the rela- 
tive, or percentage, decrease. The mean de- 
crease in relative values of the electrical 
activity of all the muscles taken together is 
also statistically significant. 

These results show that even if there 
large individual difference 
between the muscles, the 
reduction of electrical acti 
30 sec. of work in the lon 


the electrical activity during a 30-sec, task 
can be described as follows: there is a reduction 


of electrical activity in all four muscles, but 
the most inappropriate muscle, flexor carpi 
ulnaris, shows the greatest reduction. At the 
same time, biceps barchii shows the greatest 


are 
s in the interplay 
general trend in the 
vity during the first 
g task in relation to 


‘This finding may be interpreted as a warning 
against using muscle action potentials of the biceps as 
a general index of activity in the flexor muscle group 
of the forearm. 


i ical activity from 
variability of changes in clectrical activity Í 
one condition to the other. 


DISCUSSION 


The results of this experiment Show Saige 
ficant reduction in the electrical aa n 
the muscles of an S who knows or expects t ei 
he has to work for a long period of time, 
compared to the electrical activity of gar 
muscles of an S who knows or expects that 
has to work for a short period of time. ig 

This decrease in the electrical activity of te 
muscles implies a decrease in the force of iD 
muscles’ contractions, if the assumption ot ¢ 
linear relationship between the force of ied 
traction and the electrical activity” a 
accompanies it is valid. However, it is difficu A 
to give a quantitative statement of the amoun A 
of the reduction in the force of the muscles 
contractions, especially since the slope of the 
regression of muscle action potentials upos 
force exerted is different for the same muscle 
among different Ss (deVrics, 1968). Similarly 
the slope of the regression of muscle agho 
potentials upon force of contraction is differen’ 
for different muscles (Lenman, 1959). Never 
theless, the decrease of the electrical activit 
of the muscles in this experiment can be used 
as a general indicator of a true decrease in the 
activity of the muscles themselves, especially 
since it was possible to compare the electric 
activity of the same muscle of each S in bot 
conditions, that is, each S was his own contro” 
Further, the electrodes were in identical pos! 
tions in both conditions of the experime?” 
during a single day, and simila” 


in very 
positions from one d 


ay to another, and the 
apparatus was so constructed as to assure t he 
the position of the limb and body were t? 
same for all measurements, f 
It may seem paradoxical that there is di 
ferential force of contraction of muscles 
identical loads. This becomes possible wher, 
is realized that several muscles are act 
during an ison 


z 3 earn 
3 netric contraction of the fore? this 
Some of them are 


it 
ve 


oa n more appropriate for par 
task, anatomically and physiologically, * is 
others. The 


Bs most appropriate muscle 
brachialis. At the same time, the brachiora®” y- 
and the biceps help the brachialis to coMP 


sate for the external force of the load. re 


Some of the other muscles of the ar™ ` 


SS. A 
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active during static work. For example, flexor 
carpi ulnaris, which is the flexor of the wrist 
and only assists in flexing the forearm, shows 
considerable activity during static work, as 
does the main extensor of the forearm, the 
triceps. It is possible that at the beginning of 
a long static task, the activity of the muscles 
is more organized so that less appropriate and 
antagonistic muscles are not as active as during 
a short period of static work. In a long dura- 
tion task, the main flexors of the forearm do 
not compensate for the force of the contraction 
of the antagonists as much as during a short 
task. At the same time, in the long task, 
reducing the activity of the less appropriate 
muscles, which produce little useful force 
relative to the level of their activity, makes the 
performance of the task more efficient. 

In sum, then, it appears that the static work 
task used in this experiment is a viable test of 
the hypothesis of conservation of effort and 
energy as a function of a set for differing dura- 
tions of work. At the same time, it is essential 
to keep in mind that there was a set of back- 
ground conditions that maintained S’s motiva- 
tion for conservation. Thus, during the practice 
sessions, Ss experienced the differences in 
difficulty and in effort required between the 
tasks. In the long duration task, there was dis- 
comfort and pain. Most Ss were challenged and 
obviously put a great deal of effort into holding 
the weight as long as they could. 

It is also possible that conservation of the 
activity of the muscles under the set for a 
long period of work is related to more intense 
motivation that could result in some dynamo- 
genic changes in the vegetative nervous system. 
With higher motivation, the adjustment of 
the organism to the increased working require- 
ments of the muscles may be improved, and 
the muscles may be able to perform the same 
work more economically (Bujas & Petz, 1952). 

A related study is that of Walster and 
Aronson (1967) on the effect of expectancy of 
a task’s duration on the experience of fatigue. 
They found that those Ss who expected to 
continue a series of fatiguing tasks reported 
less fatigue than Ss who were led to believe that 
their chore was almost at an end. The authors 
believe that Ss suppress feelings of fatigue 
until their task is virtually completed. Our 
results show that expecting a long task gives 
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rise to economy of effort and energy from the 
very beginning of the task. Thus, there would 
be not only suppression of the feeling of fatigue, 
as shown in verbal reports, but also, if an 
objective index can be found, an actually 
lower level of fatigue for any given time, as a 
function of expectancy for a long duration of 
work. Indeed, it should be possible in future 
experiments to explore the possibility that the 
developing patterns of muscle activity are 
related to reports of fatigue. 
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While price and brand image have both been found to be BES runner ay ronner 
quality perception, the potency of these two cues has never been directly am : 
Moreover, those studies which found price to be a determiner of perceived quality 
manipulated only price information, without permitting actual composition 
characteristics to vary across brands. A 2X2X2x3 factorial experiment, 
using 136 adult male beer drinkers, and four test beers, examined the effects of 


price, composition differences, and brand image cues on the 
id . e 

quality. Price was found to serve as an indicant of 

only cue available but not when embedded in 


a stronger effect upon quality 
positive images. In addition, it v 
significant effects on perceived q 
istics were allowed to vary be 


and aroma) cues alone. 


As the literature on perceived risk (cf. Cox, 
1967) clearly demonstrates, consumer purchase 
decisions are frequently made under conditions 
of varying uncertainty regarding the product 
and its attributes. To reduce such uncertainty, 
consumers seek and process information re- 
garding the product and generally attempt to 
form accurate impressions of it. Given that 
products may be viewed “as an arra 
the “consumer’s task in evaluating 
is to use cues from the 
making judgments abo 
1962, p. 413),” 

One impression usuall 
portance to the consumer is the product’s 
(or brand’s) quality, Cues relevant to forming 
Impressions of quality include (a) price; (b) 
product composition characteristics such as 
taste, aroma, color, style, and size; (c) packag- 
ing; (d) brand, manufacturer (i.e., corporate), 
and store image; (e) advertising; (f) word-of- 
mouth reports; and (g) past 
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perception of beer 
product quality when it was the 
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perception, particularly for brands with strong 
vas found that neither price nor brand n 
juality except when product composition cl 
tween product samples. | 
findings, the data suggest that beer drinkers possess 
distinguish among different brands of be 


ame had 
haracter- 
«ast, in contrast to earlier 
at least some ability to 


laste 


ence. Of these cues, greatest attention thus 
far has been directed toward price. shape 
(1968) noted that “price is a powerful piece 0 
information for the consumer. ... Since price 
is concrete and measurable, the consumer views 
it with much confidence. He trusts it more than 
most cues concerned with quality (p. 20, 22). 
The first conceptual treatment of the paos 
quality relationship is usually credited ye 
Scitovsky (1967), and subsequent empirica’ 
Investigations (Gabor & Granger, 1966; 
Leavitt, 1954; McConnell, 1968a, 1068b; Tulb 
Boring, & Gonsior, 1964) have found that, 
with other cues held constant, price is indec 
a determiner of perceived quality. Illustrative 
of this approach is McConnell ’s work. Labeling 
different bottles of the same brand of heer 
M, L, P, and associating different prices with 
these labels ($1.30, $1,20, and 90¢ per ae 
pack, respectively), he was able to demonstrat? 
that the high-priced brand was perceived 
be of higher quality than the medium-price® 
brand (p = 06). There was no difference " 
perceived quality, however, between 
medium- and low-priced brands. it 
As Enis and Stafford (1969) noted, “- + + 
Seems fairly clear that if the only informal 
consumers are given about product quality, 7, 
price information, they will indeed JU ce 
quality by price (p. 340).” Combining P 
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and store image cues in one experiment, these 
Investigators still found price to strongly affect 
quality perception (Stafford & Enis, 1969), 

It is noteworthy that nowhere, in all those 
studies that placed Ss in choice situations in- 
volving real products, have actual differences in 
product composition been allowed to vary. 
Investigators have given Ss identical samples 
of beer, or carpet swatches cut from the same 
roll, and simply attributed different labels, 
prices, and images to them. In contrast, and 
in consonance with real-world situations where 
product differences can and do vary, the pres- 
ent investigation allowed actual physical dif- 
ferences to vary, while also manipulating price 
and brand image cues. Both product composi- 
tion differences and price may be considered 
objective attributes of the product, while 
brand image may be defined as the subjective, 
emotional cluster of meaning and symbols 
that the consumer attributes to particular 
brands. Usually brand names are sufficient to 
evoke brand images. 

Several considerations suggested beer as the 
product to be studied. First, a replication of 
McConnell’s (1968a, 1968b) often cited find- 
ings, in which price was the only cue to vary, 
was considered desirable. A second purpose was 
to examine whether beer drinkers could dis- 
criminate among different brands on the basis 
of taste and aroma cues alone, given a range 
encompassing substantial differences in product 
composition. Earlier work had indicated that 
consumers are unable to distinguish among 
different brands of beer in “blind” taste tests 
(Allison & Uhl, 1964). Reexamination — of 
similar data from cigarettes and cola studies, 
however, yielded evidence to suggest some 
ability to discriminate among various brands in 
these product classes (Myers & Reynolds, 
1967, p. 18). Therefore, examination for similar 
abilities with respect to beer seemed worth- 


while. i ~ 
With price absent, and given limited (or 


in-) ability to discriminate among different 
beers on the basis of taste and aroma cues, 
what does determine perception of product 
quality? Several investigators (Cox, 1962, 
Froman, 1953; White, 1966) have suggested 
that nonobjective criteria, especially various 
types of images (¢.g., brand, store, corporate, 
Product, and salesman), are frequently used by 
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consumers to evaluate alternative brands. 
Empirical support was provided by a second 
part of the Allison and Uhl (1964) study in 
which Ss received properly labeled bottles of 
beer and were again asked to make the same 
judgments regarding several product attributes 
including overall quality. The results of this 
manipulation clearly indicated that 


labels, and their associations, did influence their evalua- 
tions. In other words, product distinctions or differ- 
ences, in the minds of the participants, arose primarily 
through their receptiveness to the various firms’ 
marketing efforts rather than through perceived physi- 
cal product differences (p. 39). 


Given that price and brand image have each 
been demonstrated to affect perception of beer 
quality, the present experiment compared the 
relative efficacy of these two cues. 

In summary, the general purpose of this 
study was to examine the unique and inter- 
action effects of three cues—price, composition 
differences, and brand image—on_ perception 
of brand quality. Specific foci of interest were: 
(a) to determine whether the price-quality 
relationship could be replicated in a more 
complex, multicue environment; (b) to ex- 
amine the relative efficacy of price and brand 
image as determinants of perceived quality ; 
and (c) to determine whether beer drinkers are 
able to discriminate among different brands of 
beer on the basis of only taste and aroma cues, 
given substantially different beer samples. 


METHOD 

Subjects 

The Ss, 136 male volunteer students who answered 
an advertisement in the campus newspaper requesting 
participants for a “beer taste-testing study,” were all 
over 21 yr. old, and each stated that he was a frequent 
beer drinker and had consumed at least 12 oz. of beer 
during the preceding week. Potential Ss called a 
telephone answering service in response to the ad, and 
those that met the age and usage criteria were randomly 
assigned to one of the eight experimental groups. 


Design 

A 2X2X2X 3 factorial experiment with 17 Ss 
per cell was conducted which consisted of two levels of 
price (present, absent), brand name (present, absent), 
differences across beer samples 
(present, absent) as between-S factors, and three beer 
samples as within-S factors. The dependent variable of 
interest was perceived overall quality, and repeated 
measures were taken over three beer samples for each S. 


and composition 
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nstruments 


Each S received a questionnaire booklet that ee 
elevant information concerning his -— ig aad 
iabits and familiarity with various bran 2 o! ke y 
-xplained and administered the appropriate ag 3 
ations, and celles five dependeat Puchi 

ions of aroma, bitterness, Si " i 
orel quality) for each of the three beer ante 
The first four dependant a inclu 

isguise the purpose of the experiment, a 
e EA PaE 0-100-point scale eom E 
equally spaced, numbered gramon, a avie 
endpoints labeled “worst beer and “best beer” for each 
dependent variable. Written and verbal instructions 
directed Ss to evaluate the beer samples served to them 
in relation to the best and worst beers they had ever 
tasted. Evaluations were recorded by placing three 
arrows at appropriate points on each scale and labeling 
each with the name or letter of the beer sample being 
rated. 


Procedure 


Each of the eight cells was run separately. The Ss 
were seated at a long table and separated by table-top 
partitions to cut off between-S communications. A 
“credible-appearing” source in his early 40s, who served 
as E, introduced the experiment as a marketing study 
funded by a national brewer for the purpose of develop- 
ing reliable instruments to measure beer taste per- 
ceptions. The Ss were told that college students were 
large consumers of beer, and therefore the manufacturer 
was very much interested in their responses. 

The Ss were then given the questionnaire booklet 
and completed the first two pages regarding their beer- 
drinking habits and general familiarity with beer brands. 
Upon completion, Æ set up the appropriate manipula- 
tions and, using a blackboard, demonstrated how to 
use the graphic rating scales. 

Four different brands of beer were used. A nationally 
distributed ultrapremium beer (costing $1.60 per warm 
six-pack), a nationally distributed popular-priced beer 
(costing $1.35), and a regionally distributed inexpensive 
beer (costing 99¢) were used for the 68 Ss in the four 
“composition differences present” cells. These beers 
were labeled Y, X, and Z, respectively, in the “brand 
name absent” cells, The three brands were selected from 
among 20 different brands on the basis of a pretest that 


indicated that a comparable group of Ss could, relativ 
to other beers, accurately 


T estimate the actual cost per 
warm six-pack of these brands, with minimal variance 
in these estimates across Ss, These selection criteria 
had two purposes: (a) to minimize differential cross-S 
familiarity with res; 


n pect to the beers, a variable likely 
to influence preference ratings (cf. Zajonc, 1968); and 
; 


(b) to cover as broad a range A . 
5 ge of dom 
C a be nestic beers (in 


‘and image, and | ii 
differences) as possible, Another P aecompaition 


) nationally distrit 
popular-priced beer, whose price was ales si no 
accurately estimated, was us Ses 


ed for all three } a = 
presented to the 68 Ss in the four E a 


ences absent” cells (see Table 1), 


For those cells in which neither Price nor brand 


names were provided, Ss were simply told they hod 
evaluating three beers generally available in the ec 
area and here designated X, Y, and Z. When ue 
was given, the beers were listed in the auent 
booklet with their actual store prices (per six-pac 
appearing next to them. The same procedures were use z 
when brand name (used to evoke brand image) B 
present except that, instead of X, Y, and Z, the actua. 
brand names of the beer samples were provided. 

For the four cells in which the same brand of beer 
was used for all three beer samples, directions E 
identical to those given when the beer samples difiere: i 
That is, Ss in those cells were also led to believe, a 
they were tasting three brands of beer readily availab 
in the local area. ; a Bunt 

Beer samples were given to each S in plastic bagr 
colored cups that tended to disguise color cues. a 
appropriate, the cups were marked either with bran 
names or letters, and the accompanying instruction® 
(in the experimental booklet) did or did not presen 
information. The Ss were told to sample the beers 1n 
any order but to complete each dependent gene 
rating scale for all three beers before proceeding tot he 
next scale, for example, complete the aroma ratings 
for all three beers, then proceed to the bitterness scale, 
etc. Because of university regulations, 2 oz. of ae 
were served in each cup, making a total of 6 oz. pet 2- 

Precautions were taken to insure that the beer sam- 
ples were quite cold and had little foam. The thre 
samples for each § were poured at the same time (ow 
of sight of Ss) and immediately taken to him. Com 
ments elicited at the conclusion of the experiment an 
dicated that the state each sample arrived in was co™ 
parable to normal drinking conditions. 


RESULTS 


Two general assumptions should be met fot 
the repeated-measures analysis of variance 
model to hold: (a) homogeneity of the variance- 
covariance matrixes and (b) symmetry of st 
pooled variance-covariance matrix (see Wine!) 
1962, pp. 369-374). Tests conducted for both 
these assumptions were not statistically signifi- 
cant (x2 = 51.80, df = 42, and X = 5.69 
df =7, respectively, both ps > 10), pr 
therefore the model was assumed appropriat© 
for these data. ä 

Table 1 presents the means and standa" 
deviations of perceived quality ratings o 
each experimental cell. Simple inspectio? ie 
these data reveals at least two items © a 
terest. First, there was substantially great? 
variability of interbeer sample quality ae 
when composition differences were per, 
than when the samples were of the same bet e 
In fact, the range of the overall beer wanes 
means was twice as great when compos! 30 
differences were present (70,88-37.58 = 9" 
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TABLE 1 


BANS AND STANDARD DEVIATIONS OF OVERALL QUALITY RATINGS 


1 
Composition differences present Composition differences absent 
(samples of different beers) (samples of same popular-priced beer) 
7 — 7 | 
Condition i Beer Beer Beer | Beer 
Beer Beer | Y Z | Xx | ¥ 
| Z | X (ultra popular| (purported | (purported | (purported 
(regional (popular | priced regional popular ultra- 
inexpensive) priced) | premium) | inexpensive) | priced) premium) 
Brand name present 
Price present | | x 
Z% 33.47 65.71 79.82 | 50.76 | 64.18 76.18 
SD 17.82 23.65 13.21 19.49 16.47 17.96 
Price absent 
x 34.18 6488 | 714 | 50.12 56.88 71.06 
SD 17.23 15.81 | 15.37 | 19.99 23.33 13.65 
Brand name absent | | | 
Price present | 
X 34.47 66.12 | 67.65 55.35 | 54.65 68.41 
SD 18.08 17.90 22.84 21.08 | 21.79 21.25 
Price absent | | | 
X 48.18 55.12 | 64.65 | 51.47 | 68.53 | 57.18 
SD 22.54 22.99 | 1610 | 2166 | 2038 19.30 
Overall | | 
x 37.58 | 62.96 | 70.88 | 51.93 61.06 68.21 
SD 19.60 20.42 | 17.82 20.21 20.95 19.17 
Note.—n = 17 per cell. i 7 


vs. 68.21-51.93 = 16.28), with the regional in- 
expensive beer’s low ratings accounting for 
most of this difference. Second, the brand name 
(BrN) absent, price (P) present, compositional 
differences (CD) absent cell replicated Mc- 
Connell’s (1968a, 1968b) findings, that is, 
the high-priced brand was perceived to be of 
higher quality than the medium- and low- 
priced brands (¢ = 1.85, df = 16, p < .05 and 
t = 1.81, df = 16, p < .05, respectively), with 
no significant difference between the latter two. 
Results of a four-way ANOVA based on the 
data presented in Table 1 are summarized in 
Table 2. A strong beer samples (BS) main 
effect was obtained (F = 57.99, df = 2, 256, 
p < 0001), and a Newman-Keuls analysis 
indicated that all differences between pairs of 
means for the three beers (X, = 44.75, 
X. = 62.01, X, = 69.54) were significant be- 
yond the .01 level. These differences were all 
in the expected direction, with the ultrapre- 
mium beer (Y) receiving the highest mean 
quality rating and the regional inexpensive 
beer (Z) receiving the lowest mean rating. 


_ Two two-way interactions were significant 
in the overall ANOVA, CD X BS and BrN 


X BS, and are shown in Figure 1 (Parts a and 
b, respectively). 


TABLE 2 


ANALYSIS OF VARIANCE OF OVERALL Quarry 
RATINGS FOR COMPLETE DESIGN 


Source df MS F 
Between so 
rice (P) 1 378.55 
Brand name (BrN) 1 511.88 i $8 
Composition differences (CD) 1 1083.88 | 2.92 
EX BrN 1 285.00 <i 
P XCD 1 21.20 <1 
BrN X CD 1 06 <1 
P X BrN XCD 1 9.12 <i 
Error (between) 128 | 371.06 <1 
Wikis Ss i 
eer samples (BS) 2 | 21972.33 | 57.99%% 
P X BS ps 2 | ` 762.67 | “2.01 
Be Sue 2 2015.99 5.32%% 
S 2 3143.13 | 8.30** 
P X BrN X BS 2| 8790| <i 
E XED X BS 2| 756.71) 2.00 
™ XCD X BS 2 234.74) —" 
P XBrN X CD X BS 2| 1461.18) 3.80% 
Error (within) 256 378.88 3.86% 
* p <.025, 
“p Loi 


= > < 000 
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‘O— Composition Diff. present O—O Brand Name present 
30 £ ~AComposition Diff. absent Ar -^ Brand Name absent 


MEAN PERCEIVED OVERALL QUALITY 


x ¥ z x Y- 
Beer Sample Beer Sample 
POSITION DIFFERENCES BRAND NAME BY 
BY BEER SAMPLES BEER SAMPLES 
INTERACTION INTERACTION 


Fıc. 1. Two interactions with beer samples from overall 
analysis of variance. 


Analysis of variance of the si 

effects indicate that the effect of composition 
differences (see Part a, Figure 1) was signifi- 
cant only for Beer Z (F = 18.62, df = 1, 384, 
p < .01), and the effect of brand name (see 
Part b, Figure 1) was significant only for Beer 
Y (F = 9.30, df = 1, 384, p < 01). It appears 
that, for the three brands of beer used, pres- 
ence of the true compositional characteristics 
does not significantly affect one’s perception 
of their quality, except for the low-priced 
regional beer (Z), in which case presentation 
of the actual beer depressed the perceived 
quality ratings. One might say that Beer vA 
tastes worse than its image (or at least worse 
than the fourth beer used in CD absent 
conditions), Similarly, it appears that only the 
high-priced ultrapremium beer’s (Y) image (as 
mediated by brand name) was strong enough 
to significantly increase Perceptions of quality. 
As might be expected, 
a somewhat negative im 
of its brand name depr 
ratings, although not Significant extent 
(F = 2.48, df = 1, 384, p> -20). Bre 
had no significant effec 


The four-way interaction, 


mple main 


P X BEN x CD 
$< 025) and is 
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shown in Figure 2. The increase in i 
under CD present conditions is clearly a 
vious. To clarify interpretation, simpler leve 
of analysis were sought by analyzing four three- 
ray ANOVAs. i 
i ete the study into two three-way 
analyses—one in which CD are absent and ms 
in which they are present—one obtains sci 
sign similar to most of the previously eman 
quality perception research in the former m 3 
an extension of such studies by allowing tipor 
brand composition differences to vary as He 
do in a natural setting in the latter. Results a 
these two analyses are presented in Table 3. 
The only significant effect for CD absent 
was the BS main effect. All intermean differ- 
ences (X, = 51.93, %. = 61.06, X, = 68.21) 
Were significant beyond the .05 level using & 
Newman-Keuls analysis. A logical and par- 
simonious explanation for these differences 15 
that they are due to demand Ghiataoterist oa 
(cf. Orne, 1962) created by the exponiinenia 
procedures, Given that these Ss (under C 
absent) were told th 
three different beers 
of these 
mation (j 


at they were evaluating 
and that Ss in three out 
four cells received additional infor- 
-€ brand name and/or price) sup” 
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z x Y 


Beer Samples 


Beer Somples 
COMPOSITION COMPOSITION 
DIFFERENCES DIFFERENCES 
PRESENT ABSENT 
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TABLE 3 
ANALYSIS OF VARIANCE OF OVERALL Quanity Ratp 
OF Composition DIFFERENCES 


3S FOR Two LEVELS 


Composition differences absent Composition differences present 


Source | dj — — —|—— ; - 
| | MS F | MS F 
EE 
Between Ss | | 
Price (P) 1 | 280.46 | <1 110.29 <1 
Brand name (BrN) 1 261.57 <1 250.3 T | <= : 
PX BrN 1 189.04 <i 96.08 | < 
Error (between) | 64 | 369.48 372.64 | 
Within Ss we | xe * 
Beer samples (BS) 2 | 4527.66 11.13 | 20587.80 | 58.64"* 
Px BS 2 559.37 1.38 960.01 2.73 
BrN X BS | 2 946.51 2 1301.21 | ka a bige 
P X BrN X BS | 2 955.69 2.3 593.39 1.69 
Error (within) 128 406.65 


351.11 


* po .07, 
* Db < 03. 
wD < 0001, 


porting these cues, the average S te 
the quality of his three be 
consistent with these demand characteristics, 
In other words, he rated the supposed ultra- Although the P X BS interaction (at CD 
premium, high-priced beer (Y) relatively: high, present) approached but did not achieve a 
the popular-priced beer (X) lower, and the customary level of statistical significance 
regional inexpensive beer (Z) lowest. In con- (F = 2:73. df = 2, 128, P= OT). it was 
trast to earlier studies, however, price exerted examined further since it was of a priori im- 
no effect upon quality perception. portance. Subsequent ANOVA of the simple, 
For the ANOVA for CD present (see Table simple effects of price at the three levels of BS 
3), the main effect due to BS was again signif- failed to reveal any Statistically significant 
icant (F = 58.64, df = 2, 128, p< 0001), and effects (all ps > -20). In other words, knowl- 
l a Newman-Keuls analysis revealed that differ- edge of price (at this level of analysis) had no 
ences in mean quality ratings between Beers Z significant effect on perceived quality ratings 

and X, and Beers Z and Y were significant for any of the three real beers, 
A last set of three-way analysis that, with 


beyond the .01 level, with the difference be- 
| tween Beers X and Y significant beyond the its relevance to prior studies, seems to be most 
05 level (¥, = 37.57, X, = 62.96, ¥, = generalizable is for both levels of BrN (i.e., 
70.88). This main effect was much stronger BrN present and BrN absent). Results of 
in this instance than when CD was absent, these two ANOVAs are presented in Table 
indicating the amount of increased variability 4. A significant main effect of BS occurs under 
due to the Presence of actual composition both levels of BrN (both ps < 0001). As 
aste, carbonation, bitterness, might be expected, this main effect is stronger 


differences (e.g., t 
Cte.) between the beer samples above that due when the beer samples are identified by their 
and characteristics, .26, 


to the dem brand names (F = 60.83 vs. F= 12 
| The BrN X BS interaction was also signifi- df = 2, 128), as evidenced by the intermean 
cant (p < .03) under CD-present conditions. differences for BrN present (Ñ, = 49.13 
Again, only the simple, simple main effect of Ñ X, = 74.02) compared to ie 
the high-priced, ultrapre- i i tr “i 


Xx = 62.91, X 
brand name for for BrN absent (Aa = 47,31 Nx = 61.10 
was significant: (F = 4.26, X ~~ ‘hin 


mj ser j s is A 

u = ©) o pate As = 6447). Al mtermean differences within 

if = 1, 192, p< 03), indicating that brand — both sets are significant beyond the OL level 
a a À =f al x ve 


nded to rate image positively affected quality perceptions of 
er samples in a way this beer but had no significant effect on the 
other two beers 
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TABLE 4 


ANALYSIS OF VARIANCE OF OVERALL QUALITY RATINGS FOR 
= Two LEVELS OF BRAND NAME 


Brand name absent | Brand name present 
df i 
woe i MS F MS F 
— j Rai a snaz L 
Tan i 533.82 1.47 550. AS 
i ee differences (CD) A 5 fe a Èz 2 
PXC = 
Error (between) 64 362.70 7 
Mers 5 2 5581.78 12.26*** 18406.53 60.83 
ee 2 651.68 1.43 198.89 co 
Bene 2 1078.72 2.37 2299.15 7.60 
pee BS 2 2116.50 4.56" 101.39 <1 
— 2 128 445.16 | 302.60 
Er 
* 
++ 


i \ewman-Keuls analysis, except the 
r X-Y difference for BrN absent. 

A different perspective on the interesting 
phenomonon is reflected in the two analyses 
shown in Table 4. CD do not significantly 
(i.e. with a p < .05) interact with BS unless 
another cue (e.g, BrN or P) is present. The 
significant P X CD X BS interaction (p < .02) 
at BrN absent and the CD X BS interaction 
(p < .002) at BrN present both provide 
evidence to support this statement. To be 
more specific, the CD X BS interaction is not 
significant when BrN is absent (p = .095), 
but under BrN present conditions, the CD 
X BS interaction is significant beyond the .002 
level. Probing of this latter interaction reveals 


that the effect of CD is again significant only 
for Beer Z (F = 14.73, df = 1, 98, p< 01). 

Likewise, Probing of the three-way inter- 
action, P X CD X BS, reveals a significant 
nly when price cues 


CD X BS interaction o! 

are present (F = 5.04, df = 2, 64, p < 009). 
The CD X BS interaction when no cues were 
present (i.e, neither P nor Br 
significant (F = 2.01, df = 2, 64, pœ 14). 
Therefore, the results of this experiment in- 
dicate that composition differences have a 
significant effect on perceived quality ratings 
only when supported by another cue or, stated 
conversely, cues such as brand name have an 
effect only when product composition differ- 


N) was not 


ences are allowed to vary (e.g., see interaction 
under CD present, Table Sys 

A summary of the basic findings included 
the following: Composition differences jas: 
exert a significant effect upon quality percep” 
tion, but only for Beer Z, in which case a 
the “real” beer depressed quality ratings belon 
those obtained when using the “fake” Lip 
Similarly, brand image exerted a signi 
effect upon quality perception, but only od 
Beer Y, the most expensive and status-orient® 
beer. Price, except in isolation, did not al 
any significant effect upon quality percept? $, 
In other words, for the three brands of bee 
and the sample of Ss tested, brand name an 
actual composition characteristics were KS 
important determiners of perceived eat 
than was price. Last, there was substant 
greater variability in quality ratings uer 
composition differences were allowed to vake 
thereby suggesting that beer drinkers can a 
some distinctions among different brands a) 


. one . 3 mí 
the basis of composition (i.e., taste and are 
cues, 


Discussion J 
; i price 
In light of previous research on the P 


n A i strié 
quality relationship, one of the most st! 
aspects of the 


. -f e 
pe obtained results was it? 
es of the price cue (when combine? cep 


5 ea 3 a per 
€r cues) to significantly influence P 
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tion of product quality. While McConnell’s 
(1968a, 1968b) findings were replicated, it 
would appear that when price is presented in 
conjunction with other cues potentially indi- 
cative of product quality, it is used to a much 
lesser extent than when presented alone. 
It should be noted, however, that price was 
not manipulated as in earlier studies (i.e., by 
assigning different price levels to the same 
beer). Inasmuch as the purpose of the present 
research was primarily to determine the 
effect of price relative to other cues in a multi- 
cue setting, not to examine whether different 
Prices would affect quality perception, the 
appropriate manipulation was simply the 
Presence or absence of price. 

Comparing the relative strength of price and 
brand-image cues, the results indicate that, 
while the effect of price was not significant, 
awareness of the brand name did influence 
quality perception. A Priori, one would expect 
knowledge of the brand name to increase per- 
ception of quality for the ultrapremium beer 
and decrease it for the inexpensive regional 
beer, and this was precisely the trend observed 
(see Table 1), 

It would therefore seem as if brand image 
is a more important cue than price in deter- 
mining quality perception of beer. The price- 
brand name distinction is somewhat con- 
founded. However, especially for established 
products and knowledgeable Ss, knowing only 
that the brand name provides more informa- 
tion than simply knowing the price. Again, 
this is reflected by the data (see Table 1). 
The presence of brand name alone produced 
results more similar to the brand name-price 
Present condition than did the presence of 
price alone. This confound was virtually in- 
Sured by the pretest criteria used to select the 
experimental brands. 

The variables that affect quality perception 
Seem to manifest themselves primarily through 
interaction with other variables. Of the four 
variables considered, only one (beer samples) 
was strong enough to consistently affect quality 
Perception across a variety of situations. The 
effects of the other variables were moderated 
by the situation in which they were introduced. 
This tendency toward interactive effects may 
limit the number of variables that can be 


Studied in a single experiment, since higher 


577 


order interactions become exceedingly difficult 
to interpret. 

Considering the nature of these interactions 
more closely, one of the most interesting, and 
potentially important, results of the present 
study is the lack of Cue X Product Sample 
interactions when differences in brand com- 
Position were xof present, that is, when the 
samples were identical. Significant Cue X 
Product Sample interactions occurred only 
when actual composition differences existed 
between the three samples, that is, when they 
were really different beers. 

If this rather strong result is reliable, it has 
important implications for future research, 
Since past experimental investigations of 
quality perception have manipulated other 
cues while holding actual differences between 
the samples constant (i.e., nonexistent), they 
may be using an oversimplified representation 
of reality and, therefore, missing important 
and complex relationships between the cues 
and the brands themselves. Consequently, 
relationships obtained under these conditions 
may have little or no generalizability to real 
world considerations. 

One of the more interesting findings was the 
evidence to indicate that beer drinkers are 
capable of discriminating quality differences 
in the expected direction, given only taste and 
aroma cues. If one assumes that, at least for 
the brands studied, differences in price and 
Image reflected actual differences in quality, 
then perceived quality corresponded to ob- 
Jective quality for these Ss, 

Although it did tend to confound some of the 
results, another interesting finding 
“letter effect” which evidenced itself in the 
price absent- brand name absent- composition 
differences absent cell. In some respects, this 
corresponds to Tucker’s (1964) investigation 
in which housewives selected one out of four 
loaves of bread, differentiated only by the 
letters H, L, M, and P, over 12 trials, Despite 
the fact that all four loaves were identical, 
were taken from a single oven on the day of 
delivery, and, except for the letter labels, were 
packaged identically, half the housewives in 
the study manifested brand loyal behavior 
(i-e., selected the same brand on three or more 
Successive trials) by the end of the experiment, 


Such findings may be explained by the fact 


was a 
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that people tend to use cues systematically 
even when the cues possess no real predictive 
utility whatsoever (Dudycha & Naylor, 1966). 
In other words, “if a decision maker is placed 
in a situation where none of the cues available 
to him is of any value, he will still tend to 
pick out and use some of them as if they did 
have value [Blum & Naylor, 1968, P- 459].” 
This would apply not only to such trivial cue 
differences as that which exist between differ- 
ent letters of the alphabet, but also to such 
strongly differentiated cues as price and brand 
image. The implication here is that consumers 
mav use cues to form impressions of product 
quality even when these cues may have little 
or no relationship to actual (i.e., objectively 
ascertainable) product quality. Some tentative 
support for this conjecture is provided by 
Green (1966), who found that a large propor- 
tion of consumers do not differentiate between 
relevant and irrelevant information. 

A possible limitation of the current investi- 
gation is the type of Ss used. It could be 
argued that the screening procedures insured 
the use of Ss who were relatively “expert” 
consumers of the product. This would not only 
restrict the generality of our results, but may 
in part explain Ss’ tendency not to use price 
as a quality indicator. Both Scitovsky (1967) 
and Shapiro (1968) have hypothesized that 
consumers lacking experience with the product 
are more prone than expert consumers to use 


price in forming impressions of product 
quality. 


Another possible limitation was the use of a 
fourth brand of beer in the composition- 
differences-absent cells. In retrospect, it might 
have been more appropriate to have randomly 
assigned each of the 68 Ss in these four cells 
to one of the three experimental beers, 

In conclusion, to the extent one can gen- 
eralize from this experiment, product, ‘nl 
sample, the major findings of this investigation 
were as follows: 

1. Price does exert an effect upon the per- 
ception of product quality when it is the only 
cue permitted to vary. : 

2. Price does not operate as an indicant of 
product quality when embedded in 


$ a multicue 
setting 
g 
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3. Brand image (as mediated by brand name) 
does affect perception of quality, especially 
for brands with strong positive images. 

4. Given substantial differences in product 
composition, and on the basis of taste and 
aroma cues alone, beer drinkers are able to 
discriminate quality differences in the ex- 
pected direction across different brands of 
beer. 

5. Consumers may use cues systematically 
even when these cues possess little or no pre- 
dictive power. 


Clearly, attempts should be made to re- 
plicate these findings with different types of 
Ss (e.g., nonexperts), incorporating different 
cues and treatment combinations (¢.g., using 
several beers, each at different price levels, of 
pouring ultrapremium beer into cups bearing 
a regional inexpensive beer brand name) and 
using different types of products. Given the 
importance of quality perception in consumer 
decision processes, such effort. seems wel 
justified. 


or example, results of a recent study indicated that 
xtent to which housewives perceive differences tO 
xist across the various br: 
class is strongly related 
the perceived qu: 
hood of strong 


ands comprising a produc 
to brand loyalty: the greate" 
ality differences, the greater the likeli- 
brand loyalty (Jacoby, 1970). 
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Indian Institute of Manage 


Seventy-two male Ss were run in an experi 


ment, Ahmedabad, India 


ment testing the hypotheses: (a) fitting 


names v a a ing brand names; 
d would produce greater brand awareness than nonfitting 
bran 


(b) brand awareness would be superior fo 


i venience 
r shopping goods than for convenie: 


goods; a an ogan describing T show 
d: nd (c) unqualified slogan d bing a particular product would 
goods; 


better brand awareness than a qualified 


slogan describing the same product, 


though the superiority (Z n: eCa vould 
a gl periori i rith respect to brand recall wou! 

hi ority of unqualified slogans wit a 
: d to dissipate over time. The results confirmed Hypotheses a and b, but failed 
ten 


to support Hypothesis c. 


; d names such as Aspro for 
h Ps ci aad Dalda for hydrogenated 
fe chile oil are household words. Similarly, 
bie might reasonably argue that Kleenex for 
paper towels and Levis for blue jeans are 
effective American vocabulary. It is a matter 
of common observation that some successful 
brand names tend to stand for the name of 
the product itself. Despite the fact that in a 
free market various other brands of the same 
product are sold, consumers are often not so 
aware of their existence. The near omni- 
presence of certain brands of consumer prod- 
ucts has thus facilitated effective monopoly 
and has reduced the economic concept of a 
perfect competition to the status of a myth. 
Although the paramount importance of brand 
awareness on consumer behavior has been 
long realized, empirical studies of brand 
awareness per se are very recent, 

Using three letter nonsense syllables of 
varying meaningfulness value as brand names, 
Kanungo and Dutta (1966) found that their 
Indian Ss, male and female, showed greater 

y meaningful brand names 
ames of low meaningfulness, 
found superior brand aware- 
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w- 
ness for high-utility products than tor 
utility products. The product-utility po 5 
operationalized in terms of assumed A rE 
a particular product for the indinde a 
cluded three different categories of oosa 
products: products used by males, by fema os 
and by both. Recent extensions — 
1968, 1969) using Canadian Ss Tepon 
fittingness or appropriateness of brand ae 
as a better predictor of differences in ere 
of brand-product association. Cultural di ne 
ences in brand recall were also noticed for 
three different Product-utility categories €! 
ployed in these experiments, = +0 

One purpose of the present study is 


ane- SS 
test the generality of the effect of fittingne 
of brand names on 


Indian Ss who not on 
cultural miliey but 
backgrounds, It is o; 
that most educate 
English along 
languages, 


brand awareness on 
ly belong to a ditata 
also to multiple lingus 
Í some importance to ? a 
d people in India pm 
with one or more sal 
Due to Centuries of the form? 
British rule many facets of Indian commerci 
Operations, including brand naming and i 
vertising, continue to be in English oe 
though India gained its independence fro 
England more than 23 yr. ago. her 
The specific hypothesis at stake is whet es 
brand awareness is better when brand ee 
are fitting then when they are amen 
An adequate test of this hypothesis a 
that Potentially fitting brand names sho o 
be selected from Ss? everyday associations nd 
the generic Products whose specific pring 
ught. A method for selec 


names are so es! 
brand names of varying degrees of fitting 


BRAND AWARENESS 


in terms of the above operational criterion 
was proposed by Kanungo (1968). ; 
A second purpose of the present study is 
to determine the comparative influence of 
two types of products, convenience goods 
and shopping goods, upon the strength of 
brand-product association. Convenience goods 
are “those consumers’ goods which the cus- 
tomer usually purchases frequently, immedi- 
ately, and with the minimum effort in com- 
parison and buying. . . . These articles are 
usually of small unit value . . . (Alexander, 
1960, p. 11].” Shopping goods are “those 
consumers’ goods which the customer in the 
process of selection and purchase character- 
istically compares on such bases as suitability, 
quality, price, and style [Alexander, 1960, 
p. 21].” Thus, it may be reasonable to specu- 
late that the advertisements of shopping goods 
would be relatively more salient or important 
than the advertisements of convenience goods 
for the buyer making his purchase decisions. 
While purchasing convenience goods, the 
buyer is presumably making relatively unim- 
portant decisions involving little risk, whereas 
in purchasing shopping goods, he is making 
more important decisions involving greater 
risks and, hence, attempts to be aware of all 
possible alternatives. It is therefore expected 
that brand awareness would be superior for 
shopping goods than for convenience goods. 
Finally, this article also examines the so- 
called delayed action effects of two types of 
slogans, unqualified versus qualified, on brand 
awareness. In the attitude change literature 
the former refers to one-sided communication 
and the latter to two-sided communication. 
Simply put, an unqualified persuasive com- 
munication contains arguments that only favor 
its conclusion, whereas a qualified communica- 
tion adds some secondary qualifications and 
reservations against the main argument. Ex- 
tending the postwar studies carried out by 
Hovland and his associates of the Yale Com- 
munication and Attitude Change Program 
(Hovland, Lumsdaine, & Sheffield, 1949; 
Hovland & Weiss, 1951) Papageorgis (1963) 
predicted that the persuasive impact of a 
qualified message would decay more slowly 
than that of an unqualified (one-sided) 
message. However, despite their differing de- 
cay rates the net impact may be the same due 


581 


to greater initial impact of the unqualified 
message even though it declines faster than 
the qualified message. Papageorgis further rea- 
soned that his predictions concerning opinion 
change would also apply to learning and re- 
tention of messages in the communication. 
This is because learning and recall of content 
are positively related to opinion change 
(McGuire, 1957). The results confirmed Papa- 
georgis’s prediction concerning retention of 
the contents of persuasive communication, but 
provided little support for his hypothesis 
regarding opinion change. In the context 
of advertisements through formal communica- 
tion systems such as radio and television 
commercials, pamphlets, and newspaper ad- 
vertisements, one encounters many Instances 
of qualified as well as unqualified slogans 
characterizing various products. For example: 
“Things go better with coke” can be classified 
under unqualified advertisements, whereas 
the commercial for the dog food, Alpo, which 
claims to be ‘‘all meat” and, hence, costlier 
than others, can be classified under qualified 
advertisements. The purpose of the present 
study is to investigate the relative impact of 
these two types of slogans on brand awareness. 
Thus, it may be tentatively hypothesized 
that products described by unqualified slogans 
would show greater brand awareness immedi- 
ately after their exposure as compared to 
products described by qualified slogans. How- 
ever, the net impact of the two types of 
slogans may be the same after a delayed time 
interval of appreciable length. 

In sum, the present experiment examines 
the effect of: (a) fitting versus nonfitting 
brand names, (b) shopping versus convenience 
goods, and (c) qualified versus unqualified 
slogans on immediate brand awareness and its 
persistence over time. 


METHOD 
Subjects 


The Ss were 80 adult males of the Indian Institute 
of Management at Ahmedabad, India. These Ss came 
from different parts of India and were participating in 
2-yr. educational program of business management, 
None had participated in any kind of psychological 
experiment before, Data for four Ss had to be discarded 
because they failed to complete the experiment. In 
order to equalize condition Ns, four more randomly 
selected Ss were eliminated leaving 72 for final analysis. 
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Materials 


Products. Two categories of consumer goods = 
chosen, convenience and shopping goods. a p 
ucts from each of these ectiat ears y us y 

and females were selected. 

E For each of the 16 poii ie 
brand names were prepared, fitting and S hitting. 
The procedure used for the selection of prana nesa 
was much the same as that employed b arunge 
(1968). Briefly, 16 adult males and 16 adult = ss 
of the Indian Institute of Management at Ahmeda bad, 
India, were asked to give within a — — ha 
many relevant associations as they could to each of t : 
16 products. For each product a separate sheet o 
paper was provided. The Ss were instructed not to 
give any existing brand names that they might be 
aware of as their associative responses. They were 
also asked to give all responses in English. All re- 
sponses for each product were found to be meaningful 
words.* The most frequent (occurring 15 times or 
more) and rarest (occurring only once) responses were 
then determined. From those response „words, four 
brand names, two fitting and two nonfitting, for each 
product were constructed. An additional 20 adults, 
10 males and 10 females, were given a list of 64 brand 
names (for 16 products) in random order without any 
mention of the product they might represent. These 
Ss were asked to rate each brand name on a 5-point 
scale with ends labeled “very meaningful” and “mean- 
ingless."”” The mean meaningfulness value (m’) for each 
brand name was calculated from these ratings following 
which one fitting and one nonfitting brand name of 
least different meaningfulness value for each product 
was selected. This procedure of brand name selection 
was followed in order to minimize the possible con- 
founding of meaningfulness with the fittingness variable 
examined in the present study. As far as possible, fitting 
and nonfitting names were designed to resemble highly 
in sound, form, or meaning a frequent and a rare re- 
sponse, respectively. 
Slogans. Slogans for cach of the 16 products were 
prepared in qualified and unqualified versions. An un- 
qualified slogan consisted of a short message favoring 
the use or purchase of a particular brand, the brand 


4 Interestingly enough, some responses were 
in American slang terms, such as, cute, spic-n-span, ete, 
This is understandable considering the Popularity of 
hollywood movies, American magazines, and paper- 
backs among Indian college students in general and 
students of this Institute in particular. Moreover, the 
students here seem to be somewhat more familiar with 
the American language because the Harvard Business 
School serves as the educational collaborator of the 
Institute, and virtually all members of its faculty ob- 
tained their higher degrees in the United States, 

ë Response words like “greasy” for hair-oil and 
“nuisance” for alarm clock, although frequent, were 
not considered in the construction of brand names for 
those products. The authors viewed it rather un- 
realistic and awkward to label any consumer product 
in obviously negative terms. 


given 


name being either fitting or nonfitting for the ep 
it advertised. A qualified slogan not only port" s 
an identical message contained in an patia 
slogan, but also some secondary mention of a quali S 
tion or reservation against the main message. Howev E 
any negative implications of qualify ing the messag? 
for the product it represented were minimized co! 5 
sistent with one’s general expectations that epee 
ments do not normally stress conditions under SA 
advertising appeals might not apply. The products, T 
brand names, their meaningfulness values, and the 
slogans are presented in Table 1 


Design 


For each of the 16 products, there were two dimareni 
brand names: fitting (F) and nonfitting (NF). Each 
of these brand names was presented either with an ve 
qualified (UQ) or with a qualified (Q) slogan. Thus, ie 
each product, four different advertisements. were Ph 
pared F-Q, F-UQ, NF-Q, and NF-UQ. Copies of pie 
of these 64 advertisements were printed in the manner 
suggested by Kanungo (1968). However, the Kanungo 
series of studies presented a picture of the appropriate 
product along with each advertisement, The presen 
study did not. 

Finally, 80 booklets were compiled. Each booklet 
contained one advertisement of each of the 16 products 
of two categories: convenience goods and shopping 
goods. All advertisements in a given booklet comprise’ 
one of the four kinds: F-Q, F-UQ, NF-Q, NF-UQ. The 
advertisements. of shopping goods preceded the ad- 
vertisements of convenience goods half of the time and 
followed them for the other half in a counterbalanced 
order. Additionally, the Sequence of the eight adver- 
lisements within each product category in each booklet 
Was randomized to eliminate order effect in presenta- 
tion. The name and address of a fictitious advertising 
agency appeared on the front page of cach booklet. 


Procedure 


The experiment was conducted during the regular 
classroom session, Fach S was given a separate booklet 
and was asked to rate each advertisement in the booklet 
on each of the three 7-point scales: pleasant-unpleasan's 
good-bad, and ambiguous-clear, The three bipolar 
scales were printed on the answer sheets and were 
labeled at the ends, The intermediate labels were 
numbered but not verbally defined. The order © 
Presentation of the three scales was randomized for 
cach answer sheet. For each scale the positions of polar 
adjectives were reversed half of the time. Finally, each 
S rated the appropriateness of each brand name on & 
6-point appropriateness scale, The Ss were told that 
the advertising agency intended to study certain 
‘Psychological aspects” of brand naming and adver 
Using in order to Serve its clients better, Hence, tP? 
advertiser wanted to determine consumer reaction 
these advertisements and brand names, A 

The Ss were given approximately 2 min. to evaluat? 
each advertisement, Immediately following the rating 
of all 16 advertisements, the booklet and the answ® 


Propucts, Source Worps, BRAND NAMES, THEIR MEANINGFULNE 


BRAND AWARENESS 


TABLE 1 
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AND SLOGANS 


Product name Brand name m | Slogan 
1. Soap | z x 
F Cleno (Clean and Cleanliness) 2.80 | Use Cleno [Purry] for cleaner and softer skin. 
NF Purry (Pure) 2.75 | [It's worth paying a little more.] 
2. Tea 
F Flavora (Flavor) 3.05 | Take Flavora [Treat] for delightful armoa and 
NF Treat (Treating) | 2.95 | richness. [Close lid to keep it that way.] 
3. Talcum powder | | n 
F Fragrant (Fragrance) | 3.40 | Use Fragrant [Economy] for extra freshness. 
NF Economy (Economical) 3.55 | CHowever, it doesn't guarantee a brighter 
complexion. ] 
4. Biscuits | 
F Crispy (Crisp) | 2.70 | Buy Crispy [Golden] for delicious taste. [But 
NE Golden (Golden) 50 | watch your weight.] 
5. Fountain pen | | 
F Smooth-flo (Smooth & ilow) 3.40 | Use Smooth-flo [Perma] for the best in writing 
NF Perma (Permanent) 2.30 [although it dries a little slower. ] 
6. Tooth paste 
F Smile Wel (Smile) 3.20 | With Smile-Wel [Tastee] remove bad breath. 
NF Tastee (Taste) 3.40 [Soft brush recommended. ] 
7. Headache pill 
F Pain Kil (Pain Killer) Take Pain-Kil [Hope] for fast relief from head- 
NF Hope (Hope) ache. [But for recurrent headaches consult 
| your physician. ] 
8. Hair oil 
F Welgrom (Groom) 2.85 | With Welgrom [Crixo] ensure healthy hair 
NF Fixo (Fix) 2.95 growth. [For best results apply on dry hair. ] 
9. Radio 
F Musica (Music) 3.10 | Buy Musica [Leisure] for clearer reception [even 
NF Leisure (Leisure) 315 if it consumes a little extra electricity. ] 
10. Fabrics 
F Kolory (Color) 1.80 | Buy Kolory [Thicrap] for the latest in textile 
NF Thicrap (Thick) | 1.70 technology. [But you need to iron it for a 
sharp look. 
icycle 
i 2a Rider (Ride) 3.45 | With Rider [Fun] it’s the performance that 
NE Fun (Fun) 3.35 counts. [You wouldn’t regret paying up to 4% 
more.) 
12 area) Delikate (Delicate) 2.95 | Get Delikate [Exclusive] and play an elegant 
NE Exclusive (Exclusive) 345 host. [It’s as fragile as rest of the China.] 
13. Blank | 
E i Caze (Cozy) 2.60 | Have Cozee [Feel] for a cozy winter [although 
NF Feel (Feeling) | 2.75 | wool and silk don’t feel the same. ] 
14. Electric fa 
E ai Kool (Cool) 3.50 | Go for Kool [Smarty] in a hot summer. [It’s 
F Smarty (Smart) | 3.90 | worth investing just a few rupees more. ] 
15. Footwear | r 
r Durable (Durable) 3.05 | Use Durable [Shiney] and walk at ease. [Apply 
NF Shiney (Shine) 3.10 cream polish for a sharper look.] 
16. Alarm clock š 
F Wek-up (Waking up) 2.95 | With Wek-up [Signale] you will never miss an 
NF Signale (Signal) 3.10 appointment [though it looks a little old- 


fashioned. ] 


Note.—Items 1-8 are convenience goods, and Items 9-16 are shopping goods. 
associative response word(s) from which the brand name 
qualified slogan conditions are bracketed. 


The word(s) next to each brand name is the 


nar s derived. Nonfitting brand names and additional information used for 
Abbreviations: F = fitting, 


NF = nonfitting, m’ = meaningfulness value. 
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sheets were collected, and S was given a recall sheet 
wherein all the 16 products were listed in alphabetical 
order. He was asked to recall as many brand names as 
possible and write them beside the name of the products 
with which they were associated. Two delayed recalls— 
after 24 hr. and after 7 days—of brand names were 
taken from the same Ss. The procedure followed during 
delayed recalls was similar to that followed during 
immediate recall. 


RESULTS AND DISCUSSION 


There were no significant differences among 
groups in their evaluative ratings of adver- 
tisements on the three semantic differential- 
type scales mentioned earlier. As a check on the 
fittingness manipulation, Ss rated each brand 
name on a 6-point appropriateness scale 
with ends labelled “1—very inappropriate” 
and “6—very appropriate.” Results indicated 
that the F brand names were rated as more 
appropriate (M = 3.96) than the NF brand 
names (M = 2.92, t = 2.03, p < .05).6 This 
suggests that the experimental manipulation 
of fittingness was effective. 

The recall scores for individual Ss in this 
experiment were exceedingly high as com- 
pared to the low-recall scores reported by 
Kanungo (1968). No S showed complete 
absence of recall in any experimental condition 


Effects of Fittingness, Types of SI 
Delayed Action Effects ae AE Soh td 


In order to determine the effects fitting. 
ness, slogans, and time adael ae a; 
awareness, a 2 X 2 X 3 analysis of variance 
Was run on the recall of brand names Th 
three classifications were: fitting and n i 
fitting brand names; unqualified and qualified 
slogans; immediate recall, recall after 
day, and recall after 7 days (see Table 2) ene 
_ The main effect of fittingness was highly 
significant (F = 31.69, p < .001). The seni 
undoubtedly Support the earlier findings th: i 
a fitting brand name is retained DEHER th fA 
nonfitting brand name. No other main effe A 
or interaction was significant, However the 
brand recall for advertisements with unq i i 
fied slogans (overall M = 12.53) was stint 
higher than those with qualified slogans = 
all M = 11.72, F = 3.26, p< 40). For sho 
immediate recall measure, brand AATE tor 


All p values are two-tailed, 
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TABLE 2 


ANALYSIS OF VARIANCE OF 
RECALL SCORES 


Source of variation | df 


| ws | F 
Fittingness (F) I i | 340.01 | 31.69%* 
Type of Slogan (Q) | 1 35.05 | 3.26* 
Time (T) | 2 | 1.56 AS 
FXQ 1 | 1837 1.71 
FXT 2 | 1.24 09 
QxT 2 1.05 12 
FXQXT | 2 6.16 ST 
Within cell (error) | 204 10.73 
Total | 215 | 
*p <.10. o E 
**p <.001. 


advertisements with unqualified slogans was 
higher (M = 12.64) than those with qualified 
slogans (M = 11.72), but this difference Was 
statistically insignificant, thus failing to con- 
firm the authors’ hypothesis that the im- 
mediate impact of unqualified slogans would 
be greater than that for qualified slogans. 

One of the striking findings in this expeti- 
ment is that there was very slight forgetting 
of brand names as a function of time. It ÍS 
Possible that previous recall tests might have 
aided memory during the subsequent tests: 
A more adequate test of delayed action effect 
La have been to measure brand recall afte 
Hehe | Unfortunately, for various reasons 
Kecume not available for such a ce 
each condition of pe io teeta a it 
fees cert e time variable or Pid 

es of products to be remembere¢ 


oi three subgroups and test for brand reca 
ot one subgroup at a time, 


Effect of Product Types 


E another three-way design types of goods: 
Sereias fee d the effect of time were 6% 
it ies xed factors. The analysis of va" 
two Iel, be reported in detail, but Cae 
ness and atic sources were significant : ge 
5.59 and types of goods, with Fs of 51.2 2” 
zai » respectively, The latter F (p < .05) "° 
S greater weighted mean recall of bt” 
names of shopping goods (M = 6.75) as © 
Pape to brand names of convenience 8° i 
( = 6.32). This small effect may have a 
Practical importance, but its statistical 9% 


q 
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nificance does testify to the high intraindi- 
vidual reliability. It supports our hypothesis 
that brand awareness for shopping goods 
exceeds that for convenience goods. 


CONCLUSION 


The results of this experiment suggest that 
to create better brand awareness, advertisers 
of a new product should choose fitting brand 
names for consumer goods. Second, the failure 
to establish clearly the differential roles of 
unqualified and qualified slogans on brand 
awareness calls for further research in this 
area. There may be at least two reasons why 
these data did not bear out the predictions 
concerning the differential role of slogans. 
First, it may be argued that the qualified 
versus unqualified slogan effect is confounded 
with long versus short slogan effect. A second 
criticism suggests that the present research 
did not control for implicit versus explicit 
qualifications used in the qualified slogans. 
It may be difficult to eliminate these con- 
founding variables, but some control for these 
kinds of confounding should be sought 
future studies. 

Finally, only males were used in this study, 
despite the fact that certain parameters were 
previously established on both males and 
females. This is simply because the experi- 


in 


ment was carried out in an institution where 
there are very few female students. Although 
it would be imprudent to generalize our results 
for females, we are encouraged by the fact 
that our results paralleled some of the findings 
of earlier work using both males and females. 
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CHARACTERISTICS OF NONRESPONDENTS AMONG WORKERS 


ARTIN J. GANNON,! JOSEPH C. NOTHERN, axb STEPHEN J. CARROLL, Jr. 
MARTIN J. GANNON, 


Behavioral Science Division, Department of Busi 


Given the fact that surveys frequently 


ness Administration, University of Maryland 


+ 5002. it i 
have response rates below 50 o it is 


i tant to know the characteristics of nonrespondents. Comparisons of npo 
mn nd nonrespondents in survey research have indicated that females anc 
ents a y 


those with higher educational levels have higher response 


rates. The present study 


confirmed these findings and also indicated that surveys of workers seem to be 
0) 


biased toward more stable, older, more 


checkers working in 14 stores of a supermarket chain. 


It is often necessary to conduct surveys of 
mployed persons. Surveys are common for 
letermining worker attitudes about various 
ob conditions, unionism, or a variety of social 
ind political issues such as integration or the 
regulation of industry. Such surveys are 
sponsored or conducted by firms, government 
institutions, or behavioral scientists investi- 
gating some aspect of employee behavior. Four 
methods are in common use, namely, t 
conduct the survey on the job, through 
to the worker’s home, through 
naire completed at the place 
or through a mail questionn: 
worker’s home. 


The first method, the completion of the 
survey on the job, is prob 
frequently because of the hig 
from the population or s 
tion chosen for study. H 
can be so disruptive of 
impossible or too time 
conducting the survey. Also, it is possible that 
individuals will be less candid in the 
situation than they would be 
In addition, the costs of lost p 


o 
a visit 
a mail question- 
of employment, 
lire sent to the 


ably used most 
her response rate 
ample of the popula- 
owever, this approach 
work that it might be 
consuming for those 


work 
in their homes. 
roduction while 


the survey is being conducted may be 


significant. 

Surveys involving visits t 
have been used primarily 
workers in general rather th 
in a particular organization have been the 
focus of the study. It is obviou 
considerations why this is the 
research indicates that 


© workers’ homes 
when attitudes of 
an those of workers 


s from cost 
‘he case. In addition, 
individuals have 


1 Requests for reprints should be sen 
J. Gannon, Behavioral Science Division 
f Business Administration, Univ. 
College Park, Maryland 20742, 


a 


t to Martin 
Department 
f Maryland, 
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effective employees. The Ss were 552 


tendency to give more socially acceptable 
answers in the presence of an interviewer Lrg 
they do on a questionnaire (Hyman, Cob - 
Fredman, Harts, & Stember, 1960). Mai 
Surveys conducted at the work place also 
involve the cost of lost production to the 
organization. The mail survey to the worker 5 
home has the advantage of not being disrup- 
tive of work activities and being less costly to 
the organization in terms of lost production 
but, similar to the work place mail survey, bes 
the disadvantage of the nonresponses from Y 
certain proportion of the group under smid: 
(For further discussions, sce Sudman, 195 
and Parten, 1950.) 

In the situations w 


Si = Of 
here it is necessary © 
advantageous to ¢ 


onduct mail surveys ee 
workers for any of the reasons previously 
Cited, it is desirable to know how the non- 
respondents will bias the returns, This in- 
formation obviously will assist the researche" 
in formulating any conclusions based on the 
data. To contribute to this end, the authors 
have obtained some information on Ei 
characteristics of nonrespondents in a mal 
Survey of workers in a Supermarket chain an 
these data are presented in this report. 


Mernop 
_ Survey questionnaires w 
in 14 food stores of 
LE, along with 
to the authors. 
identify workers’ 
the social climat 


2y means of a small ink mark. The 
nl 


tion on specific i 
that using non 


r 
«form 

Y summary data and not any ae ted 
ndividuals. Research has demons! ihe 


: + șa piä 
anonymous questionnaires does bi 
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TABLE i 
PREDICTORS OF RESPONSE RATES AMONG WORKERS 
] 
Respondent Nonrespondent | N | x£ 
Item o | % 

Education . 61 

Under 11 yr, 57.4 56 

11 & 12 yr. 62.9 291 

13 & 14 yr. 64.6 116 zal 

15 yr. & over 80.3 66 8.87 
Marital status 6 

Single 51.3 1 1} 

Married 64.7 392 1.30** 

Widowed, divorced, or separated 76.4 ə 11.3 
Age | 

Under 20 yr. 57.7 42.3 26 

20-24 yr, 66.9 33.1 133 

25-29 yr, 51.1 48.9 139 

30-49 yr. | 69.4 | 30.6 232} ma 

50 yr. and above | 52.4 | 47.6 21 | 14.75** 
Sex | | 

Male 56.8 | 43.2 | 102 

Female 66.0 34.0 352 4.56" 
Supervisory ranking 

Twentieth percentile & under 53.4 | 46.6 73 

Twenty-first percentile & over 68.0 | 32.0 391 5.83* 


Note.—N = 552; the N in each subtable does not tot: 
*p < 05. 


research results obtained (See Hyman et al., 1960; 
Metzner & Mann, 1952; and Yoder, 1962). Conse- 
quently, most surveys of workers rely on anonymous 
questionnaires. $ 

The number of questionnaires returned was 348 of 
the 552 sent for a response rate of 63%. The 204 non- 
respondents were compared to the respondents with 
respect to certain personal characteristics obtained from 
personnel records of the organization. Comparisons 
were made on the basis of years of education, length of 
service with the organization, marital status, sex, age, 
number of children, and supervisory rankings of 
effectiveness. The supervisory rankings were made by 
Cach store manager for the checkers in his store. These 
rankings were then converted to percentile scores so 
that a common effectiveness measure of the employees 
in the 14 stores could be used. Chi-square analysis was 
used to determine if any differences between the 
respondents and nonrespondents were statistically 
significant. In addition, each characteristic analyzed 
was Compared to the response rate when every other 
characteristic was held constant. This procedure was 
necessary because all of the characteristics were related 
to each other. 


RESULTS 


Nonrespondents differed from respondents 
on five of the seven characteristics analyzed. 
Table 1 presents the results of the analys 


al 552 due to incomplete information. 


The table lists only those characteristics for 
which there was a statistically significant dif- 
ference between the respondents 
respondents. Length of service 
children were not found to be predictive of a 
nonrespondent and are not included in the 
table. Number of children initially had a 
relationship to the response rate, but this dis- 
appeared when marital status was held con- 
stant, In a similar manner, as indicated earlier, 
the five characteristics listed in Table 1 were 
found to be predictive of response rates among 
at least some categories of the other character- 
istics in the table when they were held constant. 

As Table 1 indicates, a lower response rate is 
associated with lower levels of education, 
single status, male status, being younger or 
older than the middle-aged group (between 
30 and 49 vr. of age), and ranking in the bottom 
fifth of all employees as measured by super- 
visory ratings. These findings confirm pre- 
vious research (Ellis, Endo, & Armer, 1970; 
Goode & Hatt, 1952) that showed that 
females and those with higher educational 
levels have higher response rates, 


and non- 
and number of 
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In general the results of the analysis indi- 
cate that mail surveys of workers may be 
biased toward more stable, older, more effec- 
tive employees. In addition, typical attitudes 
of female workers, to the extent that they 
differ from those of male workers, may be 
weighted too heavily by such surveys. 
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EFFECT OF PERSONAL SIGNATURE ON THE INITIAL RATE OF 
RETURN OF A MAILED QUESTIONNAIRE * 


MARY B. KAWASH axp LAWREN 


= M. ALEAMONI? 


University of Illinois 


An attempt was made to test the traditional rule of thumb that a personal signature 
on the cover letter is a sine qua non for yielding higher rates of return of mailed 
questionnaires. Three thousand and ninety-one members of the University of 
Illinois faculty were sent questionnaires concerning their uses of and attitudes 
toward audiovisual instructional materials. One group (NV = 1,546) was sent a 
cover letter bearing the researcher's personal signature. A second group (N = 1,545) 
was sent a cover letter identical to the first group's except that the researcher's 
signature wasa mimeographed facsimile. No difference was found between the 
initial return rates for the two groups even when the rank of the faculty member 
was identified. It was concluded that, for the University of Illinois faculty, a per- 
sonal signature on the cover letter accompanying a questionnaire concerning audio- 
visual materials was no greater an inducement to respond than a cover letter 


SS 


bearing a mimeographed facsimile. 


For decades the mailed questionnaire has 
been considered one of the most economical 
tools of researchers in a variety of fields. Like 
many favored procedures, however, it does 
have an Achilles’ heel, nonresponse, that under- 
mines the foundation stone of statistical 
inference—the assumption of random selection 
of respondents from the target population. 
Nonrespondents have been observed to differ 
from respondents on a number of character- 
istics (Hochstim & Athanasopoulos, 1970; 
Mayer & Pratt, 1966-1967; Schwirian & 
Blaine, 1966-1967 ; Scott, 1961; Wallace, 1954). 
Such a state of affairs can result in distorted 
estimates of population parameters (Schwirian 
& Blaine, 1966-1967; Shuttleworth, 1941; 
Wallace, 1954). Of no mean significance is the 
difficulty of ascertaining the extent to which 
these differential characteristics and those 
which have not been measured interact with 
the specific relationships being studied, thus 
affecting the researcher’s acceptance or rejec- 
tion of his hypotheses (Abeles, Iscoe, & Brown, 
1954-1955; Robin, 1965; Schwirian & Blaine, 
1966-1967; Suchman, 1962). 

Procedures for securing replies from the non- 
respondents have been proposed, some of which 


1 The authors are indebted to Dennis Williams for 
modifying the cover letter for the questionnaire to 
obtain this data. 

? Requests for reprints should be sent to Lawrence 
M. Aleamoni, Measurement and Research Division, 
University of Illinois, 307 Engineering Hall, Urbana, 
Illinois 61801. 


have been highly successful (Bachrack & 
Scoble, 1967; Eckland, 1965; Hochstim & 
Athanasopoulos, 1970; Robin, 1965; Sirken, 
Pifer, & Brown, 1960). These attempts, how- 
ever, may be viewed as harassments and yield 
distorted information. It, therefore, indubita- 
bly behooves the researcher to construct his 
questionnaire in such a manner as to ensure 
the highest possible ivifial return rate. 

A great deal of research has been undertaken 
throughout the past 30 yr. to determine the 
effects of a number of variables on return rate. 
Some of the variables studied are: (a) postage 
on return envelopes (Champion & Sear, 1969; 
Gullahorn & Gullahorn, 1963; Mayer, 1946; 
Robinson & Agisim, 1951); (b) form of mailing 
(Gullahorn & Gullahorn, 1963; Russell, Kon- 
rad, & Kaluzny, 1970); (c) length of question- 
naire (Brown, 1965; Durant & Maas, 1956; 
Scott, 1961; Sletto, 1940); (d) appearance of 
questionnaire (Bender, 1957; Ford, 1968; 
Scott, 1961); (e) questionnaire rewards (Han- 
cock, 1940; Kephart & Bressler, 1958; Robin- 
son & Agisim, 1951); (f) guarantees of ano- 
nymity (Hoppe, 1952; Mason, Dressel, & 
Bain, 1961); and (g) characteristics of the 
cover letter, such as the sponsor (Filipello, 
Birg, & Webb, 1958; Roeher, 1963); appear- 
ance (Moore, 1941; Simon, 1967), and nature 
of appeal (Champion & Sear, 1969; Hammond, 
1959; Linsky, 1965; Sletto, 1940). These are 
the variables that have been most frequently 
assessed systematically. 
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It was with this latter category that the 
present study was concerned. Several authors 
(Frazier & Bird, 1958; Hammond, 1959; 
Longworth, 1953; Parten, 1950) have implied 
that the handwritten signature of the sender 
of the questionnaire is one of the accouter- 
ments requisite for obtaining a high rate of 
return. No one, however, at least as far as 
these authors have been able to ascertain, has 
restricted his research to the effect of this 
single cover letter variable on the initial rate 
of return. Linsky (1965), for example, designed 
a survey of members of a state nurses’ associa- 
tion in which he systematically varied three 
characteristics of the cover letter, one of which 
was the personalization of the salutation and 
signature. Initial returns to a cover letter with 
handwritten salutation and signature 
significantly greater than one containing a 
mimeographed “Dear Member” and mimeo- 
graphed signature. On the other hand, a study 
by Emmett (see Scott, 1961) of TV viewers 
yielded no significant difference between initial 
return rates to letters that contained personal 
salutations and signatures and those addressed 
“Dear Viewer” and bearing a facsimile signa- 
ture. Similarly, for Weilbacher and W. 
(1952), personalization of the greeting and 
signature yielded no increase in response rate 
over that for a “nonpersonalized” letter, 

. Only one study was discovered that varied 
signature only. Clausen and Ford (1947), dis- 
mayed at receiving a 23% return rate to a 


were 


alsh 


TABLE 1 
QUESTIONNAIRE RETURNS By TypE OF Cover 
LETTER SIGNATURE 


Type of signature 
| 

Returns Personal | Mimeo- m 
| graphed m 

| aa 

N|%lnl| a 

Total returns 441 | 28.52 | 420 27.1 
Completed 389 | 25.16 | 359 | 2323 i 
Uneompleteds | 52] 336 | 61 | 3o ‘2 
Total mailed 1546 1545 ` 


a Questionnaires in this group we: 
uncompleted but accompanied by 
they could not be filled out, 
completed but without an ex 
responses, 


hosi 


which were returned 


an explanation a 
as to 7 
All questionna es returned Wy 


Planation were counted as non. 
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letter employing a “Dear Veteran” salutation 
and a facsimile signature of General Omar 
Bradley (celebrated World War II leader), 
randomly assigned their nonrespondents to 
one of each of the following conditions: im- 
personal salutation-personal signature, im- 
personal salutation-facsimile signature, per- 
sonal salutation-facsimile signature, personal 
salutation-personal signature. Return rates for 
the second mailing did not differ significantly 
for the four groups. Clausen and Ford, how- 
ever, maintain that “one cannot conclude that 
Personalization of the initial letter would like- 
wise have made no difference in response rate. 
The mere fact of follow-up is in itself a per- 
sonalizing device [p. 501].” 

The present study was an attempt to clarify 
the nature of the effect of personalization, 


namely, that of the sender’s signature, on the 
initial return rate, 


METHOD 


mailed to all 3,091 
University of Illinois who ap- 

'’s mailing list. They were asked 
about their uses of, and feelings concerning, audiovisual 
instructional materials. The questionnaire contained 
52 Structured items with alternatives ranging from 2 
to 10. Seven of the 52 items allowed the faculty mem- 
ber the additional option of recording his own alterna- 
tives, Fifteen hundred and forty-six of the cover letters 
Sena iing the questionnaires were personally 
signed with the researcher’s name by the researcher 


ved at over a 4-day period. This method was 
sloppy. For ae keep the signature from becoming 
rast eS of the letters, the researcher's signa- 

as a mimeographed facsimile. The letters’ con- 


tents and salutati 0 ` ere 
een utations (Dear Faculty Member) were 


ResuLTS 


Table 1 presents the 
types of cover letter si 
than 28% of those 


return rates for the two 
: §natures, Slightly more 
: aculty members who re- 
ceived the personalized e en re- 
turned the questionnaire “completed or un- 
completed with an explanation, Of those 
ere sent the facsimile signature 

S, approximately 27% completed the 
questionnaire or returned it uncompleted with 
an explanation, Tests of significance for the 
ae pela Proportions were performed 
“tween the tw ag 
Sit af ena 

» 4nd uncompleted retu 


PERSONAL SIGNATURE AND QUESTIONNAIRE RETURN RATES 


TABLE 2 


QUESTIONNAIRE RETURNS BY TYPE OF COVER LETTER 
SIGNATURE AND RANK OF THE 
Facuity MEMBER 


‘Type of signature 
Mimeo- i 
Returns by rank Personal graphed 7 
| = n 
IN % N i 
| | 

Professor | 

Completed 132 | 33.938 | 124 | 34.54 | 18 

Uncompleted 7 | 13.46 10 | 16.39 | 43 

Mailed 477 | 30.85 | 476 | 30.81 02 
Associate professor | 

Completed 74) 19.02 70 .50 | 17 

Uncompleted 11 21.15 9 .89 

Mailed 233 | 15.07 | 263 1.48 
Assistant professor 

Completed 103 | 26.48 80 | 22.28 |1.33 

Uncompleted 7 | 13.46 T | 1148 | .32 

Mailed 304 | 19.66 | 269 | 17.41 | 1.61 
Instructor 

Completed 38 | 9.77 27 | 7.52 | 1.08 

Uncompleted 6 | 11.54 10 | 16.39 | .74 

Mailed 153 | 9.90 | 163 | 10.55 | .59 
Research professor 

Completed 1 -26 1 -28 | .05 

Uncompleted 1| 1.92 0| 00 | 1.09 

Mailed 7 45 3 19 |144 
Research associate | 

Completed 3 77 @) £05 | 1.40 

Uncompleted 4) 7.69 2) 3.28 | 3.29 

Mailed 52 | 3.36 76 | 4.92 | 2.23 
Graduate assistant 

Completed 0 .00 1 -28 | 1.08 

Uncompleted 0 00 1 1.64 | .93 

Mailed 6 .39 7) A5) .26 
Other | 

Completed 38 | 9.77 49 | 13.67 | 1.75 

Uncompleted 16 | 30.80 | 22 | 36.06 | .59 

Mailed 314 | 20.31 | 288 | 18.64 | 1.16 


® This percentage is based on the number of completed per- 
sonal signature returns (N = 389), 


No significant differences in initial return rates 
were found for any of these return Categories. 

Table 2 presents the mailed and return 
rates by the faculty member’s rank. Tests 
of significance on the number of mailed letters 
at each rank indicates that a significantly 
larger proportion of research associates re- 
ceived the mimeographed signature. When the 
same tests were run on the return rates, how- 


ever, only the uncompleted returns exhibited 
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significance differences, and that was in favor 
of the personal signature. 


DISCUSSION 


Contrary to traditional rules of thumb, the 
presence of the handwritten signature of the 
researcher had little effect on the initial rate 
of return of a mailed questionnaire even when 
the rank of the faculty member was identified. 
Whether or not one may generalize to other 
populations of respondents and ete 
to other types of questionnaires are additiona 
empirical questions. s 

The E difference observed in the 
number of mailed questionnaires to research 
associates is not surprising as no attempt was 
made to randomly mail by rank. Also, the 
significant difference for the uncompleted re- 
turns is probably an artifact of the small 
number in that category. 

One might speculate that the content of the 
letter was sufficiently personal to counteract 
the effect of the handwritten signature or that 
no single variable is very effective in inducing 
higher return rates. It may be that combina- 
tions of variables are more effective, such as 
the type of letterhead used and the prestige 
or authority of the signer, For example, if one 
wanted a higher rate of return from full pro- 
fessors, then the signature of a full professor 
may be needed. In any case, the small rates 
of return for both forms indicate that there 
are other variables, either singly or in combina- 


tion, that are more powerful inducements for 
informants to reply. 
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